Skip to main content
Log in

A simple and efficient optical character recognition system for basic symbols in printed Kannada text

  • Published:
Sadhana Aims and scope Submit manuscript

Abstract

Optical Character Recognition (OCR) systems have been effectively developed for the recognition of printed characters of non-Indian languages. Efforts are on the way for the development of efficient OCR systems for Indian languages, especially for Kannada, a popular South Indian language. We present in this paper an OCR system developed for the recognition of basic characters (vowels and consonants) in printed Kannada text, which can handle different font sizes and font types. Hu’s invariant moments and Zernike moments that have been progressively used in pattern recognition are used in our system to extract the features of printed Kannada characters. Neural classifiers have been effectively used for the classification of characters based on moment features. An encouraging recognition rate of 96.8% has been obtained. The system methodology can be extended for the recognition of other south Indian languages, especially for Telugu.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ashwin T V, Sastry P S 2002 A font and size-independent OCR system for printed Kannada documents using support vector machines. Sādhanā 27: 35–58

    Google Scholar 

  • Chong Chee-Way, Raveendran P, Mukundan R 2003 A comparative analysis of algorithms for fast computation of Zernike moments. Pattern Rec. 36: 731–742

    Article  MATH  Google Scholar 

  • Girosi F, Poggio T 1990 Networks and the best approximation property. Bio. Cybernetics. 63: 169–176

    Article  MATH  MathSciNet  Google Scholar 

  • Gonzalez R C, Woods R E 1993 Digital image processing (Boston, MA, USA: Addison Wesley Longman Publishing Co. Inc.)

    Google Scholar 

  • Hu M-K 1962 Visual pattern recognition by moment invariants. IRE Trans. Inf. Theory. IT-8: 179–187

    Google Scholar 

  • Jawahar C V, Pavan Kumar, Ravi Kiran S S 2003 A Bilingual OCR for Hindi-Telugu documents and its applications. Proc. Seventh Int. Confer. on Document Anal. and Rec. 408–412

  • Khotanzad A 1998 Rotation invariant pattern recognition using Zernike moments. Proc. Int. Conf. on Pattern Rec. 326–328

  • Kunte Sanjeev R, Sudhaker Samuel R D 2006 A two-stage character segmentation scheme for Printed Kannada text. J. Graphics, Vision and Image Processing 6: 1–8

    Google Scholar 

  • Moody J, Darken C J 1989 Fast learning in network of locally-tuned processing units. J. Neural Comput. 1: 281–294

    Article  Google Scholar 

  • Mukundan R, Ong S H, Lee P A 2001 Image analysis by Tchebichef moments. IEEE Trans. Image Processing 10: 1357–1364

    Article  MATH  MathSciNet  Google Scholar 

  • Mohammed Al-Rawi, Yang Jie 2002 Practical fast computation of Zernike moments. J. Comput. Sci. and Technol. 17: 181–188

    Article  MATH  Google Scholar 

  • Nagabhushan P, Pai Radhika M 1999 Modified region decomposition method and optimal depth decision tree in the recognition of non-uniform sized characters — An experimentation with Kannada characters. Pattern Rec. Lett. 20: 1467–1475

    Article  Google Scholar 

  • Negi Atul, Chakravarthy Bhagavathi, Krishna B 2001 An OCR system for Telugu. Proc. Sixth Inter. Confer. on Document Anal. and Rec. 1110–1114

  • Park J, Wsandberg J 1991 Universal approximation using radial basis function neural networks. J. Neural Comput. 1: 246–257

    Article  Google Scholar 

  • Teague M R 1980 Image analysis via the general theory of moments. J. Optical Soc. Amer. 70: 920–930

    MathSciNet  Google Scholar 

  • VijayaKumar B, Ramakrishnan A G 2004 Radial basis function and sub-space approach for printed Kannada text recognition. Proc. IEEE ICASSP 2004 5: 321–324

    Google Scholar 

  • Zernike F 1934 Physica. 1: 689–704

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Sanjeev Kunte.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Sanjeev Kunte, R., Sudhaker Samuel, R.D. A simple and efficient optical character recognition system for basic symbols in printed Kannada text. Sadhana 32, 521–533 (2007). https://doi.org/10.1007/s12046-007-0039-1

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12046-007-0039-1

Keywords

Navigation