Open Access Open Access  Restricted Access Subscription Access
Open Access Open Access Open Access  Restricted Access Restricted Access Subscription Access

A Font and Size Independent Ocr for Machine Printed Gujarati Numerals


     

   Subscribe/Renew Journal


Character recognition is major research area since its inspiration. So, far very limited progress has been made in it, specifically for Indian languages. Recognition of Gujarati script is a less studied area and no significant attempt is made so far to recognize Gujarati glyphs. In this paper we have presented a simple yet robust solution for recognition of offline multi-font computer generated and machine printed Gujarati Numerals. Pursued by the pre-processing techniques, we used a method called correlation based template matching where a numeral is identified by analyzing its shape and comparing its features that distinguish each numeral. The system appears to be very robust against font variations and large shape variations.

Keywords

Template, Correlation, Segmentation, Normalization, Probability, etc
Subscription Login to verify subscription
User
Notifications
Font Size


  • R.K. Sinha, H.N. Mahabala, “Machine recognition of Devnagari script”, IEEE transactions on Systems, Man, and Cybermetics (1979) 435-441.
  • P. Rao and T. Ajitha, “Telugu script recognition”, In International Conference on Document Analysis and Recognition, pages 323–326, 1995.
  • Judith Hochberg, Lila Kerns, Patrick Kelly, and Timothy Thomas automatic script identification from images using cluster-based templates Proceedings of the Third International Conference on Document Analysis and Recognition (ICDAR '95), 1995 IEEE.
  • Hadar L. Avi Itzhak, Jan A. Van Mieghem, Leonardo Rub, “ Multiple Subclass pattern recognition: A maximin correlation approach” IEEE transaction on pattern analysis and machine intelligence VOL.17 No. 4, April-1995.
  • S. Antani, L. Agnihotri, “Gujarati character recognition”, fifth International Conference on Document Analysis and Recognition (ICDAR’99), 1999, pp. 418-421.
  • Negi, C. Bhagvati, and B. Krishna, ”An OCR System for Telugu” Proceedings of the Sixth International Conference on Document Analysis and Recognition (ICDAR’01) 0-7695-1263-1/01 $10.00 © 2001 IEEE
  • Faruq Al-Omari Hand-Written, “Indian numerals recognition system using template matching”, Proceedings of the ASC/IEEE International conference on computer systems and applications, 2001 IEEE.
  • C.V. Lakshmi, C. Patvardhan, “A multi-font OCR system for Telugu text”, Proceedings of the Language Engineering Conference (LEC’02) 2002 IEEE
  • C.V. Lakshmi, C. Patvardhan, “Optical character recognition of basic symbols in printed Telugu text”, IE(I) Journal-CP 84 (2003) 66-71.
  • Hung-Ming Sun Multi-Linguistic, “Optical Font Recognition Using Stroke Templates”, The 18th International Conference on Pattern Recognition (ICPR'06) 2006 IEEE
  • J. Dholkia, A. Yajnik, and A. Negi, “Wavelet Feature Based Confusion character sets for Gujarati script” International Conference on Computational Intelligence and Multimedia Applications 2007.
  • R.S. Kunte, R.D.S. Samuel, “A simple and efficient optical character” recognition system for basic symbols in printed Kannada text S¯adhan¯a Vol. 32, Part 5, October 2007, pp. 521–533. © Printed in India
  • Sang sung park, Won gyo Jung, Young geun shin, Dong-sik Jang, “Optical character recognition system using BP algorithm”, IJCSNS International journal of computer science and network security, VOL. 08 No. 12, December 2008.
  • M.C. Padma and P.A. Vijaya, “ Identification Of Telugu, Devanagari And English Scripts Using Discriminating Features”, International Journal of Computer science & Information Technology (IJCSIT), Vol 1, No 2, November 2009
  • K. Freedman, “A Cognitive Model of Character Recognition Using Support Vector Machines”, World Academy of Science, Engineering and Technology 58 2009

Abstract Views: 521

PDF Views: 2




  • A Font and Size Independent Ocr for Machine Printed Gujarati Numerals

Abstract Views: 521  |  PDF Views: 2

Authors

Abstract


Character recognition is major research area since its inspiration. So, far very limited progress has been made in it, specifically for Indian languages. Recognition of Gujarati script is a less studied area and no significant attempt is made so far to recognize Gujarati glyphs. In this paper we have presented a simple yet robust solution for recognition of offline multi-font computer generated and machine printed Gujarati Numerals. Pursued by the pre-processing techniques, we used a method called correlation based template matching where a numeral is identified by analyzing its shape and comparing its features that distinguish each numeral. The system appears to be very robust against font variations and large shape variations.

Keywords


Template, Correlation, Segmentation, Normalization, Probability, etc

References