Hindi Optical Character Recognition and Its Applications

Hindi Optical Character Recognition and Its Applications

Rashmi Gupta (AIACTR, India), Dipti Gupta (AIACTR, India), Megha Dua (AIACTR, India) and Manju Khari (AIACTR, India)
Copyright: © 2017 |Pages: 12
DOI: 10.4018/978-1-5225-2154-9.ch003
OnDemand PDF Download:
$30.00
List Price: $37.50

Abstract

Recognition is an important part in the computer vision. Optical character recognition is nowadays gaining its importance in terms of the digital and handwritten documents recognition. Devanagari is widely spoken script with more than 300 million people relying on it for their day-to-day activities, so recognition of Devanagari characters is gaining its importance in the recent times. Tasksin handwritten recognition handle the differences along with alteration of Hindi characters written in offline mode. Furthermore, Hindi character are written in different sizes shapes and orientation in contrast to hand writing usually written along a particular baseline in a horizontal direction. Handwritten and machine printed documents are needed to be recognized for the applications like bank Cheque processing, library automation, publication house, manuscripts, Granths and other forms and documents. In this paper an attempt has been made to shortlist the methods and processing techniques studied so far in the field of Devanagari character recognition. The performance analysis and the results for the various techniques are given in the chapter.
Chapter Preview
Top

Introduction

Hindi HCR is considered as one of the grave situation today in the world. Typed or handwritten Hindi characters are identified by a computer. Though it is not easy to recognize the Hindi hand written text (Garg, N. K. 2015). They are not recognized efficiently or more accurately by the computer or any optical character recognition machine. There have been a lot of researches in the same context and may different algorithms have been proposed for the recognition purpose along with the availability of the software in the market. For recognizing characters, there are a series of procedures and processes are done. A lone process or only machine is developed so far that can do the entire recognition itself.

The setback in this recognition is recognized by many researchers. The simplest technique is the template matching in which the templates of each word are already taken and preserved for checking the input image in terms of error and the difference between the two is thus calculated. There are techniques which works very fine for different fonts but owes a limitation of giving poor output in terms of the performance for the hand written characters. The next approach is the feature extraction which takes the statistical distribution of points and is then analyzed. Furthermore the properties are extracted which are orthogonal. Features are calculated for each and every symbol and then are stored in the database. Though this technique can be used for the handwritten characters but it is subjected to noise and thickness. We can also use geometric approach the feature extraction is quite precise and can be easily understood.

The ANN (artificial neural networks) has been so far found the best alternative for the recognition due to the ease in the design and the can be used easily worldwide. Hindi character recognition is a significant part of the modern world as it makes the jobs of the people easier than it was used to be. The complex problems can be solved with much accuracy and in lesser time. Even if there is a richness in the ongoing research still there is a certain need in the area of the research. With the increase in the use of the office automation it is an urgent need of the hour to provide practical and effective solutions. However there are many different factors such as topological, arithmetical, structural does not help in the identification process.. The main focus is on the handwritten or hand printed character recognition due to the differences in character shape and size.

Over the past few years there are many companies which are involved in the thorough research of the same which is rising repeatedly. However, handwritten text identification is not a new technique but still it is not popular among masses. The ultimate goal of designing is achieving 100% accuracy which is just an illusion as of now and it is certain that an advanced technology is however necessary to enable the people who cannot even read their own notes.

Complete Chapter List

Search this Book:
Reset