
Devnagari Script Recognition: Techniques and Challenges

P. Mukherji, P.P. Rege

Source Title: Cross-Disciplinary Applications of Artificial Intelligence and Pattern Recognition: Advancing Technologies

DOI: 10.4018/978-1-61350-429-1.ch014


Abstract

Devnagari script is the most widely used script in India, and its Optical Character Recognition (OCR) poses many challenges. Handwritten script has many variations, and existing methods for handling them are discussed. The authors have also collected a database on which the techniques are tested. The techniques are based on structural methods as opposed to statistical methods. Special properties of Devnagari script, such as the topline, curves, and various types of connections, are exploited in the methods discussed in this chapter.

Chapter Preview


Background

Optical Character Recognition (OCR) is the study of teaching machines to observe the environment, learn to read characters, and make decisions. Character and pattern recognition are basic requirements in Artificial Intelligence, and a character falls within the general category of a pattern. Jain A. K., Duin R. P. W. & Mao J. (2000) define a pattern “as opposite to chaos; it is an entity and could be given a name”.

OCR Basic Principles

Handwritten or typed data is converted to digital form either by scanning the writing on paper or by writing with a special pen on an electronic surface, such as a digitizer combined with a liquid crystal display. The two approaches are known as off-line and on-line OCR, respectively (Plamondon R. & Srihari S. N., 2000).

Preprocessing is carried out before feature extraction and improves recognition efficiency. It includes noise removal, segmentation of machine-printed and handwritten characters, script identification, graphic and text segmentation, and any other technique that leads to improved recognition accuracy.

Feature-extraction-based methods extract a set of invariant features from the test pattern, and classification is then done in feature space.
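As a rough illustration of what an invariant feature can look like, the sketch below computes zoning densities over the bounding box of a binary character image, which makes the features independent of where the character sits in the frame. The function names, the 2x2 grid, and the choice of zoning itself are assumptions made for this example; the chapter preview does not say which features the authors use.

```python
def bounding_box(image):
    """Return (top, bottom, left, right) of the ink pixels in a binary image.

    `image` is a list of rows, each a list of 0 (background) or 1 (ink).
    """
    rows = [r for r, row in enumerate(image) if any(row)]
    cols = [c for c in range(len(image[0])) if any(row[c] for row in image)]
    if not rows:
        raise ValueError("image contains no ink pixels")
    return rows[0], rows[-1], cols[0], cols[-1]


def zoning_features(image, grid=2):
    """Ink density in each cell of a grid x grid partition of the bounding box.

    Cropping to the bounding box first is what makes the result
    translation invariant (an illustrative choice, not the authors' method).
    """
    top, bottom, left, right = bounding_box(image)
    height, width = bottom - top + 1, right - left + 1
    ink = [0] * (grid * grid)
    area = [0] * (grid * grid)
    for r in range(top, bottom + 1):
        for c in range(left, right + 1):
            zone = ((r - top) * grid // height) * grid + (c - left) * grid // width
            area[zone] += 1
            ink[zone] += image[r][c]
    # Zones that receive no pixels (box smaller than the grid) get density 0.
    return [i / a if a else 0.0 for i, a in zip(ink, area)]


# The same small shape placed at two different positions in a 4x4 frame.
shape_top_left = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
shape_bottom_right = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 0],
]
features_a = zoning_features(shape_top_left)
features_b = zoning_features(shape_bottom_right)
```

Both placements yield the same feature vector, so a classifier working in this feature space would treat them as the same pattern.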

Character classification can be achieved in two stages: coarse classification and fine classification. Coarse classification is accomplished by class set partitioning or dynamic character selection (Duda R. O., Hart P. E. & Stork D. G., 2001). A tree classifier (Gonzalez R. C. & Woods R. E., 2003) selectively examines the presence or absence of a certain feature at each node, thereby reducing the search.
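The two-stage idea can be sketched as follows, under stated assumptions: a small hand-built tree tests one binary feature per node to shortlist candidate classes (coarse stage), and a nearest-prototype comparison then picks among the shortlist (fine stage). The feature names (`vertical_bar`, `closed_loop`), the placeholder class labels, and the prototype vectors are all hypothetical and are not taken from the chapter.

```python
from dataclasses import dataclass
from typing import Union


@dataclass
class Node:
    """One test in the tree: is a given binary feature present?"""
    feature: str
    if_present: Union["Node", list]  # subtree, or a list of candidate labels
    if_absent: Union["Node", list]


def coarse_classify(node, features):
    """Walk the tree and return the shortlist of candidate class labels."""
    while isinstance(node, Node):
        present = features.get(node.feature, False)
        node = node.if_present if present else node.if_absent
    return node


def fine_classify(candidates, sample, prototypes):
    """Pick the candidate whose prototype vector is closest to the sample."""
    def squared_distance(label):
        return sum((a - b) ** 2 for a, b in zip(sample, prototypes[label]))
    return min(candidates, key=squared_distance)


# Hypothetical tree: each leaf is a partition of the full class set.
tree = Node(
    "vertical_bar",
    if_present=Node("closed_loop", if_present=["A", "B"], if_absent=["C"]),
    if_absent=["D", "E"],
)
# Hypothetical prototype feature vectors, one per class.
prototypes = {
    "A": [0.9, 0.1],
    "B": [0.2, 0.8],
    "C": [0.5, 0.5],
    "D": [0.1, 0.1],
    "E": [0.9, 0.9],
}

shortlist = coarse_classify(tree, {"vertical_bar": True, "closed_loop": True})
label = fine_classify(shortlist, [0.8, 0.2], prototypes)
```

Here the coarse stage narrows five classes to two, so the fine stage compares the sample against only two prototypes instead of five.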

The Devnagari Script

Devnagari script is the most widely used script in India. Just as Kanji is used in both the Japanese and Chinese languages, Devnagari is used in over forty languages, including Sanskrit, Hindi, and Marathi.

The basic character set of Devnagari script consists of 48 characters and is shown in the Shivaji 01 font in Figure 1(a). The character set of Devnagari script with 45 characters is shown in Figure 1(b).

Figure 1. Devnagari character set

Every individual word has a horizontal header line, the ‘shirorekha’. This line serves as a reference that divides a character into two distinct portions, Head and Body, when a top modifier is present. A Devnagari word may be divided into three zones: Zone 1 is the top-modifier region, Zone 2 is the body of the word, and Zone 3 is the lower-modifier region. Another feature is the inter-character gap within a word, which facilitates character segmentation and isolation.
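One common way to exploit the header line and the inter-character gaps is with projection profiles; the sketch below is a minimal version of that idea, not necessarily the authors' method. It assumes a clean binary word image, takes the row with the most ink as the shirorekha, and splits the region below it at empty columns. The function names and the `header_thickness` parameter are hypothetical, and the sketch does not separate Zone 1 or Zone 3 modifiers.

```python
def find_header_row(word):
    """Return the index of the row with the most ink pixels.

    `word` is a binary image (list of rows of 0/1). The densest row is
    assumed to be the shirorekha, which holds only for clean, unskewed input.
    """
    row_sums = [sum(row) for row in word]
    return row_sums.index(max(row_sums))


def segment_characters(word, header_thickness=1):
    """Return (start_col, end_col) spans of characters below the header line.

    The header joins all characters, so it is skipped before looking for
    empty columns, which mark the inter-character gaps.
    """
    header = find_header_row(word)
    body = word[header + header_thickness:]
    col_sums = [sum(row[c] for row in body) for c in range(len(word[0]))]

    segments, start = [], None
    for c, ink in enumerate(col_sums):
        if ink and start is None:
            start = c                        # a character begins
        elif not ink and start is not None:
            segments.append((start, c - 1))  # a gap ends the character
            start = None
    if start is not None:
        segments.append((start, len(col_sums) - 1))
    return segments


# Toy word: a full-width header row above two separate strokes.
word = [
    [1, 1, 1, 1, 1, 1, 1],
    [0, 1, 0, 0, 0, 1, 0],
    [1, 1, 0, 0, 1, 1, 0],
    [0, 1, 0, 0, 0, 1, 0],
]
header_row = find_header_row(word)
spans = segment_characters(word)
```

On the toy word the header is found in row 0 and the body splits into two column spans, one per stroke. Real handwriting would need skew correction and a tolerance for touching characters, which this sketch leaves out.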

Literature Survey and Related References of Existing Techniques for OCR

This section discusses existing feature extraction techniques for OCR of scripts used around the world, and of Devnagari in particular.
