Hershey, Pennsylvania

New York, New YorkBeijing, China

Special Offers
- Up to 50% off Thousands of Research Books
  From July 1st through October 31st, 2025, we are offering discounts of up to 50% across thousands of titles in Business & Management; Science, Technology, & Medicine; and Education & Social Sciences. Through this campaign, we’re committed to ensuring that our mutual library customers worldwide can continue to access high-quality, peer-reviewed content during these challenging times. If this campaign is successful, we will extend through the end of the year and beyond if there’s a benefit to all parties involved. When hosted on the InfoSci^® Platform, e-books feature no DRM, no additional cost for unlimited-user licensing, full-text PDF & HTML formats, and more. Discount is automatically added at checkout.
  Browse Titles
- IGI Global Scientific Publishing Launches International Brand Ambassador Program
  IGI Global Scientific Publishing has launched a new Ambassador Program, designed to empower research professionals to help spread scholarly resources and foster global research engagement. As a local, mid-sized publisher, this initiative offers IGI Global Scientific Publishing an exciting opportunity to expand its global presence in the academic community and foster meaningful connections among scholars around the world. With currently over 130 ambassadors worldwide, these scholarly experts are dedicated to supporting the publisher’s initiative of disseminating cutting-edge research.
  Learn More
- Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 20 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no hosting or maintenance fees, no additional cost for unlimited-user licensing, full-text PDF & HTML format, and more.
  Learn More
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education & Social Sciences
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all available IGI Global Scientific Publishing open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all available IGI Global Scientific Publishing open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through the IGI Global Scientific Publishing Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global Scientific Publishing to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open access endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global Scientific Publishing to publish your work under open access? Review the IGI Global Scientific Publishing open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Segmentation-Free Word Spotting in Handwritten Documents Using Scale Space Co-HoG Feature Descriptors

Prabhakar C. J. (Kuvempu University, India)

Source Title: Applications of Advanced Machine Intelligence in Computer Vision and Object Recognition: Emerging Research and Opportunities

DOI: 10.4018/978-1-7998-2736-8.ch009

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In this chapter, the author present a segmentation-free-based word spotting method for handwritten documents using Scale Space co-occurrence histograms of oriented gradients (Co-HOG) feature descriptor. The chapter begin with introduction to word spotting, its challenges, and applications. It is followed by review of the existing techniques for word spotting in handwritten documents. The literature survey reveals that segmentation-based word spotting methods usually need a layout analysis step for word segmentation, and any segmentation errors can affect the subsequent word representations and matching steps. Hence, in order to overcome the drawbacks of segmentation-based methods, the author proposed segmentation-free word spotting using Scale Space Co-HOG feature descriptor. The proposed method is evaluated using mean Average Precision (mAP) through experimentation conducted on popular datasets such as GW and IAM. The performance of the proposed method is compared with existing state-of-the-segmentation and segmentation-free methods, and there is a considerable increase in accuracy.

Chapter Preview

Top

Introduction

There is a huge amount of information in libraries and institutions all over the world in the form of books, documents and in other conventional methods. We need to be digitized in order to preserve and for efficient searching and browsing of information for different applications. In order to create digital libraries, thousands of digitized documents have to be transcribed (George, N, et al., 2006). Optical Character Recognition (OCR) is first used to transcribe documents where image-based documents are converted into ASCII format through automatic recognition. The automatic recognition by OCR system achieves best performance for modern high-quality printed documents with simple layouts and known fonts. The performance of OCR is very poor for handwritten text due to various challenges posed by handwritten text such as unconstrained writing styles, open vocabulary and paper degradation such as stains, ancient fonts, and faded ink.

To overcome the aforementioned limitations of OCR, the Document Image Analysis (DIA) community has developed a technique called as word spotting. Word spotting is a technique for recognition and retrieval of words in any form of document images. Word spotting can be defined as process aimed at locating and retrieving a particular word from a document image collection. The main objective of word spotting systems is to propose methods that show high accuracy, high speed and work on any language with minimum preprocessing steps. A word spotting method requires a collection of documents/document corpus and an input element is a query word. The output of word spotting method is spotting and retrieval of documents or sub images that are similar to the query word. Figure 1 illustrates a general architecture of word spotting method where the whole procedure is divided in an offline and an online phase. In the offline stage, a set of features are extracted from either word images, or text lines or whole document pages which are then represented by feature vectors. In the online phase, a user formulates a query either by selecting an actual example from the collection or by typing an ASCII text word. Then matching process is applied to these representations in order to obtain a similarity score which yields a ranking list of results according to their similarity with the query.

Figure 1.

General architecture of word spotting (Courtesy: Giotis et al., 2017)

Top

Challenges Posed By Word Spotting Problem

The word spotting in handwritten documents is not completely solved due to various challenges posed by handwritten documents and the challenges involved in handwritten documents are:

•
Either historical or modern Handwritten documents suffer from variability in writing style, not only for different authors but also for documents of the same writer.
•
The handwritten words may be skewed, characters may be slanted, non-text content such as symbols may be present and letters may be broken or connected in a cursive manner
•
Degradations such as missing data, non-stationary noise due to illumination changes, low contrast, and warping effects, which directly affect the segmentation and feature extraction stages of a word spotting method.

Top

Applications Of Word Spotting

There are a variety of applications of word spotting in handwritten documents such as:

•
Searching and browsing historical handwritten documents collections written by a single or several authors. Retrieval of documents with a given word in company/organization files. Retrieval of keywords in hospital care reports.
•
Helps human transcribers in identifying words in degraded documents
•
Sorting of mails based on significant words like urgent, cancellation and complain
•
Identification of figures and their corresponding captions. Word spotting in geographical maps.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference