Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

Segmentation Free Word Spotting for Handwritten Documents Using Bag of Visual Words Based on Co-HOG Descriptor

Thontadari C., Prabhakar C. J.

Source Title: International Journal of Information Retrieval Research (IJIRR) 9(2)

DOI: 10.4018/IJIRR.2019040105

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

In this article, the authors propose a segmentation-free word spotting in handwritten document images using a Bag of Visual Words (BoVW) framework based on the co-occurrence histogram of oriented gradient (Co-HOG) descriptor. Initially, the handwritten document is represented using visual word vectors which are obtained based on the frequency of occurrence of Co-HOG descriptor within local patches of the document. The visual word representation vector does not consider their spatial location and spatial information helps to determine a location exclusively with visual information when the different location can be perceived as the same. Hence, to add spatial distribution information of visual words into the unstructured BoVW framework, the authors adopted spatial pyramid matching (SPM) technique. The performance of the proposed method evaluated using popular datasets and it is confirmed that the authors' method outperforms existing segmentation free word spotting techniques.

Article Preview

Top

1. Introduction

Nowadays, documents digitization has become more popular for storage and transmission instead of the traditional paper documents. In order to access the content of digitized documents, the manuscripts are transcribed into machine understandable format, so users can perform the textual search. When dealing with huge collections of handwritten documents, automatic transcription processes are carried out using Optical Character Recognition (OCR) strategies. An automatic recognition of poor quality handwritten text is not feasible by traditional OCR approaches which mainly adequate for modern printed documents with simple layouts and known fonts. Most of the constraints encountered by OCR systems for handwritten documents stem from difficulties in segmenting characters or words, the variability of the handwriting and the open vocabulary. In order to overcome the drawbacks of OCR, the researchers have developed word spotting technique which becomes an essential tool to retrieve the historical and modern handwritten documents based on user interest information. Word spotting can be defined as the pattern recognition task aimed at locating and retrieving a particular word from a document image collection without explicitly transcribing the whole corpus.

The researchers have proposed techniques for word spotting in handwritten documents either using segmentation or without segmentation of handwritten documents. The main drawback of segmentation-based word spotting techniques is that they need to perform segmentation step to select candidate words. Any segmentation errors affect the subsequent steps such as word representation and matching, so it is desirable to avoid segmentation of documents. This motivated to the researchers of word spotting domain move towards segmentation free word spotting methods. In segmentation free methods (Leydier et al., 2005; Gatos et al., 2009), the document images are represented by feature descriptor such as Surface Invariant Feature Transform(SIFT). Then, sliding window or patch-based approaches are used to locate the document regions that are most similar to the query word (Rusinol et al., 2015; Shekhar et al., 2012; Rothacker et al., 2013 and Zhang et al., 2013). The drawback of SIFT-based word spotting is that they are memory intensive; window size cannot be adapted to the length of the query, relatively slow to compute and match. In order to avoid matching all the key points among them, the Bag of Visual Words (BoVW) technique has been used for word spotting in handwritten documents (Rusinol et al., 2011; Shekhar et al., 2012). The BoVW based word spotting methods yield holistic and fixed-length image representation while keeping the discriminative power of local descriptor.

Almazan et al. (2014) have proposed unsupervised segmentation free word spotting method based on HOG descriptor. Documents images are represented through a grid of HOG descriptor, and a sliding-window approach is used to locate the document regions that are most similar to the query. HOG feature descriptor captures orientation of only isolated pixels, whereas spatial information of neighboring pixels is ignored. In order to capture the spatial information of neighboring pixels, we propose a Co-occurrence Histogram of Oriented Gradient (Co-HOG) descriptor (Watanabe et al., 2009) for word spotting in handwritten documents. The Co-HOG is an extension of HOG descriptor, which encodes gradient orientation of neighboring pixel pairs and accordingly captures more spatial and relative information, making it more dominant to represent the characters shape precisely and effectively.

Complete Article List

Search this Journal:

Reset

Volume 14: 1 Issue (2024)

Volume 13: 1 Issue (2023)

Volume 12: 4 Issues (2022): 3 Released, 1 Forthcoming

Volume 11: 4 Issues (2021)

Volume 10: 4 Issues (2020)

Volume 9: 4 Issues (2019)

Volume 8: 4 Issues (2018)

Volume 7: 4 Issues (2017)

Volume 6: 4 Issues (2016)

Volume 5: 4 Issues (2015)

Volume 4: 4 Issues (2014)

Volume 3: 4 Issues (2013)

Volume 2: 4 Issues (2012)

Volume 1: 4 Issues (2011)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Segmentation Free Word Spotting for Handwritten Documents Using Bag of Visual Words Based on Co-HOG Descriptor

Abstract

1. Introduction

Complete Article List