Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
OnDemand
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us

From Image to XML: Monitoring a Page Layout Analysis Approach for the Visually Impaired

Robert Keefer, Nikolaos Bourbakis

Source Title: International Journal of Monitoring and Surveillance Technologies Research (IJMSTR) 2(1)

DOI: 10.4018/ijmstr.2014010102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Page layout analysis and the creation of an XML document from a document image are useful for many applications including the preservation of archived documents, robust electronic access to printed documents, and access to print materials by the visually impaired. In this paper, the authors describe a document image process pipeline comprised of techniques for the identification of article headings and the related body text, the aggregation of the body text with the headings, and the creation of an XML document. The pipeline was developed to support multiple document images captured by the head-mounted cameras of a reading device for the visually impaired. Both automatic and manual adaptations of the pipeline processed a sample of 25 newspaper document images. By comparing the automatic and manual processes, we show that overall our approach generates high-quality XML encoded documents for use in further processing, such as a text-to-speech for the visually impaired.

Article Preview

Top

1. Introduction

Page layout analysis and the creation of an XML document from a document image are useful for many applications including the preservation of archived documents (Wang, et al., 2009) and accessibility by those with visual impairments. TYFLOS (Keefer, et al., 2009a,b) is a prototype wearable mobile reading device for the visually impaired. TYFLOS is equipped with two web cameras mounted into a pair of glasses and the software for performing document image rectification and segmentation. Traditional document image analysis techniques play an important role in the operation of the TYFLOS prototype, including document image capture, binarization, page perspective correction in 3-dimensions, page curl correction, and page segmentation. In this paper we describe techniques for headline identification, page segment aggregation, and the creation of an XML document from the document image. The XML document supports various forms of interaction with the text of the document, including a voice user interface (Keefer, et al., 2013).

Much work has been performed to identify headlines within web sites and document images. This work has been in the context of both improving access to documents for the visually impaired, as well as the digital access of archived documents. For example, Brudvick, et al. (2008) have developed a method to predict whether web page content semantically functions as a headline by considering the visual features of text when rendered in a browser. Similarly, Kohlschütter, et al. (2010) describe a method for identifying text elements within a web page.

Document segmentation has been of interest to the document image processing community for many years. O’Gorman’s (1993) Docstrum method offered an original and well organized analysis of document layout analysis based on K-nearest neighbors to identify connected components and from these to identify regions of text. Akram et al. (2010) offer a review on the way to process a document and generally segment the layout area. In another approach, Winder et al. (2011) describe a method for page segmentation based on an analysis of the Voronoi zones of a histogram of the connected component heights of image segments. Similarly, Breuel et al. (2011) also patented a method for document image layout deconstruction. Finally, Ferilli, et al. (2011) apply supervised machine learning techniques to document image layout analysis.

For the purposes of supporting robust interaction with document images converted to XML, Ishitani (2003) proposed a method for transforming a document image into XML. This method extracts document elements such as title, headings, and body text from a document image. The hierarchical structure of the document is also extracted and described by a document object model (DOM). The XML document is created through a set of transforms applied to the extracted document elements and the DOM.

WISDOM++ (Altamura, et al., 2001) is a document processing system that performs document analysis, classification, and text transformation to generate an XML document from a document image. Agrawal and Doermann (2010) also discuss a method for page segmentation that produces GEDI XML files.

To create an XML document from the document image, a document image segmentation method must separate images from text, identify headings within the document image, and identify article content within the document image. The methods described in (Ishitani, 2003), (Altamura, et al., 2001), (Agrawal and Doermann, 2010), (Antonacopoulos and Karatzas, 2004), and (Pletschacher and Antonacopoulos, 2010) all rely on robust document analysis methods to identify the structure and format of the document image, followed by an OCR step to convert the text within a segment to XML.

Complete Article List

Search this Journal:

Reset

Open Access Articles: Forthcoming

Volume 5: 4 Issues (2017)

Volume 4: 4 Issues (2016)

Volume 3: 4 Issues (2015)

Volume 2: 4 Issues (2014)

Volume 1: 4 Issues (2013)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

From Image to XML: Monitoring a Page Layout Analysis Approach for the Visually Impaired

Abstract

1. Introduction

Complete Article List