Automated Text Detection and Recognition in Annotated Biomedical Publication Images

Soumya De, R. Joe Stanley, Beibei Cheng, Sameer Antani, Rodney Long, George Thoma
Copyright © 2017 | Pages: 33
DOI: 10.4018/978-1-5225-0571-6.ch018

Abstract

Images in biomedical publications often convey important information related to an article's content. When referenced properly, these images aid in clinical decision support. Annotations such as text labels and symbols, as provided by medical experts, are used to highlight regions of interest within the images. These annotations, if extracted automatically, could be used in conjunction with either the image caption text or the image citations (mentions) in the articles to improve biomedical information retrieval. In the current study, automatic detection and recognition of text labels in biomedical publication images was investigated. This paper presents both image analysis and feature-based approaches to extract and recognize specific regions of interest (text labels) within images in biomedical publications. Experiments were performed on 6515 characters extracted from text labels present in 200 biomedical publication images drawn from the ImageCLEF 2010 data set. Automated character recognition experiments were conducted using geometry-, region-, exemplar-, and profile-based correlation features and Fourier descriptors extracted from the characters. A correct recognition rate as high as 92.67% was obtained with a support vector machine classifier, compared to a 75.90% correct recognition rate with a benchmark Optical Character Recognition technique.
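To make the feature-and-classifier pipeline concrete, here is a minimal sketch, not the authors' implementation: it computes translation- and scale-normalized Fourier descriptors from a character's outer contour and classifies them with a support vector machine. OpenCV, NumPy, and scikit-learn are assumed, and the coefficient count, kernel choice, and names such as char_images and labels are illustrative assumptions rather than details from the study.

```python
import cv2
import numpy as np
from sklearn.svm import SVC

def fourier_descriptors(char_img, n_coeffs=16):
    """Fourier descriptors from the outer contour of a binary character image."""
    contours, _ = cv2.findContours(char_img, cv2.RETR_EXTERNAL,
                                   cv2.CHAIN_APPROX_NONE)
    boundary = max(contours, key=cv2.contourArea).squeeze(1)
    # Treat the boundary points as a complex signal and take its FFT.
    z = boundary[:, 0] + 1j * boundary[:, 1]
    coeffs = np.fft.fft(z)
    coeffs[0] = 0                       # drop DC term: translation invariance
    mags = np.abs(coeffs)               # magnitudes: start-point invariance
    mags /= mags[1] + 1e-12             # normalize: scale invariance
    return mags[1:n_coeffs + 1]         # keep the low-order descriptors

# Hypothetical usage: char_images is a list of binary character crops,
# labels the corresponding character classes.
# X = np.array([fourier_descriptors(img) for img in char_images])
# clf = SVC(kernel='rbf').fit(X, labels)
# prediction = clf.predict([fourier_descriptors(test_img)])
```

In practice, such descriptors would be concatenated with the geometry-, region-, exemplar-, and profile-based features the abstract mentions before training the classifier.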
Chapter Preview

1. Introduction

Essential information is often conveyed through images in biomedical publications. Images such as diagrams, tables, histograms, and flowcharts are typically rich in content, summarizing the important results and methods presented in an article. Such images, when used in conjunction with either the image caption text or the image citations (mentions) in the publications, can enhance the performance of Clinical Decision Support (CDS) systems (Demner-Fushman, 2008, 2009; Deserno, 2009). In previous studies, the retrieval of biomedical information for CDS has been primarily text-based, limited mainly to bibliographic information. Traditional Content-Based Image Retrieval (CBIR), by contrast, provides automated indexing and retrieval of large image collections. Biomedical images of a given modality (e.g., MRI, histology, or X-ray) are, however, visually very similar to one another. Existing CBIR techniques based only on the visual features (texture/shape) of images are therefore not sufficient for accurate retrieval of biomedical images (Pfund, 2002; Müller, 2004; Tang, 1999). In addition to text (image captions/citations) and visual features, characters retrieved from biomedical images can serve as part of a broader process to obtain complementary information for enhanced CBIR.

As part of CBIR, regions of interest (ROIs) within biomedical images are those that contain illustrations such as arrows, symbols, and text labels. Commonly used methods for CBIR, however, do not utilize these ROIs. The semantic gap in biomedical image analysis can be reduced by characterizing the ROIs, as compared to analyzing the image only as a single entity (Demner-Fushman, 2009; Deserno, 2009). Lehmann et al. proposed that three additional semantic abstraction levels are required of CBIR systems to understand complex medical knowledge (Lehmann, 2004). These include low-level medical information to understand the imaging modality, mid-level information obtained from ROIs, and high-level information obtained from the spatial relationships of ROIs (Lehmann, 2004, 2005).

Images in biomedical articles are generally of two types: medical images and analytical images. Medical images include MRIs, CT scans, X-rays, photographs, and so forth. Analytical images, such as diagrams, statistical charts, flowcharts, and tables, are images created either to illustrate biomedical concepts or to allow for biomedical data analysis. In previous studies involving CBIR, classification of analytical images into their various modalities has been performed successfully (Rahman, 2008; Pourghassem, 2008; Stanley, 2011). The information present within these analytical images must be extracted, however, to support both multimodal (image + text) biomedical information retrieval and CDS (Demner-Fushman, 2007; Cheng, 2011). The study presented in this paper focuses on enhancing the retrieval of textual information from analytical images.

As previously stated, authors often include several forms of annotations with their images. These annotations include, but are not limited to, text, text labels (e.g., A, B, and C), pointers (e.g., arrows and arrowheads), and symbols (e.g., asterisks). Such annotations are used to identify an ROI in the image. In previous CBIR-based research, arrow detection has been performed successfully on several types of biomedical images (Cheng, 2011; Dov, 1999; Park, 2008; Herold, 2010; Hearst, 2007). Herold et al. integrated semantic annotation and information visualization to analyze fluorescence micrographs of tissue samples for CBIR applications (Herold, 2010). Previous studies have also analyzed biomedical images with text-like characteristics for both the extraction and recognition of textual characters (Hearst, 2007; Wu, 1999; Xu, 2008, 2010; You, 2009, 2010).
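To give a sense of the detection step that precedes recognition, the sketch below isolates character-sized text-label candidates via connected-component analysis. This is a generic illustration, not the method of this chapter or of the cited studies; the OpenCV calls are standard, but the area and aspect-ratio thresholds are invented for the example.

```python
import cv2

def label_candidates(gray_img, min_area=20, max_area=2000):
    """Bounding boxes of character-sized connected components."""
    # Otsu binarization; INV so dark text on a light background becomes foreground.
    _, binary = cv2.threshold(gray_img, 0, 255,
                              cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)
    n, _, stats, _ = cv2.connectedComponentsWithStats(binary)
    boxes = []
    for i in range(1, n):               # label 0 is the background
        x, y, w, h, area = stats[i]
        # Illustrative size and shape filters for label-like components.
        if min_area <= area <= max_area and 0.2 <= w / h <= 5.0:
            boxes.append((x, y, w, h))
    return boxes
```

Surviving boxes would then be cropped and passed to a character recognizer such as the one sketched after the abstract.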
