Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Image-Abstraction Framework as a Preprocessing Technique for Extraction of Text From Underexposed Complex Background and Graphical Embossing Images

Pavan Kumar, Poornima B., H. S. Nagendraswamy, C. Manjunath, B. E. Rangaswamy

Source Title: International Journal of Distributed Artificial Intelligence (IJDAI) 13(1)

DOI: 10.4018/IJDAI.2021010101

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Underexposed heterogeneous complex-background and graphical embossing text documents are treated using proposed preprocessing image-abstraction framework that can deliver the effective structure preserved abstracted output by manipulating visual-features from input images. Reading of the text character in such images is extremely poor; hence, the framework effectively boosted the significant image properties and quality features at every stage. Work effectively preserves the foreground structure of an image by comprehensively integrating the sequence of NPR filters and diminishes the background content of an image, and in this way, the framework contributes to separation of foreground text from image background. Effectiveness of the proposed work has been validated by conducting the trials on the selected dataset. In addition, user's visual-feedback and image quality assessment techniques were also used to evaluate the framework. Based on the obtained abstraction output, this work extracts text-character by wisely utilizing traditional image processing techniques with an average accuracy of 98.91%.

Article Preview

Top

1. Introduction

In the prehistoric days palm leaves were used to carry information in script form. Extraction of text information from preserved leaves is a real challenging issue due to the organic nature of leaves which makes them decay gradually over the years, resulting in difficulty in extracting the information. Hence, recognition and extraction via traditional computer vision techniques may not yield the good success rate. The advancement in science and technology facilitates end users to synthesize, modify and capture the heterogeneous data using image acquisition devices and multimedia interactive tools. This led to collection of millions of digital documents in the form of digital electronic medium, digital-videos and still photographs published and stored in various social sites and storage repositories. According to the 2019 Flickr survey there are about 100 millions images stored every month in the Flickr repository. Availability of multimedia interactive tools allows the end users to stylize and enhance the image background using rich graphical elements in a sophisticated way and multimedia documents become more attractive and colorful although extraction of text information remains elusive. Analysis and extraction of text information in multimedia still photography is very much essential in the computer vision domain for the analysis of information in the image. Text in an image plays a very important role and furnishes indispensable information to make optimal decisions like document analysis, content based text information retrieval, identification of vehicle number plate, identification of street sign, guidance to blind people, automatic geo-coding, automatic email sorting, unmanned assistive navigation of vehicle, recognition of varies physical parts in industrial automations, guidance for foreign tourists (Lu, S, et al., 2015).

Text extraction from uniform background is much easier than underexposed heterogeneous complex background and graphical embossing images (to be referred as sampled images). The sampled 2D images not only consist of the text information but also non-textual information and are considered to be a mixture of natural scene text images, caption text images and documentary text images. Graphical embossing images are the combination of document text images and natural scene text images and in most of the situations they are captured via camera by amateur users and the text extraction process becomes tedious under this condition. Some of the examples of sampled 2D images are magazine papers, marks/grade cards, decorated power point slides, news papers, children story books and random clicking of images captured under low-illumination condition etc (P.Nagabhushan and S. Nirmala, 2010).

Since two decades various traditional text extraction techniques are being developed to extract the text from complex background by many researchers such as multilevel thresholding, adaptive local thresholding, global thresholding, gamma correction algorithm, histogram equalization, wavelet decomposition and combination of wavelet and moments of DWT wavelets and HAAR wavelets, homomorphic filtering, local binarization, connected component analysis (CCA) technique, hybrid binarization K-mean clustering (HBK), histogram oriented gradients, expectation maximization (EM), constrained run length algorithm (CRLA), morphological operations, median filtering, canny edge detection, sobel edge detection, markov random field (MRF), spiral run length smearing algorithm (SRLSA), support vector machine (SVM), hybrid CCA etc. Despite of all this, the best result is not possible by adopting the above mentioned one or two text extraction techniques. With the increase of multimedia image data is vehemently demands extraction of text from the sampled images. However, extraction of text from these 2D color images is not so easy because of various constraints posed in the extraction process by situations such as text characters embedded in graphically embossed background images, shading of characters being mixed with graphical embossing images, background of image content being stylized and text having multiple color in the background and foreground, varying text size with respect to row and column with variable distance in between them. A typical single page line text character may contain multiple colors for better visualization, edge strength of the text character may vary from character to character due to improper illumination effect and orientation of the text character in an image may be inconsistent and poor in contrast. In addition to the aforementioned problems, text extraction from sampled images also are constrained by shadows and weak text formatting and images captured in underexposed lighting conditions.

Complete Article List

Search this Journal:

Reset

Volume 15: 1 Issue (2024): Forthcoming, Available for Pre-Order

Volume 14: 2 Issues (2022)

Volume 13: 2 Issues (2021)

Volume 12: 2 Issues (2020)

Volume 11: 2 Issues (2019)

Volume 10: 2 Issues (2018)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Image-Abstraction Framework as a Preprocessing Technique for Extraction of Text From Underexposed Complex Background and Graphical Embossing Images

Abstract

1. Introduction

Complete Article List