LoG and Structural Based Arbitrary Oriented Multilingual Text Detection in Images/Video

LoG and Structural Based Arbitrary Oriented Multilingual Text Detection in Images/Video

Basavaraju H. T. (Sri Jayachamarajendra College of Engineering, Mysore, India), Manjunath Aradhya V.N. (Sri Jayachamarajendra College of Engineering, Mysore, India), Guru D. S. (University of Mysore, Mysore, India) and Harish H. B. S. (Sri Jayachamarajendra College of Engineering, Mysore, India)
Copyright: © 2018 |Pages: 16
DOI: 10.4018/IJNCR.2018070101

Abstract

Text in an image or a video affords more precise meaning and text is a prominent source with a clear explanation of the content than any other high-level or low-level features. The text detection process is a still challenging research work in the field of computer vision. However, complex background and orientation of the text leads to extremely stimulating text detection tasks. Multilingual text consists of different geometrical shapes than a single language. In this article, a simple and yet effective approach is presented to detect the text from an arbitrary oriented multilingual image and video. The proposed method employs the Laplacian of Gaussian to identify the potential text information. The double line structure analysis is applied to extract the true text candidates. The proposed method is evaluated on five datasets: Hua's, arbitrarily oriented, multi-script robust reading competition (MRRC), MSRA and video datasets with performance measures precision, recall and f-measure. The proposed method is also tested on real-time video, and the result is promising and encouraging.
Article Preview
Top

Introduction

The multimedia data is drastically increasing due to the availability of economical digital cameras. Hence, the multimedia data consists of a massive number of images and videos. Text in an image or a video delivers more detailed and explicit meaning. The textual information is a prominent source with a clear explanation of the content than any other high-level and low-level visual features. Therefore, text present in an image or a video helps us to understand an event or content. There are two types of text present in an image or a video; they are Caption Text: artificial text or superimposed text and Scene Text: Natural existence text captured by the camera. The separation of these texts from the background is challenging and interesting research work in the field of computer vision and pattern recognition.

Text detection is one of the prime part to separate the text from its background. Text detection is a process of determining the existence of text in an image or a video frame. This process does not have any pre-knowledge about textual properties present in an image or a video. The text detection stage is an initial stage of the text information extraction process. Hence it should be a fast and minimum false alarm. The ample of works have done on text detection process, and many of those methods considered particular properties and challenges to distinguish the text region from non-text region. The existing approaches to text detection can be categorized into four techniques, such as edge based, connected component based, texture based, eigenvalue based and region-based methods. But these methods are still facing a problem to detect the text accurately due to illumination, complex background, and variation of size, style, color, orientation and alignment of the text. Arbitrarily oriented text detection is much challenging than horizontal text detection process because the text is present in the form of various direction and text line does not maintain any uniformity. Arbitrary oriented multilingual language is much more difficult than any other text detection process, because the multilingual textual information consists multiples of languages, with different geometrical shapes, multi-color, multi-size, and multi-fonts. Hence, there is a demand for a new technique, which can take care of all the above-discussed problems. The text detection process plays a prominent role in applications like, automatic annotation, indexing, tracking, retrieval, event understanding, assisting tourists and assisting a blind person.

Complete Article List

Search this Journal:
Reset
Open Access Articles: Forthcoming
Volume 10: 4 Issues (2021): Forthcoming, Available for Pre-Order
Volume 9: 4 Issues (2020): 2 Released, 2 Forthcoming
Volume 8: 4 Issues (2019)
Volume 7: 4 Issues (2018)
Volume 6: 2 Issues (2017)
Volume 5: 4 Issues (2015)
Volume 4: 4 Issues (2014)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing