Methods of Semantic Integrity Preservation in the Pattern Recognition Process

Methods of Semantic Integrity Preservation in the Pattern Recognition Process

Iuliia Kim (ITMO University, Saint Petersburg, Russia), Anastasiia Matveeva (ITMO University, Saint Petersburg, Russia), Ilya Viksnin (ITMO University, Saint Petersburg, Russia) and Roman Patrikeev (ITMO University, Saint Petersburg, Russia)
DOI: 10.4018/IJERTCS.2019070108


In this article, much attention is paid to pattern recognition quality, especially the visual information semantic integrity preservation. The main purpose is to find the ways of its possible improvement to the three basic stages of the pattern recognition process: image preparation, image processing, and classification. To avoid semantic integrity violations of information, in the initial stage of the image analysis, normalization is proposed. In the second stage, a new clustering method was developed, based on particle swarm optimization and the k-means algorithm. In the final stage of the pattern recognition process the Haar classifier was used with normalized training samples. The proposed algorithm and only Haar classifier with non-normalized samples were tested on 500 blurred images: in 8% of samples both algorithms provided semantic integrity preservation and in 64% only the developed algorithm worked effectively.
Article Preview

1. Introduction

The field of machine learning, despite its recent appearance, has already spread in many aspects of human life, such as medical and technical diagnostics, augmented reality development, speech recognition and computer vision. This article is going to focus on computer vision, namely, pattern recognition and problems connected with its quality.

Computer vision is a wide area of theoretical investigations and technical methods connected with visual information processing for object detection, object tracking and object classification; it is a discipline studying the ways of obtaining and analyzing information from images. Computer vision is considered as one of the most prospective directions in the technical developments. It found an implementation in the field of sports (Thomas, Gade, Moeslund, Carr & Hilton, 2017) and assistive technologies (Leo, Medioni, Trivedi, Kanade & Farinella, 2017), namely:

  • Elaboration of socially assistive robots for supporting peoples’ mental functions states computer vision tasks to make these robots able to adapt to various situations and react to surrounding changes;

  • Computer vision techniques were used during the development process of intelligent wheelchairs;

  • Computer vision is involved in the area of prosthetic limb control, which intends to use visual information recognition in order to select orientation, grasp shape, and size of the manipulated object.

Computer vision is able to automate and accelerate processes, for instance, production, transportation, monitoring, etc. In the epoch of high-level Industry 4.0 development, this field becomes more relevant, especially in the context of elaborating unmanned elements: robots, drones, vehicles.

Computer vision was implemented for unmanned drones for landslide monitoring (Lucieer, Jong & Turner, 2014). Their systems use SfM (Structure from Motion) algorithms and image correlation for accurate result provision.

Plans dedicated to unmanned vehicles are being developed actively and are on the verge of integrating it into the everyday life of society. For instance, project Spirit of Berlin was started in 2007 at the Free University of Berlin. As a base, it uses the car model Dodge Caravan. The car body is equipped with a GPS system and lasers that fix objects at a distance of 150 meters. In order to clarify the car position, the Kalman filter is used. Spirit of Berlin also builds the environment model with help of laser scanner, and this model is used for the further decision-making process based on the existing systems’ knowledge. The cars use general and omnidirectional cameras in analyzing road marking and surrounding objects. Two codirectional cameras are responsible for forming a stereo image of the environment. In the process of image recognition to extract object limits, the Sobel operator is used. After that, the obtained image is converted into a black and white format, which reduces the size of consumed memory and increases information processing speed. The car has two onboard computers, which cooperate with each other by dint of Ethernet communicators that raises system’s fault tolerance. One of the shortcomings of the project consists in the case that the image recognition algorithm does not work effectively if the road marking is not clearly notable.

In Italy a commercial project ARGO dedicated to the development of a car unmanned management system was initiated. A new prototype of car body equipped with cameras was created. The model focuses on using only two codirectional cameras in cooperation with board computer in order to minimize the costs for future users. The developed system proposes three work modes:

  • Manual management, when the devices only track and store driver’s actions;

  • Supervised management, which is intended to provide automatic driving except for emergency situations;

  • Automatic management, when the car is completely under system control.

Complete Article List

Search this Journal:
Open Access Articles: Forthcoming
Volume 11: 4 Issues (2020): 1 Released, 3 Forthcoming
Volume 10: 4 Issues (2019)
Volume 9: 2 Issues (2018)
Volume 8: 2 Issues (2017)
Volume 7: 2 Issues (2016)
Volume 6: 2 Issues (2015)
Volume 5: 4 Issues (2014)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing