Lightweight Neural Networks for Pedestrian Detection in Intelligent Vehicles

Lightweight Neural Networks for Pedestrian Detection in Intelligent Vehicles

Riadh Ayachi, Mouna Afif, Yahia Said, Abdessalem Ben Abdelali
DOI: 10.4018/978-1-6684-6937-8.ch022
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Most actual intelligent vehicles (IV) are powered by a variety of sensors and cameras. Vision-based applications for IV mainly require visual information. In this paper, the authors introduce a pedestrian detection application used for pedestrian safety. The authors proposed a deep fully convolutional neural network (DFCNN) for pedestrian detection. The proposed model is suitable for mobile implementation. To do this, the authors propose to build lightweight blocks using convolution layers, and replace pooling layers and fully connected layers with convolution layers. Training and testing of the proposed DFCNN model for pedestrian detection were performed using the Caltech dataset. The proposed DFCNN has achieved 85% of average precision and an inference speed of 30 FPS. The reported results have demonstrated the robustness of the proposed DFCNN for pedestrian detection. The achieved performance was low computation complexity and high performance.
Chapter Preview
Top

Introduction

The explosion in the number of both vehicles and pedestrians makes it hard to share the same environment safely. According to the Association for Safe International Road Travel (2019), 1.25 million persons die in a crash accident each year with an average of 3,287 deaths per day. Besides, 20-50 million persons are injured or disabled.

One of the proposed solutions is IV which is used to help the driver in a complicated situation or to perform a simple task like driving on a highway. Most IVs are equipped with the latest technologies. An IV is equipped with a big number of active sensors like ultrasonic and passive sensors like cameras. Most IV applications are based on passive sensors thanks to their low power consumption. In addition, an IV can control the vehicle through the engine, gearbox, and brakes. It can apply a set of authorized actions in case of potential danger such as a crush on another vehicle or pedestrian. A vision-based application for an IV is the most important. Many vision applications like traffic sign detection and recognition (Ayachi, 2019a), (Ayachi, 2019b), and traffic light detection (Dimian, 2019) have been stacked in IV to provide the driver with important information about the traffic environment. IV has been developed to enhance traffic security and to ensure a safer environment. Human beings, either the driver or the pedestrian, represent the most important element to focus on their safety.

Motivated by pedestrian safety, the pedestrian detection system was designed through an intelligent vision that processes camera data and localizes pedestrians automatically. A high-performance pedestrian detection system can locate pedestrians under challenging conditions such as occlusion and warn the driver to avoid potential accidents. This can help to improve driving safety in urban spaces. However, implementing an accurate pedestrian detection system on a limited computation resources device is considered a hard challenge that must be addressed. The mentioned challenge was considered a motivation for the proposed approach in addition to pedestrian safety.

With a focus on pedestrian safety, a pedestrian detection application for an IV was proposed. The main idea of the proposed application is to process the visual data and detect pedestrians while crossing the street to warn the driver. The main challenge of building a robust pedestrian detection application comes first from the limited computation resources of the platform used for IV and from the complexity of the environment because of the lighting conditions, the point of view, the complex background, and occlusion. A reliable and robust pedestrian detection application must overcome those challenges as a priority. Besides reliability, real-time operation is another challenge for pedestrian detection applications. The application must detect pedestrians even at high speed and moving pedestrians. Advances in artificial intelligence (AI) techniques have achieved great success in different fields including computer vision and its applications. Those AI techniques are considered a suitable solution for pedestrian detection in IV.

The proposed pedestrian detection application is based on a convolutional neural network (CNN) (Krizhevsky, 2012). It is a deep learning (Schmidhuber, 2015) model generally deployed to solve vision-based applications. Convolutional neural networks are inspired by the biological system (Fukushima, 1980) where the connectivity between the artificial neurons is similar to the organization of the visual cortex of an animal. This connectivity allows the building of a deep convolutional neural network (DCNN) with tens of hidden layers and enables feeding the network with a huge amount of training data without overfitting it (Zeiler, 2014). DCNN was deployed successfully to solve vision-based applications. Many DCNN models were proposed to solve vision-based applications such as object detection (Ayachi, 2020), indoor object recognition (Afif, 2018), face identification (Ranjan, 2019), and human pose estimation (Omran, 2018). In this paper, we propose to use the DCNN for pedestrian detection applications. Although the DCNN is computationally expensive and energy intensive.

Key Terms in this Chapter

Image Processing: It is a method of performing operations on an image in order to improve it or extract relevant information from it. It is a sort of signal processing in which the input is an image and the output can be an image or features.

Intelligent Vehicles: A vehicle that can collect and process information about its own status and the environment in order to make decisions, provide information, and react at the due time.

Residual Connection: It is a skip connection that connects the output of one convolutional layer to the input of another convolutional layer several layers later (e.g. a number of intermediate convolutional steps are skipped). A simple sum is used to merge the mainstream model's input and the convolutional prior's input.

Pedestrian Detection: It is an important task in many intelligent systems since it gives basic information for semantic interpretation of scenes. Because of the potential to improve safety systems, it has an apparent extension to automobile applications.

Lightweight Convolutional Neural Network: It is convolutional neural network with low computation complexity that is suitable for real-time application and implementation on edge devices.

Computer Vision: It is a field that studies how computers may learn to recognize digital images or video using artificial intelligence techniques. From an engineering standpoint, it tries to comprehend and automate operations that the human visual system is capable of performing.

Mobile Devices: It is a computation hardware system with a microprocessor that is designed to fulfill a certain purpose, either as an independent system or as part of a larger system. At the heart of the system is an integrated circuit built to perform calculation for real-time processes. It is characterized with a limited computation resources and energy based generally on battery.

Complete Chapter List

Search this Book:
Reset