Nighttime Object Detection: A Night-Patrolling Mechanism Using Deep Learning

Nighttime Object Detection: A Night-Patrolling Mechanism Using Deep Learning

V. Dinesh Reddy, Sai Vishnu Vamsi Senagasetty, Krishna Teja Vanka, Mohana Vamsi Dhara, Rupini Durga Puvvada, Muzakkir Hussain
DOI: 10.4018/978-1-6684-6937-8.ch024
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

These days, it's becoming harder to feel safer when we go out at night. So, to tackle this security problem, the authors propose a night patrolling mechanism to detect objects in low light conditions. Images taken during the nighttime have difficulties with less contrast, brightness, and noise owing to inadequate light or insufficient exposure. Deep learning-based methods accomplish end-to-end, unsupervised object recognition using convolutional neural networks, which abolishes the requirement to describe and draw out attributes separately. Despite the fact that deep learning has led to the invention of many successful object detection algorithms; many state-of-the-art object detectors, like Faster-RCNN and others, can't carry out at their best under low-light situations. Even with an extra light source, it is hard to detect the features of an item due to the uneven division of brightness. This chapter proposes a deep learning algorithm called single shot detector, with Mobilenet v2 as the backbone to tackle the issues of object detection under low-light situations.
Chapter Preview
Top

1. Introduction

Object detection is the primary job in many computer vision applications because it allows more information about the identified object and the scene to be obtained. Object detection has been used in a broad range of applications, including human-computer interaction, robotics service robots, electronic goods like smartphones, security like recognition, tracking, and retrieval such as photo management, search engines, and transportation. Object localization involves locating and sizing a single object that is known to be present in the image; object presence classification involves determining whether at least one object of a given class is present in the image (without providing any information about other elements) and object recognition involves determining whether a specific object is present in the image. The fourth associated issue, view and posture estimation, requires evaluation of both the view and the location of the object. There are different types of object detection algorithms.

1.1. Coarse-to-Fine and Boosted Classifiers

Viola and Jones' enhanced cascade classifier is the most popular study in this area (2004). It operates by effectively rejecting picture patches that do not belong to the object using a series of tests and filters. The employment of cascade techniques with boosted classifiers is common for two motives: I Because boosting produces an additive classifier, it is simple to manage the complexity of each level of the cascade; (ii) boosting may also be used for feature selection during training, allowing the usage of huge (parametrized) families of data.

1.2. Dictionary Based

The Bag of Word approach (Serre et al. (2005) and Mutch & Lowe (2008)) is the greatest example in this area. This method is primarily meant to detect a single item per picture, but it can also detect the remaining objects after eliminating an identified object (Lampert et al. (2009)). This technique has two flaws: it cannot reliably handle the circumstance of two instances of the item appearing near one another, and the object's localization may be inaccurate.

1.3. Deep Learning

Convolutional neural networks are one of the first effective approaches in this class. This strategy differs from the others in that the feature representation is learned rather than defined by the user, but it comes with the disadvantage of requiring a large number of training examples to train the classifier. Deep learning is the cause of all recent developments in artificial intelligence. Self-driving cars, chatbots, Alexa and Siri would not exist without deep learning. Neural networks are the root of all these technologies.

Traditional machine learning models like SVM and Naive Bayes classifiers stop progressing beyond a saturation point, however, deep learning models tend to get more accurate with more training data.

Key Terms in this Chapter

Object Classification: The classification of objects is the grouping of objects into groups, each with its own set of characteristics.

R-CNN: It is a sort of machine learning model used for computer vision applications, particularly for object detection. The full form of R-CNN is a Region-Based Convolutional Neural Network.

Image-Level Metadata: Image metadata is the term for the precise information and facts pertaining to a single photo file. Author, creation date, file name, themes, content, etc. are frequently included in the information.

Atmospheric Conditions: The elements of the atmosphere, including temperature, wind, clouds, and precipitation, that make up its current state.

MobileNetV2: A convolutional neural network design called MobileNetV2 aims to function well on mobile devices. It is built on an inverted residual structure where the bottleneck layers are connected by residual connections. Lightweight depthwise convolutions are used in the intermediate expansion layer as a source of nonlinearity to filter features.

Deeping Learning: It is a kind of machine learning that uses artificial neural networks and numerous processing layers to extract increasingly more complex properties from data.

Object Localization: It is a process to determine the location of an object in an image. The algorithm used for the process of localization will reveal the coordinates of the position of an object in comparison with the image.

Object Detection: It is a method of computer vision that enables us to find and recognize things in an image or video. Using this form of localization and identification, object detection can be used to count the items in a scene, as well as to locate and track them in real time while precisely labeling them.

Bounding Boxes: In the process of object detection, where the goal is to determine the location and nature of various items in a picture, bounding boxes are primarily used.

Feature Extraction: For the categorization of biological signals to perform better, feature extraction and dimension reduction are necessary. Finding the most condensed and informative set of features (distinct patterns) is the goal of feature extraction in order to improve the effectiveness of the classifier.

Complete Chapter List

Search this Book:
Reset