Semantic Segmentation: A Systematic Analysis From State-of-the-Art Techniques to Advance Deep Networks

Semantic Segmentation: A Systematic Analysis From State-of-the-Art Techniques to Advance Deep Networks

Aakanksha, Arushi Seth, Shanu Sharma
Copyright: © 2022 |Pages: 28
DOI: 10.4018/JITR.299388
Article PDF Download
Open access articles are freely available for download

Abstract

Semantic segmentation was traditionally performed using primitive methods; however, in recent times, a significant growth in the advancement of deep learning techniques for the same is observed. In this paper, an extensive study and review of the existing deep learning (DL)-based techniques used for the purpose of semantic segmentation is carried out along with a summary of the datasets and evaluation metrics used for the same. The paper begins with a general and broader focus on semantic segmentation as a problem and further narrows its focus on existing DL-based approaches for this task. In addition to this, a summary of the traditional methods used for semantic segmentation is also presented towards the beginning. Since the problem of scene understanding is being vastly explored in the computer vision community, especially with the help of semantic segmentation, the authors believe that this paper will benefit active researchers in reviewing and studying the existing state-of-the-art as well as advanced methods for the same.
Article Preview
Top

Introduction

From the past few decades, tremendous growth can be seen in the computer vision community. Researchers have provided optimal solutions in different vision based tasks like image classification, object detection, object labelling, saliency estimation, image compression and many more (Lu, 2007; Verschae, 2015; Messer, 2017). Almost all the vision-based applications include a basic step of segmenting an image into meaningful regions, which is basically the process of linking each pixel in an image with a class label. Although many optimal solutions have been provided till date for segmenting an image (Guo, 2018; Zaitoun, 2015), due to unpredictable real world situations and dependency of a majority of vision applications on this step, segmentation of image is still an open research problem for the computer vision community.

In this study, our focus is on performing the semantic segmentation for scene understanding. Semantic segmentation is a process of assigning a meaningful label to each pixel based on the context of the environment (Lateef, 2019). It is a very useful step for a variety of computer vision applications where it is important to understand the context of the operating environment in which the systems are operating, for example in robotics (Kim, 2018), self-driving cars (Kaymak, 2019) etc. Further, scene understanding is a computer vision application which includes analysis and perception of an image of the scene to create an overview of the event depicted in the scene (Xiao, 2013). A scene shows a real-world situation which is extracted from the environment. It includes multiple objects which are interacting with each other, thereby having some meaning. A scene can represent a variety of real-world events ranging from personal events to public events. Scene understanding is the process of interpreting scenes, which are captured through devices like cameras, microphones, contact sensors etc. to get an in depth understanding of it (Aarthi, 2017). The data of a scene can be expressed using various features like color, texture, and light intensity, thus, the process of creating a good understanding of a scene includes proper extraction of features from an image of a scene that characterizes it efficiently. It is based on the idea of vision and cognition, in which the functionality of detection, localization, recognition and understanding is performed first, followed by cognition, which is used to add functionalities like learning, adaption, finding alternatives, interpretation and analysis. The models that perform scene understanding include the capability to analyze events and modify accordingly. It can adapt to unforeseen data and perform robustly in such situations (Li, 2009). Scene understanding is included in machines to make them capable of interpreting events in a similar manner as humans do.

Complete Article List

Search this Journal:
Reset
Volume 16: 1 Issue (2024): Forthcoming, Available for Pre-Order
Volume 15: 6 Issues (2022): 1 Released, 5 Forthcoming
Volume 14: 4 Issues (2021)
Volume 13: 4 Issues (2020)
Volume 12: 4 Issues (2019)
Volume 11: 4 Issues (2018)
Volume 10: 4 Issues (2017)
Volume 9: 4 Issues (2016)
Volume 8: 4 Issues (2015)
Volume 7: 4 Issues (2014)
Volume 6: 4 Issues (2013)
Volume 5: 4 Issues (2012)
Volume 4: 4 Issues (2011)
Volume 3: 4 Issues (2010)
Volume 2: 4 Issues (2009)
Volume 1: 4 Issues (2008)
View Complete Journal Contents Listing