Real-Time Vehicle Traffic Prediction in Apache Spark Using Ensemble Learning for Deep Neural Networks

Real-Time Vehicle Traffic Prediction in Apache Spark Using Ensemble Learning for Deep Neural Networks

Anveshrithaa Sundareswaran, Lavanya K.
Copyright: © 2020 |Pages: 18
DOI: 10.4018/IJIIT.2020100102
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Escalating traffic congestion in large and rapidly evolving metropolitan areas all around the world is one of the inescapable problems in our daily lives. In light of this situation, traffic monitoring and analytics is becoming the need of the hour in today's world. Real-time traffic analysis requires processing of data streams that are being generated continuously in real time to gain quick insights. The challenge of analyzing streaming data for real-time prediction can be overcome by exploiting deep learning techniques. Taking this as a motivation, this work aims to integrate big data technologies and deep learning techniques to develop a real-time data stream processing model for vehicle traffic forecast using ensemble learning approach. Real-time traffic data from an API is streamed using a distributed streaming platform called Kafka into Apache Spark where it is processed, and the traffic flow is predicted by a neural network ensemble model. This will reduce the travel time, cost, and energy through efficient decision making, thus having a positive impact on the environment.
Article Preview
Top

Introduction

With the dawn of the era of Big Data upon us, there has been a huge surge in the volume of available data and it is anticipated to increase several folds in the coming years. All-embracing research is being carried out to put this data into effective use. The growth in big data and big data technologies have opened up interesting prospects that were once unforeseen by researchers and businesses. Research on how to take advantage of the available data with Artificial Intelligence is achieving enthralling and striking results. One of the areas which needs the consideration of this is vehicular traffic analysis which has emerging demands in today’s world. As stated by the World Urbanization Prospects by United Nations, the urban population of the world has increased swiftly from 751 million in 1950 to 4.2 billion in 2018 which accounts for 55% of the world’s population, a proportion that is likely to surge to 68% by 2050 (Anonymous, 2018). With such rapid urbanization, vehicular traffic on roads is set to intensify drastically leading to congestion and accidents. This issue of increasing vehicle traffic needs to be addressed by exploiting big data technologies and machine learning to offer efficient solutions that offer convenience for commuters. With the easy and wide accessibility of massive amounts of data along with the advancement in big data analytics, it has been made possible to bring progress in traffic analysis.

Though the development in technology has already on track creating a great impact on the field of transportation and traffic management, by addressing difficulties and providing advanced, effective and intelligent solutions, it is still a challenging problem to carry out real time traffic analysis. This is because of the intricacies in analysing and learning from enormous volumes of real time streaming data and offer accurate predictions which need more consideration. This can be expedited by leveraging Machine Learning and big data techniques such as stream processing to develop more advantageous and desirable solutions. In order to process data streams at a faster rate, we need high computing capacity. Big data frameworks like Apache Spark, Hadoop and Kafka, with their capability of processing a multitude of data, have resolved the major issue of processing and storing continuously flowing data stream. With the progress of various big data frameworks that process large volumes of data in a distributed computing environment, it is now convenient and easy to manage, process, analyse and store data for real time analytics. These big data technologies along with the remarkable growth in the field of artificial intelligence, have made it plausible to develop advanced and efficient data stream processing systems that can curb the problem of this ever-increasing vehicular traffic congestion around the world by real time traffic analysis.

The proposed work focuses on analysing the traffic data and thus predicting traffic flow by utilizing deep Learning techniques and an efficient big data framework based on Hadoop MapReduce called Apache Spark that effectively handles unstructured, real time, streaming data. The deep learning model used to predict traffic is an ensemble of many deep neural networks. The significance of the proposed work is the implementation of an end to end traffic prediction framework by integrating Apache Kafka, Apache Spark and MongoDB for streaming, processing and storing of traffic data along with the use of the deep neural network ensemble learning model for efficiently performing real time traffic analysis and forecast. The model predicts the traffic flow in a lane at different times of a particular day or for the upcoming days based on the historical as well as real-time data by using an ensemble of neural networks which yields a better accuracy in prediction than an individual neural network.

Complete Article List

Search this Journal:
Reset
Volume 20: 1 Issue (2024)
Volume 19: 1 Issue (2023)
Volume 18: 4 Issues (2022): 3 Released, 1 Forthcoming
Volume 17: 4 Issues (2021)
Volume 16: 4 Issues (2020)
Volume 15: 4 Issues (2019)
Volume 14: 4 Issues (2018)
Volume 13: 4 Issues (2017)
Volume 12: 4 Issues (2016)
Volume 11: 4 Issues (2015)
Volume 10: 4 Issues (2014)
Volume 9: 4 Issues (2013)
Volume 8: 4 Issues (2012)
Volume 7: 4 Issues (2011)
Volume 6: 4 Issues (2010)
Volume 5: 4 Issues (2009)
Volume 4: 4 Issues (2008)
Volume 3: 4 Issues (2007)
Volume 2: 4 Issues (2006)
Volume 1: 4 Issues (2005)
View Complete Journal Contents Listing