Real-Time Weather Analytics: An End-to-End Big Data Analytics Service Over Apach Spark With Kafka and Long Short-Term Memory Networks

Real-Time Weather Analytics: An End-to-End Big Data Analytics Service Over Apach Spark With Kafka and Long Short-Term Memory Networks

Lavanya K., Sathyan Venkatanarayanan, Anay Anand Bhoraskar
Copyright: © 2020 |Pages: 17
DOI: 10.4018/IJWSR.2020100102
OnDemand:
(Individual Articles)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Weather forecasting is one of the biggest challenges that modern science is still contending with. The advent of high-power computing, technical advancement of data storage devices, and incumbent reduction in the storage cost have accelerated data collection to turmoil. In this background, many artificial intelligence techniques have been developed and opened interesting window of opportunity in hitherto difficult areas. India is on the cusp of a major technology overhaul with millions of people's data availability who were earlier unconnected with the internet. The country needs to fast forward the innovative use of available data. The proposed model endeavors to forecast temperature, precipitation, and other vital information for usability in the agrarian sector. This project intends to develop a robust weather forecast model that learns automatically from the daily feed of weather data that is input through a third-party API source. The weather feed is sourced from openweathermap, an online service that provides weather data, and is streamed into the forecast model through Kafka components. The LSTM neural network used by the forecast model is designed to continuously learn from predictions and perform actual analysis. The model can be architected to be implemented across very large applications having the capability to process large volumes of streamed or stored data.
Article Preview
Top

Introduction

Weather forecasting is attracting researchers from the beginning to produce simulations based on actual atmospheric conditions. Its prediction is accomplished by using the physical principles, together with a variety of statistical and empirical techniques. Besides forecast of the atmosphere themselves, it includes earth-surface changes caused by atmospheric conditions such as snow and ice cover, storm tides, and flooding. The current wide accessibility of monstrous climate perception information and rise in big data management techniques have roused the investigative pattern in the expansive dataset for climate prediction. Climate estimation is a fascinating examination issue with wide potential applications starting from air travel to agribusiness industries. The difficulties of climate interpretation reside on learning over a huge volume of real time streaming dataset and building a more accurate climate forecast demonstration. In the most recent decade for climate anticipation, numerous noteworthy endeavors utilizing measurable displaying procedures including Artificial Intelligence (AI) have been accounted with more accuracies. The investigations on profound conviction nets (DBNs), deep networks systems, vitality-based models have risen towards established as profound design generative models. The word “profound” in profound learning demonstrates that neural Network (NN) with larger number of layers than the “shallow” ones as utilized in ordinary AI models. This multi-layered NN has widened research enthusiasm after effective usage of the layer-wise unsupervised pre-preparing instrument to illuminate the preparation challenges productively with higher learning capacity. The fruitful utilizations of profound learning in different spaces, by different specialists have inspired its utilization in climate portrayal and forecasting. The target of the examinations is to investigate the capability of profound learning procedure for climate gauging utilizing rich climate portrayals gained from enormous climate time arrangement information. In spite of numerous models’ availability, climate determination based on ground-based perception information remains a difficult errand. As per findings of Baklanov Maunder J. RW Katz and AH Murphy (Maunder, 2000), the principle challenge of climate gauging is the after-effect management techniques which is very hard to achieve in a single scientific model. In the constrained extension, numerous inquiries about endeavored to ensemble climate gauging models reveals that they use measurable strategies to foresee climate utilizing single or multiple factors as indicators.

Data Streaming is a technique for conveying data so that it can be processed as a secure and uninterrupted stream. Spark Streaming dataflow is of four high-level phases. The first is to convey data from multiple sources. They can be like Kafka, MySQL, Elastic Search, PostgreSQL, HBase, Mongo DB and Cassandra for static/batch streaming. Then Spark can execute deep learning on the data through its MLlib API. Further, Spark SQL/NoSQL is used to implement procedures on this data. Finally, the streaming output can be pushed into various data storage systems and local file system. In the current framework, the authors have integrated technologies such as Apache Spark, Kafka, MongoDB and LSTM for efficient real time streaming and weather prediction.

The following section highlights the background study and aids in understanding the contemporary works done henceforth and the section following that specifies the main characteristics of a real-time weather forecasting system, also gives synopsis of the big data tools from the Apache environment that are incorporated in the framework presented in this paper. It also details the learning and the streaming functionalities of the platform. Finally, the authors assess the precision of weather forecasting models conducted on a dataset across several experiments, concluding with highlights of the inferences obtained from the observations.

Complete Article List

Search this Journal:
Reset
Volume 21: 1 Issue (2024)
Volume 20: 1 Issue (2023)
Volume 19: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 18: 4 Issues (2021)
Volume 17: 4 Issues (2020)
Volume 16: 4 Issues (2019)
Volume 15: 4 Issues (2018)
Volume 14: 4 Issues (2017)
Volume 13: 4 Issues (2016)
Volume 12: 4 Issues (2015)
Volume 11: 4 Issues (2014)
Volume 10: 4 Issues (2013)
Volume 9: 4 Issues (2012)
Volume 8: 4 Issues (2011)
Volume 7: 4 Issues (2010)
Volume 6: 4 Issues (2009)
Volume 5: 4 Issues (2008)
Volume 4: 4 Issues (2007)
Volume 3: 4 Issues (2006)
Volume 2: 4 Issues (2005)
Volume 1: 4 Issues (2004)
View Complete Journal Contents Listing