Improved Data-Driven Root Cause Analysis in a Fog Computing Environment

Improved Data-Driven Root Cause Analysis in a Fog Computing Environment

Chetan M. Bulla, Mahantesh N. Birje
Copyright: © 2022 |Pages: 28
DOI: 10.4018/IJIIT.296238
Article PDF Download
Open access articles are freely available for download

Abstract

Internet of things (IoT) and cloud computing are used in many real-time smart applications such as smart healthcare, smart traffic, smart city, and smart industries. Fog computing has been introduced as an intermediate layer to reduce communication delay between cloud and IoT devices. To improve the performance of these smart applications, a predictive maintenance system needs to adopt an anomaly detection and root cause analysis model that helps to resolve anomalies and avoid such anomalies in the future. The state-of-the-art work on data-driven root cause analysis suffers from scalability, accuracy, and interpretability. In this paper, a multi-agent-based improved data-driven root cause analysis technique is introduced to identify anomalies and their root causes. The deep learning model LSTM autoencoder is used to find the anomalies, and a game theory approach called SHAP algorithm is used to find the root cause of the anomaly. The evaluation result shows the improvement in accuracy and interpretability as compared to state-of-the-art works.
Article Preview
Top

Introduction

Internet of Things (IoT) and cloud computing are used in many real-time smart applications such as smart health-care, smart traffic, smart industries and smart city. To reduce communication delay between cloud and IoT Devices, Cisco introduced fog computing (Yousefpour et al., 2019) as intermediate computing infrastructure. To improve performance of fog computing, a monitoring system is required that keep track of all the activities and behavior of fog infrastructure with associated IoT Devices (Birje & Bulla, 2019). Monitoring system is also used in predictive maintenance system to detect and predict faulty or deviating behavior of fog nodes and IoT devices (Birje & Manvi, 2011). The anomaly detection and root cause analysis models play a vital role in improving performance of smart applications such as smart industries and smart healthcare system.

The anomaly detection techniques find unknown pattern or outliers in unlabeled data when something unusual occurs or when condition deviates from normal behavior. Root cause Analysis (RCA) is a systematic process to understand reason for anomalies or faults that helps operator to diagnose the problem and solve the issue within short period of time (Singh, 2020). The root cause analysis allows end users to accurately identify anomalies and its root cause to avoid failures that may occur in future. The following requirements have to be satisfied for effective root cause analysis (Steenwinckel et al., 2021) i) Accuracy: the RCA should give accurate cause of the problem and reduce false positives, ii) Minimal human effort: the RCA work automatically without human involvement, iii) Context aware: the RCA should provide context to increase the performance, iv) Adaptive: the RCA system should be capable of adapting detection behavior of changing conditions, v) Interpretable: The user of system must easily understand the failures and its cause to plan appropriate action. vi) Scalable: The RCA must work efficiently even with huge data.

There are three main techniques (Steenwinckel et al., 2021, Solé et al., 2017) to find root cause analysis in fog computing infrastructure: i) data-driven model: it identifies anomaly and its root cause based on the unusual pattern using machine learning or deep learning approach ii) Knowledge-driven techniques: it works on expert knowledge iii) Hybrid: it combines the both data-driven and knowledge-driven technique to meet the requirement of RCA. The above techniques suffer from few of the critical issues such as data-driven technique suffers from interpretability and accuracy, knowledge-driven technique is unable to find new types of faults and its causes and hybrid model consumes more computational resources.

In the new era of smart industries, Anomaly detection and root cause analysis play a vital role in improve the performance of machine and reduce maintainance cost. Anomaly detection identify abnormal behavior of production machine and outliers /quality deviation in the production line. The root cause analysis of anomaly may help to resolve tor fix the issue. The existing the anomaly detection and root cause analysis models does not meet all the requirement and are computationaly expansive. Therefore, there is a need of an effective root cause analysis model which provides high interpretability and accuracy with minimum overhead. The existing works have focused on data-driven root cause analysis considering the above mentioned techniques, but failed to meet requirements such as accuracy, scalability and interpretability. Also, no work has been carried out highlighting the importance of views of multi-agent in predictive maintainance system. Hence, this paper proposes a multi-agent based data-driven root cause analysis model using SHAP algorithm. The main objectives of proposed root cause analysis model are: first, increase the accuracy and reduce the false positives in detecting anomalies in fog computing environment. second, 2) To develop a light weight root cause analysis that fulfill all requirments of root cause analysis with reduced the overhead.

Complete Article List

Search this Journal:
Reset
Volume 20: 1 Issue (2024)
Volume 19: 1 Issue (2023)
Volume 18: 4 Issues (2022): 3 Released, 1 Forthcoming
Volume 17: 4 Issues (2021)
Volume 16: 4 Issues (2020)
Volume 15: 4 Issues (2019)
Volume 14: 4 Issues (2018)
Volume 13: 4 Issues (2017)
Volume 12: 4 Issues (2016)
Volume 11: 4 Issues (2015)
Volume 10: 4 Issues (2014)
Volume 9: 4 Issues (2013)
Volume 8: 4 Issues (2012)
Volume 7: 4 Issues (2011)
Volume 6: 4 Issues (2010)
Volume 5: 4 Issues (2009)
Volume 4: 4 Issues (2008)
Volume 3: 4 Issues (2007)
Volume 2: 4 Issues (2006)
Volume 1: 4 Issues (2005)
View Complete Journal Contents Listing