Predicting Software Abnormal State by using Classification Algorithm

Predicting Software Abnormal State by using Classification Algorithm

Yongquan Yan, Ping Guo
DOI: 10.4018/978-1-7998-3016-0.ch050
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

Software aging, also called smooth degradation or chronics, has been observed in a long running software application, accompanied by performance degradation, hang/crash failures or both. The key for software aging problem is how to fast and accurately detect software aging occurrence, which is a hard work due to the long delay before aging appearance. In this paper, two problems about software aging prediction are solved, which are how to accurately find proper running software system variables to represent system state and how to predict software aging state in a running software system with a minor error rate. Firstly, the authors use proposed stepwise forward selection algorithm and stepwise backward selection algorithm to find a proper subset of variables set. Secondly, a classification algorithm is used to model software aging process. Lastly, t-test with k-fold cross validation is used to compare performance of two classification algorithms. In the experiments, the authors find that their proposed method is an efficient way to forecast software aging problems in advance.
Chapter Preview
Top

1. Introduction

It has been reported that 9% of overall business revenues (Bixby, 2010) is affected by application performance problems. Taking the most popular web service as an example, when the web server runs in a very long time, the system cannot respond quickly for the upcoming request. Even worse, it may not respond any requests even if the workload level is modest. In generally, this performance degradation does not show an instantaneous failure. This performance degradation phenomenon may last a few days or even several weeks, if there is no any manual intervention. This phenomenon of performance degradation, unplanned software outages, or suddenly failure is called software aging (Cotroneo, Natella, Pietrantuono, & Russo, 2014), smooth degradation (Alencar, Santos, Santana, & Fernandes, 2014), or chronics (Kavulya et al., 2012). In this work, we call it as software aging.

In fact, software aging is a consequence of problems with the software itself. When a fault is activated, an error will appear as part of the internal state of the software system. When multiple errors sufficiently accumulate and the proliferation of errors reaches the service interface of the software system, the system will incur performance degradation, or even failure. Fig. 1 gives chain of software failure.

Figure 1.

Chain of software failure

978-1-7998-3016-0.ch050.f01

In order to counteract problems caused by software aging, Huang, Kintala, Kolettis, and Fulton (1995) proposed the technique of software rejuvenation, including occasionally ceasing software application, removing accumulated error environments and then rebooting application. However, execution of software rejuvenation can cause both direct and indirect costs during the downtime of the system. In order to minimize the loss caused by rejuvenation, the implementation of software rejuvenation need be executed based on the current state of the system. There are two core problems which need be solved to identify system state and execute rejuvenation.

  • Problem 1:

    How to accurately find proper system and application variables to represent system state for running software.

  • Problem 2:

    How to predict software aging state in a running software system with a minor error rate.

This paper, which gives a practice framework for forecasting software aging using a classification algorithm, focuses on these two problems. Firstly, we propose two feature selection algorithms to choose a subset of variables of operating system and application system. Secondly, a classification algorithm called support vector machine (SVM) is carefully analyzed and used to model the software aging process through the collected variables of an IIS web server that is a running commercial server. Lastly, statistical analysis is used to analyze the performance between SVM and artificial neural network (ANN).

Complete Chapter List

Search this Book:
Reset