HONNs with Extreme Learning Machine to Handle Incomplete Datasets

HONNs with Extreme Learning Machine to Handle Incomplete Datasets

Shuxiang Xu (University of Tasmania, Australia)
DOI: 10.4018/978-1-4666-2175-6.ch013
OnDemand PDF Download:
$30.00
List Price: $37.50

Abstract

An Extreme Learning Machine (ELM) randomly chooses hidden neurons and analytically determines the output weights (Huang, et al., 2005, 2006, 2008). With the ELM algorithm, only the connection weights between hidden layer and output layer are adjusted. The ELM algorithm tends to generalize better at a very fast learning speed: it can learn thousands of times faster than conventionally popular learning algorithms (Huang, et al., 2006). Artificial Neural Networks (ANNs) have been widely used as powerful information processing models and adopted in applications such as bankruptcy prediction, predicting costs, forecasting revenue, forecasting share prices and exchange rates, processing documents, and many more. Higher Order Neural Networks (HONNs) are ANNs in which the net input to a computational neuron is a weighted sum of products of its inputs. Real life data are not usually perfect. They contain wrong, incomplete, or vague data. Hence, it is usual to find missing data in many information sources used. Missing data is a common problem in statistical analysis (Little & Rubin, 1987). This chapter uses the Extreme Learning Machine (ELM) algorithm for HONN models and applies it in several significant business cases, which involve missing datasets. The experimental results demonstrate that HONN models with the ELM algorithm offer significant advantages over standard HONN models, such as faster training, as well as improved generalization abilities.
Chapter Preview
Top

Introduction

Artificial Neural Networks (ANNs) have been providing significant benefits in many business applications. They have been actively used for applications such as bankruptcy prediction, predicting costs, forecast revenue, processing documents, and more (Kurbel, et al., 1998; Atiya, et al., 2001; Baesens, et al., 2003). Almost any neural network model would fit into at least one business area or financial analysis. Traditional statistical methods have been used for business applications with many limitations (Azema-Barac, et al., 1997; Blum, et al., 1991; Park, et al., 1993).

This chapter addresses using ANNs for handling business data for the following reasons. First, although usually considered a black-box approach, ANNs are a natural technology for data mining. ANNs are non-linear models that resemble biological neural networks in structure and learn through training. ANNs present a model based on the massive parallelism and the pattern recognition and prediction abilities of the human brain. ANNs learn from examples in a way similar to how the human brain learns. Then ANNs take complex and noisy data as input and make educated guesses based on what they have learned from the past, like what the human brain does. Given the requirements of data mining within large databases of historical data, ANNs are a natural technology for this application (McCue, et al., 2007). Next, ANNs (especially higher order ANNs) are able to handle incomplete or noisy data (Peng, et al., 2007; Wang, 2003). Databases usually contain noise in the form of inaccuracies and inconsistencies. Lack of data validation procedures may allow a user to enter incorrect data. Data can also become corrupt during migration from one system to another. Missing data is a common problem especially when data is collected from many different sources. Finally, ANNs hold superior predictive capability, compared with other data mining approaches (Xu, 2009; Zhang, et al., 2007; Fulcher, et al., 2006; Browne, et al., 2004; Kohonen, et al., 2000). The predictive accuracy of a data mining approach strongly influences its effectiveness and popularity. Higher predictive accuracy with real data is an obviously desirable feature.

Complete Chapter List

Search this Book:
Reset