Linguistic Rule Extraction from Support Vector Machine Classifiers

Linguistic Rule Extraction from Support Vector Machine Classifiers

Xiuju Fu (Institute of High Performance Computing, Singapore), Lipo Wang (Nanyang Technological University, Singapore), GihGuang Hung (Institute of High Performance Computing, Singapore) and Liping Goh (Institute of High Performance Computing, Singapore)
Copyright: © 2007 |Pages: 15
DOI: 10.4018/978-1-59904-271-8.ch010
OnDemand PDF Download:
$37.50

Abstract

Classification decisions from linguistic rules are more desirable compared to complex mathematical formulas from support vector machine (SVM) classifiers due to the explicit explanation capability of linguistic rules. Linguistic rule extraction has been attracting much attention in explaining knowledge hidden in data. In this chapter, we show that the decisions from an SVM classifier can be decoded into linguistic rules based on the information provided by support vectors and decision function. Given a support vector of a certain class, cross points between each line, which is extended from the support vector along each axis, and an SVM decision hyper-curve are searched first. A hyper-rectangular rule is derived from these cross points. The hyper-rectangle is tuned by a tuning phase in order to exclude those out-class data points. Finally, redundant rules are merged to produce a compact rule set. Simultaneously, important attributes could be highlighted in the extracted rules. Rule extraction results from our proposed method could follow SVM classifier decisions very well. We compare the rule extraction results from SVM with RBF kernel function and linear kernel function. Experiment results show that rules extracted from SVM with RBF nonlinear kernel function are with better accuracy than rules extracted from SVM with linear kernel function. Comparisons between our method and other rule extraction methods are also carried out on several benchmark data sets. Higher rule accuracy is obtained in our method with fewer number of premises in each rule.

Complete Chapter List

Search this Book:
Reset
Table of Contents
Preface
David Taniar
Chapter 1
Torben Pedersen, Jesper Thorhauge, Søren Jespersen
Enormous amounts of information about Web site user behavior are collected in Web server logs. However, this information is only useful if it can be... Sample PDF
Combining Data Warehousing and Data Mining Techniques for Web Log Analysis
$37.50
Chapter 2
Lixin Fu
In high-dimensional data sets, both the number of dimensions and the cardinalities of the dimensions are large and data is often very sparse, that... Sample PDF
Computing Dense Cubes Embedded in Sparse Data
$37.50
Chapter 3
Karlton Sequeira, Mohammed J. Zaki
Very often, related data may be collected by a number of sources, which may be unable to share their entire datasets for reasons like... Sample PDF
Exploring Similarities Across High-Dimensional Datasets
$37.50
Chapter 4
Irene Ntoutsi, Nikos Pelekis, Yannis Theodoridis
Many patterns are available nowadays due to the widespread use of knowledge discovery in databases (KDD), as a result of the overwhelming amount of... Sample PDF
Pattern Comparison in Data Mining: A Survey
$37.50
Chapter 5
Fedja Hadzic, Tharam Dillon, Henry Tan, Ling. Feng, Elizabeth Chang
Association rule mining is one of the most popular pattern discovery methods used in data mining. Frequent pattern extraction is an essential step... Sample PDF
Mining Frequent Patterns Using Self-Organizing Map
$37.50
Chapter 6
Mafruz Ashrafi, David Taniar, Kate Smith
Association rule mining is one of the most widely used data mining techniques. To achieve a better performance, many efficient algorithms have been... Sample PDF
An Efficient Compression Technique for Vertical Mining Methods
$37.50
Chapter 7
Alex Freitas, André Carvalho
In machine learning and data mining, most of the works in classification problems deal with flat classification, where each instance is classified... Sample PDF
A Tutorial on Hierarchical Classification with Applications in Bioinformatics
$37.50
Chapter 8
Daniel Wu, Xiaohua Hu
In this chapter, we report a comprehensive evaluation of the topological structure of protein-protein interaction (PPI) networks, by mining and... Sample PDF
Topological Analysis and Sub-Network Mining of Protein-Protein Interactions
$37.50
Chapter 9
Yong Shi, Yi Peng, Gang Kou, Zhengxin Chen
This chapter provides an overview of a series of multiple criteria optimization-based data mining methods, which utilize multiple criteria... Sample PDF
Introduction to Data Mining Techniques via Multiple Criteria Optimization Approaches and Applications
$37.50
Chapter 10
Xiuju Fu, Lipo Wang, GihGuang Hung, Liping Goh
Classification decisions from linguistic rules are more desirable compared to complex mathematical formulas from support vector machine (SVM)... Sample PDF
Linguistic Rule Extraction from Support Vector Machine Classifiers
$37.50
Chapter 11
Graph-Based Data Mining  (pages 291-307)
Wenyuan Li, Wee-Keong Ng, Kok-Leong Ong
With the most expressive representation that is able to characterize the complex data, graph mining is an emerging and promising domain in data... Sample PDF
Graph-Based Data Mining
$37.50
Chapter 12
Richi Nayak
Web services have recently received much attention in businesses. However, a number of challenges such as lack of experience in estimating the... Sample PDF
Facilitating and Improving the Use of Web Services with Data Mining
$37.50
About the Authors