Comparative Study of Classification Models with Genetic Search Based Feature Selection Technique

Comparative Study of Classification Models with Genetic Search Based Feature Selection Technique

Sanat Kumar Sahu, A. K. Shrivas
Copyright: © 2018 |Pages: 11
DOI: 10.4018/IJAEC.2018070101
(Individual Articles)
No Current Special Offers


Feature selection plays a very important role to retrieve the relevant features from datasets and computationally improves the performance of a model. The objective of this study is to evaluate the most important features of a chronic kidney disease (CKD) dataset and diagnose the CKD problem. In this research work, the authors have used a genetic search with the Wrapper Subset Evaluator method for feature selection to increase the overall performance of the classification model. They have also used Bayes Network, Classification and Regression Tree (CART), Radial Basis Function Network (RBFN) and J48 classifier for classification of CKD and non-CKD data. The proposed genetic search based feature selection technique (GSBFST) selects the best features from CKD dataset and compares the performance of classifiers with proposed and existing genetic search feature selection techniques (FSTs). All classification models give the better result with proposed GSBFST as compared to without FST and existing genetic search FSTs.
Article Preview

Literature Survey

This part consists of reviews of various technical and related articles on machine learning techniques applied to predict kidney disease.

The two types (Polat et al., 2017) of feature selection methods, i.e., wrapper and filter approach have been used to diagnose CKD. In wrapper approach, a classifier subset evaluator with the greedy stepwise search engine and wrapper subset evaluator with the Best First Search(BFS) engine were used. In filter approach, correlation feature selection subset evaluator with a greedy stepwise search engine and filtered subset evaluator with the BFS engine were used. Results showed that the Support Vector Machine (SVM) classifier has used filtered subset evaluator with the BFS engine feature selection method gives a higher accuracy rate (98.5%) in the diagnosis of CKD.

A number of different ML classifiers (Subas et al., 2017) Artificial Neural Network(ANN), SVM, k-Nearest Neighbor, C4.5 and Random Forest (RF) have experiment validated to a real data set, taken from the UCI Machine Learning Repository. The result reveals that the random forest (RF) classifier reaches the maximum performances on the classification of CKD.

Complete Article List

Search this Journal:
Volume 14: 1 Issue (2023): Forthcoming, Available for Pre-Order
Volume 13: 4 Issues (2022): 2 Released, 2 Forthcoming
Volume 12: 4 Issues (2021)
Volume 11: 4 Issues (2020)
Volume 10: 4 Issues (2019)
Volume 9: 4 Issues (2018)
Volume 8: 4 Issues (2017)
Volume 7: 4 Issues (2016)
Volume 6: 4 Issues (2015)
Volume 5: 4 Issues (2014)
Volume 4: 4 Issues (2013)
Volume 3: 4 Issues (2012)
Volume 2: 4 Issues (2011)
Volume 1: 4 Issues (2010)
View Complete Journal Contents Listing