Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Population-Based Feature Selection for Biomedical Data Classification

Seyed Jalaleddin Mousavirad, Hossein Ebrahimpour-Komleh

Source Title: Data Mining and Analysis in the Engineering Field

DOI: 10.4018/978-1-4666-6086-1.ch016

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Classification of biomedical data plays a significant role in prediction and diagnosis of disease. The existence of redundant and irrelevant features is one of the major problems in biomedical data classification. Excluding these features can improve the performance of classification algorithm. Feature selection is the problem of selecting a subset of features without reducing the accuracy of the original set of features. These algorithms are divided into three categories: wrapper, filter, and embedded methods. Wrapper methods use the learning algorithm for selection of features while filter methods use statistical characteristics of data. In the embedded methods, feature selection process combines with the learning process. Population-based metaheuristics can be applied for wrapper feature selection. In these algorithms, a population of candidate solutions is created. Then, they try to improve the objective function using some operators. This chapter presents the application of population-based feature selection to deal with issues of high dimensionality in the biomedical data classification. The result shows that population-based feature selection has presented acceptable performance in biomedical data classification.

Chapter Preview

Top

Introduction

Data mining or knowledge discovery is a computational process of extracting hidden knowledge in large databases. The goal of data mining process is to extract useful information from a dataset. Figure 1 illustrates the phases of a data mining process. The first step in data mining process is to understanding of the problem. In the next step, data collect and prepare. In this step, data is cleaned from outlier instances or missing data and dataset reduces to only variables that are useful in a given data mining process. In the third step, a mining model or model is built. The quality of a model can evaluate using a number of the techniques. The last step in the data mining process is to deploy the models to a real environment.

Figure 1.

The data mining process

Data mining techniques have been successfully used in various biomedical domains, for example the detection of tumors, the diagnosis of cancers and other diseases. One of the main challenge in biomedical data mining and analysis is the so called “curse of dimensionality”. Especially the biomedical data are presented by relatively few instances and exhibited in a high dimensional feature space(Peng, Wu, & Jiang, 2010). Feature selection, a process in data transformation phase, reduces the number of features, removes irrelevant, redundant and misleading features, which leads to expediting learning algorithm and improves predictive performance. Feature selection algorithms are divided into three categories: wrapper methods that uses the learning algorithms to evaluate the usefulness of features, filter methods that evaluate features according to the statistical characteristics of the data, and embedded methods that feature selection embed in the learning algorithm. Population based metaheuristics such as genetic algorithm, particle swarm optimization, Imperialist competitive algorithm, artificial bee algorithm, Ant colony optimization, and leap frog optimization have been considered as effective wrapper feature selection approach. These metaheuristics are based on a population of solutions and an iterative procedure. At each iteration, they try to find a better solution than previous solutions using some operators. Feature selection algorithms have been successfully applied in various biomedical domains. A. Antoniadis et al, (2003) presented a statistical feature reduction approach for the classification of tumors. I. Guyan et al, (2002) address the problem of selection of a small subset of genes from broad patterns of gene expression data, recorded on DNA micro-arrays. Using available training examples from cancer and normal patients, they build a classifier suitable for genetic diagnosis, as well as drug discovery. In another work, wrapper approaches was applied for gene selection(Blanco, Larrañaga, Inza, & Sierra, 2004). Y. Peng et al. (2010) presents a novel feature selection approach to deal with issues of high dimensionality in biomedical data classification. The approach proposed in this paper integrated filter and wrapper methods into a sequential search procedure with the aim to improve the classification performance of the features selected. In this chapter, we focus on application of population based feature selection algorithms for biomedical data classification. To this purpose, four population based metaheuristics are considered: genetic algorithm, particle swarm optimization, artificial bee algorithm, and imperialist competitive algorithm. We also analyze the efficiency of this approach on four biomedical dataset: Wisconsin Diagnostic Breast Cancer, Wisconsin Prognostic Breast Cancer, SPECTF heart dataset, and Hepatitis diagnosis.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Population-Based Feature Selection for Biomedical Data Classification

Abstract

Introduction

Complete Chapter List