Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Hybrid Wrapper/Filter Gene Selection Using an Ensemble of Classifiers and PSO Algorithm

Anouar Boucheham, Mohamed Batouche

Source Title: International Journal of Applied Metaheuristic Computing (IJAMC) 8(2)

DOI: 10.4018/IJAMC.2017040102

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Bioinformatics has grown very quickly for the last 20 years, and it will grow even faster in the future. One of the long-standing open challenges in bioinformatics is biomarker identification and cancer diagnosis from gene expression. In this paper, the authors propose a novel hybrid wrapper/filter feature selection approach to identify the most informative genes for cancer diagnosis, named HWF-GS. It handles selection through two steps. The first one is an iterative filter-based mechanism to generate potential subsets of genes. The second step is the aggregation of the best-selected subsets by means of a wrapper-based consensus process that relies on a particle swarm optimization adapted to feature selection. An ensemble of classifiers (SVM and KNN) is employed to evaluate the selected genes. Experiments on nine publicly available cancer DNA microarray datasets have shown that HWF-GS selects robust signatures with high classification accuracy and competes with and even outperforms other methods in the literature.

Article Preview

Top

1. Introduction

Several advanced genomic technologies developed last year's (DNA microarrays, NGS and RNAseq…), especially during the sequencing the human genome are being very helpful for molecular diagnostics, unveiling new insights into biology and have led to biomarker discovery (Mabert et al., 2014). Certainly, the use of molecular biomarkers will impact different areas of clinical practice and will give precious additional information for tumor diagnosis/prognosis and finally, contribute to personalized therapy of cancer. The ideal biomarker for cancer would have applications in (a) classification of tumors, (b) prognosis of disease progression, (c) prediction of response to therapy, (d) monitoring of response to therapy and serve as a target for drug development (Stoss & Henkel, 2004).

Gene expression microarray is used to survey and measure genes activity in healthy and diseased tissues through various populations. It can measure and record the expression level of thousands of genes simultaneously in different samples types and specific experimental conditions (referred to as a sample) (Bolon-Canedo et al., 2014). In cancer examination these technologies have been broadly investigated for classification of different types of tumors and make the accurate prediction of cancer possible and easier using bioinformatics tools in machine learning and pattern recognition (Wu et al., 2012).

As a general observation, there are several problems studied in genes expression microarrays (GEM). All of them can be divided into three classes namely the class prediction which uses supervised machine learning approaches, the class discovery which uses unsupervised machine learning approaches (Banu & Andrews, 2015) and the class gene comparison that uses machine learning approaches in general (Golub et al., 1999). The direct application of these methods on high-dimensional data is usually ineffective (Wu et al., 2012). Since gene expression data consists of a high number of features (genes) and small sample sizes. However, there are a large number of irrelevant, redundant and noisy genes. Only a small set of genes contains useful biological interpretations and finally gives high accuracy for cancer diagnosis. In addition, the presence of many features affects not only the performance of prediction but also the computational time of learning algorithms (Bolon-Canedo et al., 2014).

To avoid the problem of the curse of dimensionality it becomes then necessary to select a small subset of features/genes that can separate healthy patients from cancer patients or in more general terms, genes which are relevant, non-redundant and discriminative for a particular genetic disease. These genes are called biomarkers, informative genes, parsimonious genes or differentially expressed genes.

Therefore, we require dimensionality reduction techniques, which identify a small set of genes that represent the most discriminant information of the original ensemble of genes to achieve better learning performance. This step plays a central role in the field of machine learning and more specifically in the classification task and allows many pros (Krishnapuram et al., 2004) (a) reduce the computational cost and storage space of the classification model, by constructing them using only a small subset of the original set of genes, (b) Improve significantly the intelligibility of the classifier, and maximize the prediction performance of a classification algorithm and (c) reduce the risk of ‘‘overfitting’’ when the number of samples is small. Subsequently, the prediction result of classifiers is more reliable, robust and can help doctors to take appropriate treatment solution which provide patients with better treatment or response to therapy, especially when the disease has been identified at its early time (Osl et al., 2012).

Complete Article List

Search this Journal:

Reset

Volume 15: 1 Issue (2024)

Volume 14: 1 Issue (2023)

Volume 13: 4 Issues (2022): 2 Released, 2 Forthcoming

Volume 12: 4 Issues (2021)

Volume 11: 4 Issues (2020)

Volume 10: 4 Issues (2019)

Volume 9: 4 Issues (2018)

Volume 8: 4 Issues (2017)

Volume 7: 4 Issues (2016)

Volume 6: 4 Issues (2015)

Volume 5: 4 Issues (2014)

Volume 4: 4 Issues (2013)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Hybrid Wrapper/Filter Gene Selection Using an Ensemble of Classifiers and PSO Algorithm

Abstract

1. Introduction

Complete Article List