A Comparative Analysis of Rough Set Based Intelligent Techniques for Unsupervised Gene Selection

P. K. Nizar Banu, H. Hannah Inbarani

Source Title: International Journal of System Dynamics Applications (IJSDA) 2(4)

DOI: 10.4018/ijsda.2013100103

Abstract

As microarray databases grow in dimension and complexity, identifying the most informative genes becomes a challenging task. The difficulty is often related to the huge number of genes relative to the very few samples available. Research in medical data mining addresses this problem by applying techniques from data mining and machine learning to microarray datasets. In this paper, Unsupervised Tolerance Rough Set based Quick Reduct (U-TRS-QR), a feature selection algorithm that extends existing equivalence-based rough sets to unsupervised learning, is proposed. Genes selected by the proposed method lead to considerably improved class predictions in extensive experiments on two gene expression datasets: Brain Tumor and Colon Cancer. The results indicate consistent improvement across 12 classifiers.

Introduction

Dimensionality reduction has received considerable attention in microarray data analysis as a way to select the most informative genes by removing the least informative ones. Feature extraction and feature selection are the two main methods used for dimensionality reduction. Feature extraction transforms all or part of the features into a lower-dimensional space, whereas feature selection selects a subset of the original features. In the analysis of gene expression datasets, feature selection has a significant advantage over feature extraction (Assaf, 2009). Feature selection refers to the process of selecting the highly informative features that are most effective in characterizing a given field. It addresses the specific task of finding a subset of the given features that is useful for solving the domain problem without disrupting the underlying meaning of the selected features. Many criteria can be employed to measure the similarity among features (Mitra et al., 2002).

Feature selection methods can be divided into supervised and unsupervised. Genetic algorithms have been used successfully as an efficient method of supervised feature selection for a high-dimensional spectral dataset (Cho et al., 2008; Davis et al., 2006). Supervised feature selection problems have also been formulated as a multiple hypothesis testing procedure that controls the false discovery rate (Mei et al., 2009; Kim et al., 2008). Compared with supervised and unsupervised feature extraction and supervised feature selection, few attempts have been made to identify the important features by using unsupervised feature selection methods (Mao, 2005). Unsupervised feature selection methods are usually divided into three categories: wrapper, filter, and hybrid approaches (Kim & Gao, 2006). Dy and Brodley (2000) introduced a wrapper approach that uses the Expectation Maximization (EM) clustering algorithm. Hastie et al. (2000) developed a gene-shaving method that uses the first principal component to identify the best subsets of features with large variations. Ding (2003) proposed a two-way ordering approach in which relevant genes are selected based on their similarity information. PCA is a widely used unsupervised feature extraction method, in that the process depends solely on the input variables and does not take into account information from the output variable (Jolliffe, 2002).
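The distinction between feature selection and feature extraction drawn above can be illustrated with a minimal sketch. It is not any of the cited methods: the variance-based filter, the hand-picked weight vector, the toy expression matrix, and all function names are assumptions chosen only to show that selection keeps original gene columns unchanged while extraction produces new features that mix them.

```python
from statistics import pvariance


def select_by_variance(rows, k):
    """Feature selection (illustrative filter): indices of the k highest-variance columns."""
    n_cols = len(rows[0])
    variances = [pvariance([row[c] for row in rows]) for c in range(n_cols)]
    ranked = sorted(range(n_cols), key=lambda c: variances[c], reverse=True)
    return sorted(ranked[:k])  # report the chosen columns in their original order


def keep_columns(rows, cols):
    """Return the data restricted to the selected original columns."""
    return [[row[c] for c in cols] for row in rows]


def extract(rows, weights):
    """Feature extraction: each new feature is a weighted sum of all original features."""
    return [[sum(w * x for w, x in zip(ws, row)) for ws in weights] for row in rows]


# Hypothetical expression matrix: 3 samples (rows) x 3 genes (columns).
expr = [
    [1.0, 10.0, 5.0],
    [1.1, 20.0, 5.0],
    [0.9, 30.0, 5.1],
]

selected = select_by_variance(expr, 2)
print("selected gene indices:", selected)
print("selected data:", keep_columns(expr, selected))
print("extracted feature:", extract(expr, [[0.5, 0.5, 0.0]]))
```

The selected columns still hold measured expression values of identifiable genes, which is why selection preserves the meaning of the features; the extracted feature is a blend that no longer corresponds to a single gene.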

This paper studies and analyses gene expression datasets that contain a large number of features (genes). The majority of these genes are not relevant to describing the problem and in turn degrade classification performance. Recent approaches to feature selection include probabilistic neural networks (Huang, 2004), Support Vector Machines (Cao et al., 2003), neuro-fuzzy computing (Chu et al., 2004), neuro-genetic hybridization (Karzynski et al., 2003), a sparse unsupervised dimensionality reduction method (Dou et al., 2010), Unsupervised Quick Reduct (Velayutham & Thangavel, 2011), and Unsupervised Relative Reduct (Velayutham & Thangavel, 2011). The idea behind feature selection is to retain the genes that play a major role in arriving at a decision about the output classes.

Gene expression datasets consist of real values that express the functional value of every gene. Most existing supervised and unsupervised feature selection algorithms discretize or normalize the original values, which results in some loss of information. Traditional rough sets are also incapable of dealing with real-valued datasets. This paper addresses the problem by introducing an extension of rough sets called Tolerance Rough Sets (TRS). With TRS, the gene expression values are preserved as they are. In this paper, the terms features and genes are used interchangeably.
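To make the idea of working directly on real values concrete, the following sketch shows one common formulation of a tolerance relation from the tolerance rough set literature: two samples are treated as similar on a gene when their range-normalized difference is small enough. Whether the paper uses exactly this similarity measure is not stated in this preview, so the formula, the threshold `tau`, and the function names are assumptions.

```python
def attribute_similarity(values, i, j):
    """Similarity of samples i and j on one gene: 1 - |a(i) - a(j)| / (max - min)."""
    lo, hi = min(values), max(values)
    if hi == lo:  # constant gene: every pair of samples is indistinguishable
        return 1.0
    return 1.0 - abs(values[i] - values[j]) / (hi - lo)


def tolerance_classes(data, features, tau):
    """For each sample, the set of samples similar to it on every listed gene.

    data     : list of samples, each a list of real-valued expression levels
    features : indices of the genes under consideration
    tau      : similarity threshold in [0, 1] (assumed parameter)
    """
    columns = {f: [row[f] for row in data] for f in features}
    n = len(data)
    classes = []
    for i in range(n):
        similar = {
            j
            for j in range(n)
            if all(attribute_similarity(columns[f], i, j) >= tau for f in features)
        }
        classes.append(similar)
    return classes


# Hypothetical data: 3 samples, 1 gene, no discretization applied.
samples = [[0.0], [0.1], [1.0]]
print(tolerance_classes(samples, features=[0], tau=0.8))
```

Unlike the equivalence classes of traditional rough sets, these classes may overlap and do not require the expression values to be discretized first; lowering `tau` makes the classes larger, and `tau = 1` only groups samples with identical values.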

The rest of the paper is organized as follows. The first section covers the research background, followed by a brief introduction to tolerance rough sets. The proposed gene selection algorithm is then described with a worked example. The experimental results are presented, discussed, and analyzed. Finally, concluding remarks and future work are presented.
