1. Introduction
The rapid advance of information technology gives historical data great value and allows it to be reused efficiently (Wu et al., 2005). Limited human capability, however, makes very large stores of information less useful: manual inspection and interpretation are not feasible, since no one can fully understand and benefit from massive databases. The sheer size of historical data is the main factor limiting the benefit that can be expected from it. Typically, these resources also include irrelevant features and noise. Limited resources and learning speed are important factors affecting generalization performance, and growing computational complexity may lead to intractable behavior.

Knowledge discovery (KD) is the process of extracting hidden patterns and clusters from massive stored data; it concerns the elicitation of buried knowledge from massive databases (Liao et al., 2011; Han & Kamber, 2011). It comprises an ensemble of sub-processes that extract the required knowledge efficiently and thereby benefit from this flood of information. Data preprocessing, feature selection, data mining, and evaluation are the main sub-processes of KD.

Data preprocessing is the first step of the KD process. It concerns data preparation, cleaning, discretization, and removal of outliers, as well as the handling of inconsistency. Some preprocessing techniques aim at understanding the distribution of the data, such as measures of central tendency and dispersion; these descriptive data summaries can be presented graphically, for example as bar charts or histograms, which is very helpful for visual inspection of the data (Kotsiantis, 2007). Others, like discretization, prepare the data so that it is easier to understand and manage. Discretization adopts the concept of hierarchy generation and tends to improve the data mining process by reducing the number of values to be handled.

Feature selection is the process of specifying a minimal feature subset containing the most relevant features, those that contribute best to the classification process according to an evaluation criterion (Ladha & Deepa, 2011; Chen et al., 2011; Lavrac, 1999). Zhao et al. (2008) recommended applying feature selection in pattern recognition, machine learning, and data mining, motivated by the considerable support it provides to the subsequent steps of knowledge extraction. Feature selection reduces the dimensionality of the feature space and the computational cost, which drastically decreases storage requirements and running time. Removing noise increases the speed of training and improves predictive accuracy; the retained data are the most relevant, which makes the extracted knowledge easier to understand and facilitates visualization (Escolano et al., 2009; Tsai, 2009).

Concerning the evaluation step, Rough Set (RS) theory has many applications, including the medical domain (Hassanien et al., 2009). Rough set theory, introduced by Pawlak in the early 1980s (Pawlak, 1982), provides a novel approach to knowledge description and the approximation of sets: objects are classified through an approximation space. RS outperforms many other techniques in several respects. First, it needs no external parameters, which gives it an advantage over many other techniques.
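To make the approximation idea concrete, the following minimal Python sketch computes the lower and upper approximations of a target concept under the indiscernibility relation induced by a set of attributes. The toy patient table, attribute names, and target concept are illustrative assumptions, not data from this paper.

```python
from collections import defaultdict

def approximations(objects, attrs, target):
    """Lower/upper approximations of `target` under the
    indiscernibility relation induced by `attrs`.

    objects : dict mapping object id -> dict of attribute values
    attrs   : attributes used to partition the universe
    target  : set of object ids (the concept to approximate)
    """
    # Group objects into equivalence classes: two objects are
    # indiscernible if they agree on every attribute in `attrs`.
    classes = defaultdict(set)
    for oid, row in objects.items():
        classes[tuple(row[a] for a in attrs)].add(oid)

    lower, upper = set(), set()
    for eq_class in classes.values():
        if eq_class <= target:   # wholly inside the concept
            lower |= eq_class
        if eq_class & target:    # overlaps the concept
            upper |= eq_class
    return lower, upper

# Toy patient table (hypothetical): two symptoms per patient.
patients = {
    1: {"fever": "yes", "cough": "yes"},
    2: {"fever": "yes", "cough": "yes"},
    3: {"fever": "yes", "cough": "no"},
    4: {"fever": "no",  "cough": "no"},
}
flu = {1, 3}  # objects known to belong to the concept "flu"
low, up = approximations(patients, ["fever", "cough"], flu)
print(low)  # {3}         -- certainly flu, given these attributes
print(up)   # {1, 2, 3}   -- possibly flu
```

The boundary region `up - low` contains the objects the attributes cannot decide; a concept is rough precisely when this boundary is non-empty.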
Second, it can ascertain the completeness of the data for the classification task, which is especially valuable when information sources are limited or expensive (Pawlak, 1991; Polkowski, 2003; Yu & Liu, 2004). Rough set methods can also classify unknown data based on already gained knowledge, and they can determine whether the available data are sufficient to extract a minimal set of features for classification. The reduct is an important concept in rough set theory, and data reduction is one of its main applications in pattern recognition and data mining.

In this context, this paper proposes an integrated model that combines genetic algorithms for feature selection with rough sets for the classification step of a prediction problem in the biomedical domain. Medical historical data hold buried value: stored patient data can serve as a significant prediction source for unknown cases. The high dimensionality of medical data calls for computational support to analyze and classify it. Computer Aided Diagnosis (CAD) systems offer tremendous aid to the health care field, in areas such as medical diagnosis, medical imaging, computer vision, genomics, and medical computer translation, and they play an important role in the physician's interpretation (Cios & Moore, 2002; Kononenko, 2001). Therefore, the proposed research investigates the efficiency of the combined approach (EI-GA-RS) in improving classification accuracy for disease diagnosis. Four data sets were tested to ascertain the reliability and efficiency of the proposed system.
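As an illustration of how such a hybrid might be wired together, the sketch below uses the rough-set dependency degree as the fitness of a genetic algorithm searching over binary feature masks. The weighting `alpha`, the GA parameters, and all function names are assumptions made for illustration; this is a minimal sketch of a GA-plus-rough-set wrapper in general, not the paper's exact EI-GA-RS formulation.

```python
import random
from collections import defaultdict

def dependency_degree(rows, feature_idx, labels):
    """Rough-set dependency gamma_B(D): the fraction of objects whose
    equivalence class over the chosen features is pure in the decision
    attribute, i.e. lies in the positive region."""
    classes = defaultdict(list)
    for i, row in enumerate(rows):
        classes[tuple(row[j] for j in feature_idx)].append(i)
    pos = sum(len(idx) for idx in classes.values()
              if len({labels[i] for i in idx}) == 1)
    return pos / len(rows)

def fitness(chromosome, rows, labels, alpha=0.9):
    """Reward rough-set dependency, penalize subset size (alpha is an
    assumed trade-off weight, not a value from the paper)."""
    chosen = [j for j, bit in enumerate(chromosome) if bit]
    if not chosen:
        return 0.0
    gamma = dependency_degree(rows, chosen, labels)
    return alpha * gamma + (1 - alpha) * (1 - len(chosen) / len(chromosome))

def evolve(rows, labels, n_feats, pop=30, gens=40, cx=0.8, mut=0.02, seed=0):
    """Minimal GA over binary feature masks: elitism, truncation
    selection, one-point crossover, bit-flip mutation."""
    rng = random.Random(seed)
    popn = [[rng.randint(0, 1) for _ in range(n_feats)] for _ in range(pop)]
    for _ in range(gens):
        scored = sorted(popn, key=lambda c: fitness(c, rows, labels),
                        reverse=True)
        nxt = scored[:2]                             # keep the two best
        while len(nxt) < pop:
            a, b = rng.sample(scored[:pop // 2], 2)  # parents from top half
            if rng.random() < cx:
                pt = rng.randrange(1, n_feats)       # one-point crossover
                child = a[:pt] + b[pt:]
            else:
                child = a[:]
            child = [bit ^ (rng.random() < mut) for bit in child]
            nxt.append(child)
        popn = nxt
    return max(popn, key=lambda c: fitness(c, rows, labels))

# Toy usage: 3 discretized features, binary decision (hypothetical data).
rows = [(1, 0, 2), (1, 1, 2), (0, 0, 1), (0, 1, 1), (1, 0, 1)]
labels = ["sick", "sick", "well", "well", "sick"]
print(evolve(rows, labels, n_feats=3))  # a small mask with full dependency,
                                        # e.g. [1, 0, 0]
```

A wrapper of this kind trades the exhaustive cost of reduct search for the stochastic search of the GA, while the `alpha` weight balances classification quality against subset size.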