Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Optimized Clustering Techniques with Special Focus to Biomedical Datasets

Anusuya S. Venkatesan

Source Title: Biomedical Engineering: Concepts, Methodologies, Tools, and Applications

DOI: 10.4018/978-1-5225-3158-6.ch049

OnDemand:

(Individual Chapters)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

The clinical data including clinical test results, MRI images and drug responses of patients are documented and analyzed with machine learning and data mining tools. The scale and complexity of these datasets is a big challenge to machine learning and data mining community as the data is of mixed type. The extraction of meaningful or desired information from these datasets provides knowledge in decision making process which in turn helps for the diagnosis and treatment of the diseases. Biomedical datasets are a collection of data with diverse types as it involves images, clinical studies, statistical reports etc. The recent researches have focused on different clustering and classification methods to manage and analyze the biomedical datasets. The objective of this chapter is to cluster or classify the patterns of interest from Brain MRI images, Liver disorder and Breast cancer datasets using efficient clustering methodologies. Among the different algorithms in data mining for clustering, classification, visualization and interpretation, K Means, Fuzzy C Means and Neural Networks(NN) are frequently used for clustering and classification of biomedical datasets. The performance of these methods are greatly influenced by the initialization of K value and its convergence speed. This chapter discusses about FCM and K Means clustering methods and its optimization with meta heuristics such as Particle Swarm Optimization (PSO) and Quantum Particle Swarm Optimization (QPSO). The experimental section of this paper exhibits analysis in terms of Intra cluster distances, elapsed time and Davis Bouldin Index (DBI).

Chapter Preview

Top

Introduction

Data mining is an interdisciplinary area involves artificial intelligence, soft computing, database system etc. The tools of data mining infer information from the databases and this information converted to knowledge of patterns and relationships. The relationships among data are referred as Classes, Clusters, Associations and Sequential Patterns. Data clustering have been applied in the area of data mining and machine learning. The specific applications include statistics (McLachlan et al., 1997), bioinformatics (He et al. 2006), machine learning (Ethem Alpaydin., 2004) exploratory data analysis, image segmentation, security, medical image analysis, web handling and mathematical programming (Pyle 1999; Panov et al. 2008). The process of clustering split the data into homogeneous and inhomogeneous classes with respect to similarity between data.

Clusters are formed by finding the distance between data points. The existing tools explore the data and help to visualize it in different models. The cluster representation is one of the widely accepted exploratory models to analyze the data with the different levels of observation. Clustering is applied to biomedical datasets to understand the characteristics of bio information and to find interesting patterns associated to prior information. Most of the bio medical datasets have inherent noise and inconsistency, sometimes mixed with semantic information and experimental results. Hence, generating quality clusters on biomedical datasets is a challenging task. The role of clustering in biomedical datasets is to derive meaningful information which assists pathologists on decision making.

In case of medical image segmentation, the system works by segmenting the whole image into multiple segments and extracts only the specific region for investigation. In image segmentation, the intensities of all pixels within a homogenous cluster are similar but the intensities of inhomogeneous clusters are different from homogeneous one. Medical image analysis is mainly dependent on effective image segmentation to extract suspicious regions from complex medical images (Neeraj et al., 2010).

Clustering is categorized as an optimization problem to satisfy the criteria of minimizing the similarity within a cluster and maximizing the dissimilarity between clusters. Table 1 shows the different distance measures used to find the distance between data points. Some clustering techniques use heuristic algorithms (Bandyopadhyay S, 2002)(Das S et al.,2008)(S. Ouadfel et al.,2010) to obtain centres for clusters. The objectives of using Optimization techniques with clustering techniques are 1) to find global optima 2) to enforce robustness against initialization 3) to improve the partitioning quality 4) to deal with unknown and known number of clusters 5) to speed up convergence etc. Clusters are represented in different forms such as Connectivity models, Centroid models, Distribution models, Density models and Graph-based models.

•
Hierarchical Clustering (Ward et al, 1963): Is an example for connectivity models which fall into two types, they are Agglomerative and Divisive. Agglomerative is bottom up approach while Divisive is top down approach.
•
Centroid Models: Clusters are formed based on the centroids found on the original data, the number of centroids is fixed to K, the clusters are formed by assigning the data items to the nearest cluster centroid. Example: K Means (Hartigan et al, 2009).
•
Density Based Models: Martin Ester et al. (1996) defines clusters as connected dense regions in the data space. Example: Density Based Spatial Clustering of Applications with Noise (DBSCAN) and Ordering Points to Identify the Clustering Structure (OPTICS).
•
Distribution Models (Bishop, 2006): Clusters are modeled using statistical distributions, such as multivariate normal distributions using Expectation-Maximization algorithm.

Complete Chapter List

Search this Book:

Reset

MLA

APA

Chicago

Export Reference

Optimized Clustering Techniques with Special Focus to Biomedical Datasets

Abstract

Introduction

Complete Chapter List