Special Offers
- IGI Global’s New Emerging Topic e-Book Collections
  Acquire highly focused and affordable Cutting-Edge Peer-Reviewed Research Content through a selection of 17 topic-focused e-Book Collections discounted up to 90%, compared to list prices. Collection topics include Artificial Intelligence, Data Science, Language Learning, Marketing and Customer Relations, Sustainability, and many more. Hosted on the InfoSci^® platform, these collections feature no DRM, no additional cost for multi-user licensing, no embargo of content, full-text PDF & HTML format, and more.
  Learn More
- Open Access Book (Free Access) - Encyclopedia of Information Science and Technology, Sixth Edition (ISBN: 9781668473665)
  The Encyclopedia of Information Science and Technology, Sixth Edition) continues the legacy set forth by the first five editions by providing comprehensive coverage and up-to-date definitions of the most important issues, concepts, and trends pertaining to technological advancements and information management within a variety of settings and industries. The entire book is being published under open access.
  Read Now
- Open Access Book (Free Access) - Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries (ISBN: 9781668456293)
  Food Sustainability, Environmental Awareness, and Adaptation and Mitigation Strategies for Developing Countries provides information on the recent technology, mitigation, and environmental protection that must be applied for food sustainability in developing countries. This book is being published under Platinum Open Access through funding from Diponegoro University, Indonesia.
  Read Now
- Open Access Book (Free Access) - New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY (ISBN: 9781668438091)
  The Walmart Corporation and the Lumina Foundation have provided funding to make New Models of Higher Education: Unbundled, Rebundled, Customized, and DIY fully open access, completely removing any paywall between scholars in education and the latest research on new models for the future of higher education.
  Read Now
- Open Access Book (Free Access) - Handbook of Research on the Global View of Open Access and Scholarly Communications (ISBN: 9781799898054)
  Through a collaboration between IGI Global and the University of North Texas, the Handbook of Research on the Global View of Open Access and Scholarly Communications has been published as fully open access, completely removing any paywall between researchers of any field, and the latest research on the equitable and inclusive nature of Open Access and all of its complications.
  Read Now
Books
- - Books by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Books by Field
Journals
- - Journals
  - OnDemand Journal Articles
  - Journals by Subject
  - Business, Administration, & Management
  - Scientific, Technical, & Medical (STM)
  - Education
  - Journals by Field
e-Collections
Open Access
- View All Open Access Opportunities
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Find an Open Access Journal for Your Next Manuscript
  Search across all of IGI Global’s available open access publishing opportunities to unleash your research potential.
  Submit an Open Access Book Proposal
  Learn more about open access book publishing and how it can propel your research forward in the field.
  Convert Your Work to Open Access
  Already published? You can convert your work to open access to increase its impact through IGI Global’s Restrospective Open Access Program.
  Utilize Open Access Collection Database
  Open up your research potential by utilizing our open access content or integrating the open access collection into your library
  Consider Open Access Agreements
  For Libraries: consider no-cost or investment-level open access agreements with IGI Global to support your faculty's research endeavors.
  Search Funding Resources
  Looking for additional funding resources to support your open accesss endeavors? View industry resources compiled by our open access team.
  Review Open Access Policies & Ethical Guidelines
  Considering IGI Global to publish your work under open access? Review IGI Global’s open access policies and ethical guidelines
Publish with Us
Resources
- - Instructors
  - Course Adoption
  - Teaching Cases
  - K-12 Online Learning Collection
  - Authors and Editors
  - eEditorial Discovery^® System
  - Peer Review Process
  - Ethics and Malpractice
  - COPE Membership
  - Fair Use Policy
  - Open Access Publishing
  - FAQ
Catalogs
About Us
Newsroom

Evaluation of Clustering Patterns using Singular Value Decomposition (SVD): A Case Study of Metabolic Syndrome

Josephine M. Namayanja

Source Title: International Journal of Computational Models and Algorithms in Medicine (IJCMAM) 1(3)

DOI: 10.4018/jcmam.2010070104

OnDemand:

(Individual Articles)

Available

$37.50

Current Special Offers

No Current Special Offers

Abstract

Computational techniques, such as Simple K, have been used for exploratory analysis in applications ranging from data mining research, machine learning, and computational biology. The medical domain has benefitted from these applications, and in this regard, the authors analyze patterns in individuals of selected age groups linked with the possibility of Metabolic Syndrome (MetS), a disorder affecting approximately 45% of the elderly. The study identifies groups of individuals behaving in two defined categories, that is, those diagnosed with MetS (MetS Positive) and those who are not (MetS Negative), comparing the pattern definition. The paper compares the cluster formation in patterns when using a data reduction technique referred to as Singular Value Decomposition (SVD) versus eliminating its application in clustering. Data reduction techniques like SVD have proved to be very useful in projecting only what is considered to be key relations in the data by suppressing the less important ones. With the existence of high dimensionality, the importance of SVD can be highly effective. By applying two internal measures to validate the cluster quality, findings in this study prove interesting in context to both approaches.

Article Preview

Top

Introduction

In a realistic sense humans by nature are different, however it might seem that if individuals are affected by similar factors – either internal or external, they tend to exhibit similar characteristics. What if these characteristics vary from one individual to another or between groups of individuals? This study identifies groups of clusters in individuals that are diagnosed with MetS (MetS Positive) and is furthered with a comparison of those do not fit into the diagnosis (MetS Negative). In order to understand the patterns of the clusters formed, there are two techniques applied – 1) Simple K – Means clustering and 2) Singular Value Decomposition (SVD) as a pre-processing task performed prior to the former. Because the process of collecting medical data can be quite tasking on the part of health professionals due to the limitless number of features that surround its domain, it presents even more of a problem for those, such as researchers and analysts, who try to expand the use of this type of data. Such datasets are highly dimensional and thus contain just about too many attributes for a single record. This often makes it difficult for knowledge workers to not only make a viable selection but to also represent this information in a manner that supports both visualization and clarity. Therefore (Thomasian, Castelli, & Li, 1998) commend that computational research presents data reduction techniques such as SVD and Principle Component Analysis (PCA) that can be applied to manage this curse of dimensionality.

In the first approach (also referred to as Non- SVD) of this study, a Simple K-means clustering algorithm is used as an exploratory analysis technique to group individuals of selected age groups with similar characteristics into k clusters where k refers to the ideal number of clusters (Bunn & Ostrovsky, 2007) with an aim of maximizing intra cluster similarity and minimizing inter cluster similarity. While in the second approach SVD (also referred to as SVD- based clustering in this study) is initially applied to create a factorization of a given matrix X and hence reduce the dimensionality of its underlying data structure to produce a more appropriate subspace of the data based on some ranking system (Phillips, Watson, & Wynne, 2008). This ranking system describes the linear independence of a data matrix which is determined by the dependency of rows/columns on each other. Further still, this technique can be applied to any type of data set D represented as any size of matrix but the linearity may differ depending on the distribution of the data. (Martin, 2005) adds that it also provides a method for mathematically discovering correlations within data and in this study the use of SVD is followed by clustering the reduced dataset into k clusters. The purpose of both approaches sums up into a comparison of clusters formed at both iterations to determine the performance of the clustering algorithm depending on the quality of clusters formed. As the technique of clustering has previously been applied to numerous applications such as information retrieval, machine learning, data mining research (Bunn & Ostrovsky, 2007) and computational biology (Frahling & Sohler, 2006), SVD has taken phase in face recognition, medical imaging as stated in (Hsu et al., 2007; Lee et al., 2004; Lui et al., 2009; Ma et al., 2009; Zanderigo et al., 2009) and is also used as a complimentary technique on Latent Semantic Indexing (LSI) and clustering which is the case in this study. More so, computational methods such as Markov Chains and SVD itself have been looked at previously in alliance with several aspects of the health domain from brain functionality (Brockwell, Kass, & Schwartz, 2007) and Alzheimer’s disease (Li et al., 2005) to cardio (Brinegar et al., 2009) plus gene related studies (Alshalalfa, Alhaji, & Rokne, 2008). With the objective of this study in mind, the comparison of cluster quality is validated using two internal measures which include: 1) Sum of Square Error (SSE) which looks at the distance (Euclidean distance) of the data point x from the cluster centroid c and 2) Silhouette Coefficient (Sil) which considers the distances of individual data points to other points internally within a cluster and externally to other clusters formed (Frahling & Sohler, 2006). This foreword on the study is extended with a motivation that describes Metabolic Syndrome in depth and samples on real statistical based examples of its effects.

Complete Article List

Search this Journal:

Reset

Volume 4: 2 Issues (2014)

Volume 3: 4 Issues (2012)

Volume 2: 4 Issues (2011)

Volume 1: 4 Issues (2010)

View Complete Journal Contents Listing

MLA

APA

Chicago

Export Reference

Evaluation of Clustering Patterns using Singular Value Decomposition (SVD): A Case Study of Metabolic Syndrome

Abstract

Introduction

Complete Article List