Multiclass Molecular Classification

Multiclass Molecular Classification

Chia Huey Ooi
Copyright: © 2009 |Pages: 6
ISBN13: 9781605660103|ISBN10: 1605660108|EISBN13: 9781605660110
DOI: 10.4018/978-1-60566-010-3.ch209
Cite Chapter Cite Chapter

MLA

Huey Ooi, Chia. "Multiclass Molecular Classification." Encyclopedia of Data Warehousing and Mining, Second Edition, edited by John Wang, IGI Global, 2009, pp. 1352-1357. https://doi.org/10.4018/978-1-60566-010-3.ch209

APA

Huey Ooi, C. (2009). Multiclass Molecular Classification. In J. Wang (Ed.), Encyclopedia of Data Warehousing and Mining, Second Edition (pp. 1352-1357). IGI Global. https://doi.org/10.4018/978-1-60566-010-3.ch209

Chicago

Huey Ooi, Chia. "Multiclass Molecular Classification." In Encyclopedia of Data Warehousing and Mining, Second Edition, edited by John Wang, 1352-1357. Hershey, PA: IGI Global, 2009. https://doi.org/10.4018/978-1-60566-010-3.ch209

Export Reference

Mendeley
Favorite

Abstract

Molecular classification involves the classification of samples into groups of biological phenotypes. Studies on molecular classification generally focus on cancer for the following reason: Molecular classification of tumor samples from patients into different molecular types or subtypes is vital for diagnosis, prognosis, and effective treatment of cancer (Slonim, Tamayo, Mesirov, Golub, and Lander, 2000). Traditionally, such classification relies on observations regarding the location (Slonim et al., 2000) and microscopic appearance of the cancerous cells (Garber et al., 2001). These methods have proven to be slow and ineffective; there is no way of predicting with reliable accuracy the progress of the disease, since tumors of similar appearance have been known to take different paths in the course of time. With the advent of the microarray technology, data regarding the gene expression levels in each tumor sample may now prove to be a useful tool in molecular classification. This is because gene expression data provide snapshots of the activities within the cells and thus, the profile of the state of the cells in the tissue. The use of microarrays for gene expression profiling was first published in 1995 (Schena, Shalon, Davis, and Brown, 1995). In a typical microarray experiment, the expression levels of up to 10,000 or more genes are measured in each sample. The high-dimensionality of the data means that feature selection (FS) plays a crucial role in aiding the classification process by reducing the dimensionality of the input to the classification process. In the context of FS, the terms gene and feature will be used interchangeably in the context of gene expression data.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.