Search the World's Largest Database of Information Science & Technology Terms & Definitions
InfInfoScipedia LogoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Protein Classification

Handbook of Research on Text and Web Mining Technologies
A process which enables to build a predictive model that automatically assign the functional family of a protein sequence from its description.
Published in Chapter:
Using the Text Categorization Framework for Protein Classification
Ricco Rakotomalala (University of Lyon, France) and Faouzi Mhamdi (University of Jandouba, Tunisia)
Copyright: © 2009 |Pages: 13
DOI: 10.4018/978-1-59904-990-8.ch008
Abstract
In this chapter, we are interested in proteins classification starting from their primary structures. The goal is to automatically affect proteins sequences to their families. The main originality of the approach is that we directly apply the text categorization framework for the protein classification with very minor modifications. The main steps of the task are clearly identified: we must extract features from the unstructured dataset, we use the fixed length n-grams descriptors; we select and combine the most relevant one for the learning phase; and then, we select the most promising learning algorithm in order to produce accurate predictive model. We obtain essentially two main results. First, the approach is credible, giving accurate results with only 2-grams descriptors length. Second, in our context where many irrelevant descriptors are automatically generated, we must combine aggressive feature selection algorithms and low variance classifiers such as SVM (Support Vector Machine).
Full Text Chapter Download: US $37.50 Add to Cart
More Results
Data Mining in Proteomics Using Grid Computing
The process of assigning a single or multiple Protein Family labels to a protein sequence with known aminoacid composition but unknown functions and/or properties.
Full Text Chapter Download: US $37.50 Add to Cart
eContent Pro Discount Banner
InfoSci OnDemandECP Editorial ServicesAGOSR