Deterministic Motif Mining in Protein Databases

Deterministic Motif Mining in Protein Databases

Pedro Gabriel Ferreira, Paulo Jorge Azevedo
DOI: 10.4018/978-1-60566-058-5.ch158
(Individual Chapters)
No Current Special Offers


Protein sequence motifs describe, through means of enhanced regular expression syntax, regions of amino acids that have been conserved across several functionally related proteins. These regions may have an implication at the structural and functional level of the proteins. Sequence motif analysis can bring significant improvements towards a better understanding of the protein sequence- structure-function relation. In this chapter, we review the subject of mining deterministic motifs from protein sequence databases. We start by giving a formal definition of the different types of motifs and the respective specificities. Then, we explore the methods available to evaluate the quality and interest of such patterns. Examples of applications and motif repositories are described. We discuss the algorithmic aspects and different methodologies for motif extraction. A brief description on how sequence motifs can be used to extract structural level information patterns is also provided.

Complete Chapter List

Search this Book: