Feature Selection for the Promoter Recognition and Prediction Problem

Feature Selection for the Promoter Recognition and Prediction Problem

George Potamias, Alexandros Kanterakis
ISBN13: 9781599049519|ISBN10: 1599049511|EISBN13: 9781599049526
DOI: 10.4018/978-1-59904-951-9.ch132
Cite Chapter Cite Chapter

MLA

Potamias, George, and Alexandros Kanterakis. "Feature Selection for the Promoter Recognition and Prediction Problem." Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications, edited by John Wang, IGI Global, 2008, pp. 2248-2262. https://doi.org/10.4018/978-1-59904-951-9.ch132

APA

Potamias, G. & Kanterakis, A. (2008). Feature Selection for the Promoter Recognition and Prediction Problem. In J. Wang (Ed.), Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications (pp. 2248-2262). IGI Global. https://doi.org/10.4018/978-1-59904-951-9.ch132

Chicago

Potamias, George, and Alexandros Kanterakis. "Feature Selection for the Promoter Recognition and Prediction Problem." In Data Warehousing and Mining: Concepts, Methodologies, Tools, and Applications, edited by John Wang, 2248-2262. Hershey, PA: IGI Global, 2008. https://doi.org/10.4018/978-1-59904-951-9.ch132

Export Reference

Mendeley
Favorite

Abstract

With the completion of various whole genomes, one of the fundamental bioinformatics tasks is the identification of functional regulatory regions, such as promoters, and the computational discovery of genes from the produced DNA sequences. Confronted with huge amounts of DNA sequences, the utilization of automated computational sequence analysis methods and tools is more than demanding. In this article, we present an efficient feature selection to the promoter recognition, prediction, and localization problem. The whole approach is implemented in a system called MineProm. The basic idea underlying our approach is that each position-nucleotide pair in a DNA sequence is represented by a distinct binary-valued feature—the binary position base value (BPBV). A hybrid filter-wrapper, featuredeletion (or addition) algorithmic process is called for in order to select those BPBVs that best discriminate between two DNA sequences target classes (i.e., promoter vs. nonpromoter). MineProm is tested on two widely used benchmark data sets. Assessment of results demonstrates the reliability of the approach.

Request Access

You do not own this content. Please login to recommend this title to your institution's librarian or purchase it from the IGI Global bookstore.