Uncovering Fine Structure in Gene Expression Profile by Maximum Entropy Modeling of cDNA Microarray Images and Kernel Density Methods

Uncovering Fine Structure in Gene Expression Profile by Maximum Entropy Modeling of cDNA Microarray Images and Kernel Density Methods

George Sakellaropoulos (University of Patras, Greece), Antonis Daskalakis (University of Patras, Greece), George Nikiforidis (University of Patras, Greece) and Christos Argyropoulos (University of Pittsburgh Medical Center, USA)
Copyright: © 2009 |Pages: 18
DOI: 10.4018/978-1-60566-076-9.ch012
OnDemand PDF Download:
$37.50

Abstract

The presentation and interpretation of microarray-based genome-wide gene expression profiles as complex biological entities are considered to be problematic due to their featureless, dense nature. Furthermore microarray images are characterized by significant background noise, but the effects of the latter on the holistic interpretation of gene expression profiles remains under-explored. We hypothesize that a framework combining (a) Bayesian methodology for background adjustment in microarray images with (b) model-free modeling tools, may serve the dual purpose of data and model reduction, exposing hitherto hidden features of gene expression profiles. Within the proposed framework, microarray image restoration and noise adjustment is facilitated by a class of prior Maximum Entropy distributions. The resulting gene expression profiles are non-parametrically modeled by kernel density methods, which not only normalize the data, but facilitate the generation of reduced mathematical descriptions of biological variability as mixture models.
Chapter Preview
Top

Introduction

The advent of complementary DNA (cDNA) microarray technologies enabled the simultaneous and specific assessment of the expression levels of thousands of genes (Southern, Mir, & Shchepinov, 1999). The conventional approach to analyze such datasets is to explore quantitative co-expression relations across a variety of experimental conditions prior to invoking putative similarities in gene regulation or function (DeRisi, Iyer, & Brown, 1997; Eisen, Spellman, Brown, & Botstein, 1998). The alternative viewpoint considers gene expression profiles from specific conditions to be informative of distinct molecular signatures that characterize cellular states. Such genome wide, transcriptional signatures have been used to distinguish normal from abnormal samples in benign developmental conditions (Barnes et al., 2005), solid tumors and hematologic malignancies (Febbo et al., 2005; Valentini, 2002) and differentiate distinct disease states of renal allografts (Sarwal et al., 2003). It has been suggested that the thousands of expression values in a microarray experiment are too dense and irregular to be directly interpreted in a holistic manner and that alternative transformations of the normalized gene profiles should be sought after (Guo, Eichler, Feng, Ingber, & Huang, 2006). Nevertheless one could justifiably argue that the irregularity of the gene profiles is due to incomplete modeling and adjustment for the presence of measurement noise. However this alternative hypothesis has not been adequately addressed in the current literature. These considerations underline the impetus for the present work, which aims to:

  • 1.

    Establish the role of microarray image background in the irregularity and “featureless” appearance of gene expression profiles (GEP) from individual experimental states.

  • 2.

    Propose a data and model reduction framework for the analysis of GEP consisting of:

    • a.

      A probabilistic Bayesian algorithm for background adjustment of microarray images based on Maximum Entropy distributions.

    • b.

      Non-parametric kernel density estimation methods for the mathematical representation and exploration of the resultant gene expression profiles.

Key Terms in this Chapter

Ontological Modeling: Modeling that implies the existence of certain objects in the physical natural world. The distinction between ontological and epistemological modeling is a subtle one; whereas the former is an investigation about natural objects and properties, the latter concerns the analysis of (usually) subjective statements about models of the world.

Non–Parametric Kernel Density Estimation: Non-parametric kernel density estimation methods are model free techniques for the estimation of an empiric distribution from experimental data. Formally such estimators smooth out the contribution of each observed data point over a local neighborhood.

Power Series Distribution: Discrete probability distributions with probability mass function given by: . Modified power series distribution (MPSD) are more general distributions which arise when ? is a function of another (simple) parameter. In such a case we define the power parameter ? ( m ) and series function ? ( ? ) by: .Particular choices of the power parameter render power distributions that are analytic approximations to Maximum Entropy priors over finite domains.

Normalization: The process in which mathematical transformations of the microarray data are undertaken to reduce variability in the expression levels and make data from different experiments directly comparable.

Epistemological Modeling: Modeling that quantifies one’s perception about the world rather than the world per se. The object of such modeling is to generate coherent descriptions of one’s knowledge usually in the face of uncertainty.

Bayesian probability: An interpretation of the colloquial term probability, which identifies the latter with the degree of belief of a proposition about the world. This interpretation is firmly grounded in the rules of Aristotelian logic and in fact extends the latter in situations of uncertainty i.e. when the truth or falsity of propositions cannot be ascertained completely. Stated in other terms, the construct of Bayesian Probability and the supporting theory is nothing more than “common sense reduced to numbers”. The major instrument for updating one’s prior beliefs to posterior inferences in light of new information is the computational machinery of the Bayes’ theorem.

Maximum Entropy Prior: The distribution that results from application of the variational maximum entropy algorithm. The latter uniquely determines the least biased epistemic (Bayesian probability distribution) that encodes certain testable information is by maximizing the convex functional that information negative entropy defines. The resulting distributions are least informative or “ objective” ones in the sense that they are most compatible with the “pre-data” constraints, while being maximally noncommittal about the missing information.

Complete Chapter List

Search this Book:
Reset
Editorial Advisory Board
Table of Contents
Foreword
Ralf Herwig
Preface
Andriani Daskalaki
Acknowledgment
Andriani Daskalaki
Chapter 1
Peter Ghazal
An increasing number of biological experiments and more recently clinical based studies are being conducted using large-scale genomic, proteomic and... Sample PDF
Pathway Biology Approach to Medicine
$37.50
Chapter 2
Peter Wellstead, Sree Sreenath, Kwang-Hyun Cho
In this chapter the authors describe systems and control theory concepts for systems biology and the corresponding implications for medicine. The... Sample PDF
Systems and Control Theory for Medical Systems Biology
$37.50
Chapter 3
S. Nikolov
In this chapter we investigate how the inclusion of time delay alters the dynamic properties of (a) delayed protein cross talk model, (b) time delay... Sample PDF
Mathematical Description of Time Delays in Pathways Cross Talk
$37.50
Chapter 4
Elisabeth Maschke-Dutz
In this chapter basic mathematical methods for the deterministic kinetic modeling of biochemical systems are described. Mathematical analysis... Sample PDF
Deterministic Modeling in Medicine
$37.50
Chapter 5
Andrew Kuznetsov
Biologists have used a reductionist approach to investigate the essence of life. In the last years, scientific disciplines have merged with the aim... Sample PDF
Synthetic Biology as a Proof of Systems Biology
$37.50
Chapter 6
Tuan D. Pham
Computational models have been playing a significant role for the computer-based analysis of biological and biomedical data. Given the recent... Sample PDF
Computational Models for the Analysis of Modern Biological Data
$37.50
Chapter 7
Vanathi Gopalakrishnan
This chapter provides a perspective on 3 important collaborative areas in systems biology research. These areas represent biological problems of... Sample PDF
Computer Aided Knowledge Discovery in Biomedicine
$37.50
Chapter 8
Thomas Meinel
The function of proteins is a main subject of research in systems biology. Inference of function is now, more than ever, required by the upcoming of... Sample PDF
Function and Homology of Proteins Similar in Sequence: Phylogenetic Profiling
$37.50
Chapter 9
Nikolaos G. Sgourakis, Pantelis G. Bagos, Stavros J. Hamodrakas
GPCRs comprise a wide and diverse class of eukaryotic transmembrane proteins with well-established pharmacological significance. As a consequence of... Sample PDF
Computational Methods for the Prediction of GPCRs Coupling Selectivity
$37.50
Chapter 10
Pantelis G. Bagos, Stavros J. Hamodrakas
ß-barrel outer membrane proteins constitute the second and less well-studied class of transmembrane proteins. They are present exclusively in the... Sample PDF
Bacterial ß-Barrel Outer Membrane Proteins: A Common Structural Theme Implicated in a Wide Variety of Functional Roles
$37.50
Chapter 11
L.K. Flack
Clustering methods are used to place items in natural patterns or convenient groups. They can be used to place genes into clusters to have similar... Sample PDF
Clustering Methods for Gene-Expression Data
$37.50
Chapter 12
George Sakellaropoulos, Antonis Daskalakis, George Nikiforidis, Christos Argyropoulos
The presentation and interpretation of microarray-based genome-wide gene expression profiles as complex biological entities are considered to be... Sample PDF
Uncovering Fine Structure in Gene Expression Profile by Maximum Entropy Modeling of cDNA Microarray Images and Kernel Density Methods
$37.50
Chapter 13
Wasco Wruck
This chapter describes the application of the BeadArrayTM technology for gene expression profiling. It introduces the BeadArrayTM technology, shows... Sample PDF
Gene Expression Profiling with the BeadArrayTM Platform
$37.50
Chapter 14
Djork-Arné Clevert, Axel Rasche
Readers shall find a quick introduction with recommendations into the preprocessing of Affymetrix GeneChip® microarrays. In the rapidly growing... Sample PDF
The Affymetrix GeneChip® Microarray Platform
$37.50
Chapter 15
Jacek Majewski
Eukaryotic genes have the ability to produce several distinct products from a single genomic locus. Recent developments in microarray technology... Sample PDF
Alternative Isoform Detection Using Exon Arrays
$37.50
Chapter 16
Prerak Desai
The use of systems biology to study complex biological questions is gaining ground due to the ever-increasing amount of genetic tools and genome... Sample PDF
Gene Expression in Microbial Systems for Growth and Metabolism
$37.50
Chapter 17
Heike Stier
Alternative splicing is an important part of the regular process of gene expression. It controls time and tissue dependent expression of specific... Sample PDF
Alternative Splicing and Disease
$37.50
Chapter 18
Axel Kowald
Aging is a complex biological phenomenon that practically affects all multicellular eukaryotes. It is manifested by an ever increasing mortality... Sample PDF
Mathematical Modeling of the Aging Process
$37.50
Chapter 19
Evgenia Makrantonaki
This chapter introduces an in vitro model as a means of studying human hormonal aging. For this purpose, human sebaceous gland cells were maintained... Sample PDF
The Sebaceous Gland: A Model of Hormonal Aging
$37.50
Chapter 20
R. Seigneuric, N.A.W. van Riel, M.H.W. Starmans, A. van Erk
Complex diseases such as cancer have multiple origins and are therefore difficult to understand and cure. Highly parallel technologies such as DNA... Sample PDF
Systems Biology Applied to Cancer Research
$37.50
Chapter 21
Matej Orešic, Antonio Vidal-Puig
In this chapter the authors report on their experience with the analysis and modeling of data obtained from studies of animal models related to... Sample PDF
Systems Biology Strategies in Studies of Energy Homeostasis In Vivo
$37.50
Chapter 22
Axel Rasche
We acquired new computational and experimental prospects to seek insight and cure for millions of afflicted persons with an ancient malady. Type 2... Sample PDF
Approaching Type 2 Diabetes Mellitus by Systems Biology
$37.50
Chapter 23
Alia Benkahla, Lamia Guizani-Tabbane, Ines Abdeljaoued-Tej, Slimane Ben Miled, Koussay Dellagi
This chapter reports a variety of molecular biology informatics and mathematical methods that model the cell response to pathogens. The authors... Sample PDF
Systems Biology and Infectious Diseases
$37.50
Chapter 24
Daniela Albrecht, Reinhard Guthke
This chapter describes a holistic approach to understand the molecular biology and infection process of human-pathogenic fungi. It comprises the... Sample PDF
Systems Biology of Human-Pathogenic Fungi
$37.50
Chapter 25
Jessica Ahmed
Secretases are aspartic proteases, which specifically trim important, medically relevant targets such as the amyloid-precursor protein (APP) or the... Sample PDF
Development of Specific Gamma Secretase Inhibitors
$37.50
Chapter 26
Paul Wrede
Peptides fulfill many tasks in controlling and regulating cellular functions and are key molecules in systems biology. There is a great demand in... Sample PDF
In Machina Systems for the Rational De Novo Peptide Design
$37.50
Chapter 27
Ferda Mavituna, Raul Munoz-Hernandez, Ana Katerine de Carvalho Lima Lobato
This chapter summarizes the fundamentals of metabolic flux balancing as a computational tool of metabolic engineering and systems biology. It also... Sample PDF
Applications of Metabolic Flux Balancing in Medicine
$37.50
Chapter 28
Roberta Alfieri, Luciano Milanesi
This chapter aims to describe data integration and data mining techniques in the context of systems biology studies. It argues that the different... Sample PDF
Multi-Level Data Integration and Data Mining in Systems Biology
$37.50
Chapter 29
Hendrik Hache
In this chapter, different methods and applications for reverse engineering of gene regulatory networks that have been developed in recent years are... Sample PDF
Methods for Reverse Engineering of Gene Regulatory Networks
$37.50
Chapter 30
Alok Mishra
This chapter introduces the techniques that have been used to identify the genetic regulatory modules by integrating data from various sources. Data... Sample PDF
Data Integration for Regulatory Gene Module Discovery
$37.50
Chapter 31
Elizabeth Santiago-Cortés
Biological systems are composed of multiple interacting elements; in particular, genetic regulatory networks are formed by genes and their... Sample PDF
Discrete Networks as a Suitable Approach for the Analysis of Genetic Regulation
$37.50
Chapter 32
A. Maffezzoli
In this chapter, authors review main methods, approaches, and models for the analysis of neuronal network data. In particular, the analysis concerns... Sample PDF
Investigating the Collective Behavior of Neural Networks: A Review of Signal Processing Approaches
$37.50
Chapter 33
Paolo Vicini
This chapter describes the System for Population Kinetics (SPK), a novel Web service for performing population kinetic analysis. Population kinetic... Sample PDF
The System for Population Kinetics: Open Source Software for Population Analysis
$37.50
Chapter 34
Julia Adolphs
This chapter introduces the theory of optical spectra and excitation energy transfer of light harvesting complexes in photosynthesis. The light... Sample PDF
Photosynthesis: How Proteins Control Excitation Energy Transfer
$37.50
Chapter 35
Michael R. Hamblin
Photodynamic therapy (PDT) is a rapidly advancing treatment for multiple diseases. PDT involves the administration of a nontoxic drug or dye known... Sample PDF
Photodynamic Therapy: A Systems Biology Approach
$37.50
Chapter 36
Andriani Daskalaki
Photodynamic Therapy (PDT) involves administration of a photosensitizer (PS) either systemically or locally, followed by illumination of the lesion... Sample PDF
Modeling of Porphyrin Metabolism with PyBioS
$37.50
Chapter 37
Alexey R. Brazhe, Nadezda A. Brazhe, Alexey N. Pavlov, Georgy V. Maksimov
This chapter describes the application of interference microscopy and double-wavelet analysis to noninvasive study of cell structure and function.... Sample PDF
Interference Microscopy for Cellular Studies
$37.50
Chapter 38
Cathrin Dressler, Olaf Minet, Urszula Zabarylo, Jürgen Beuthan
This chapter deals with the mitochondrias’ stress response to heat, which is the central agent of thermotherapy. Thermotherapies function by... Sample PDF
Fluorescence Imaging of Mitochondrial Long-Term Depolarization in Cancer Cells Exposed to Heat-Stress
$37.50
Chapter 39
Athina Theodosiou, Charalampos Moschopoulos, Marc Baumann, Sophia Kossida
In previous years, scientists have begun understanding the significance of proteins and protein interactions. The direct connection of those with... Sample PDF
Protein Interactions and Diseases
$37.50
Chapter 40
Bernard de Bono
From a genetic perspective, disease can be interpreted in terms of a variation in molecular sequence or expression (dose) that impairs normal... Sample PDF
The Breadth and Depth of BioMedical Molecular Networks: The Reactome Perspective
$37.50
Chapter 41
Jorge Numata
Thermodynamics is one of the best established notions in science. Some recent work in biomolecular modeling has sacrificed its rigor in favor of... Sample PDF
Entropy and Thermodynamics in Biomolecular Simulation
$37.50
Chapter 42
Isabel Reinecke, Peter Deuflhard
In this chapter some model development concepts can be used for the mathematical modeling in physiology as well as a graph theoretical approach for... Sample PDF
Model Development and Decomposition in Physiology
$37.50
Chapter 43
Mohamed Derouich
Throughout the world, seasonal outbreaks of influenza affect millions of people, killing about 500,000 individuals every year. Human influenza... Sample PDF
A Pandemic Avian Influenza Mathematical Model
$37.50
Chapter 44
Mohamed Derouich
Dengue fever is a re-emergent disease affecting more than 100 countries. Its incidence rate has increased fourfold since 1970 with nearly half the... Sample PDF
Dengue Fever: A Mathematical Model with Immunization Program
$37.50
Chapter 45
Ross Foley
The field of histopathology has encountered a key transition point with the progressive move towards use of digital slides and automated image... Sample PDF
Automated Image Analysis Approaches in Histopathology
$37.50
About the Contributors