Methods for Reverse Engineering of Gene Regulatory Networks

Methods for Reverse Engineering of Gene Regulatory Networks

Hendrik Hache (Max Planck Institute for Molecular Genetics, Germany)
Copyright: © 2009 |Pages: 19
DOI: 10.4018/978-1-60566-076-9.ch029
OnDemand PDF Download:


In this chapter, different methods and applications for reverse engineering of gene regulatory networks that have been developed in recent years are discussed and compared. Inferring gene networks from different kinds of experimental data are a challenging task that emerged, especially with the development of high throughput technologies. Various computational methods based on diverse principles were introduced to identify new regulations among genes. Mathematical aspects of the models are highlighted, and applications for reverse engineering are mentioned.
Chapter Preview


Deciphering the structure of gene regulatory networks by means of computational methods is a challenging task that emerged during last decades. Large scale experiments, not only gene expression measurements from microarrays but also promoter sequence search for transcription factor binding sites and investigations of protein-DNA interactions have spawned various computational algorithms to infer the structure of underlying gene regulatory networks. Identifying gene interactions yields to an understanding of the topology of gene regulatory networks and the functional role of any gene in a particular pathway. Of interest are genes with a certain impact on pathways. These can be therefore putative drug targets. Once a network is obtained, in silico experiments can be performed to test hypotheses and generate predictions on disease states or on the behavior of the system under different conditions (Wierling et al., 2007).

Quantitative gene expression measurements using microarrays were first performed by Schena et al. (1995) for 45 Arabidopsis thaliana genes and shortly after, for thousands of genes or even a whole genome (DeRisi et al., 1996; DeRisi et al. 1997). Since that time various methods for the analysis of such large-scale data have been developed. First of all, clustering algorithms were used to partition genes into subsets of co-regulated genes regarding to their expression profiles (Eisen et al., 1998). It is found that genes belonging to the same cluster have similar biological functions. But this does not imply information about any directed regulatory interactions among these genes. For this purpose, more sophisticated methods were employed to reverse engineer gene networks and regulation causality from such data. Therefore, reverse engineering constitutes an intermediate step from correlative to causative data analysis.

Gardner and Faith (2005) classified reconstruction algorithms into two general strategies: “physical” approaches and “influence” approaches. Algorithms of the first group seek to identify interactions between transcription factors and DNA and reveal protein factors that physically control RNA synthesis. These methods, such as promoter binding analysis, as performed by Lee et al. (2002), uses the genomic sequence information directly. The second strategy, the “influence” approach, aims to identify causal relationships between RNA transcripts by the examination of expression profiles. The regulation of the transcription machinery can be effected on multiple levels. For instance, it can be regulated on the DNA, transcriptional, or translational level. The regulation on the DNA or transcriptional level is mainly due to binding of transcription factors on specific parts of DNA or by chemical or structural modifications of DNA. A regulation on the translational level might be due to microRNAs resulting in the decay of the respective mRNA target (Ruvkun, 2001). Since the quality of current measured protein concentration is not sufficiently high enough for reconstruction purposes, it is assumed in mathematical models that changes in expression as measured by mRNAs concentrations can explain changes of other gene transcripts. However, a recent study by Newman et al. (2006) showed that many changes in the measured protein levels at single-cell resolution are not observable by DNA microarray experiments. In computational models many regulation effects are neglected or included as hidden factors. In recent years, a combination of both approaches were employed by integrating multiple data sources for the construction of priors of networks or parameters (Imoto et al., 2003; Bernard and Hartemink, 2005; Werhli and Husmeier, 2007).

In this chapter different reverse engineering algorithms following an “influence approach” that have been proposed in the last decades are discussed. Moreover, relevant mathematical aspects are briefly described and applications to reveal gene regulatory networks or part of them are highlighted. Another crucial point that is discussed in this chapter is the validation of algorithms. The reverse engineering methods have to cope with noisy, high dimensional, and incomplete data. But the quality and amount of useful measurements for reverse engineering is increasing.

Key Terms in this Chapter

Neural Network: This refers to a graphical structure with artificial neurons as nodes. The node value of each node is determined by the input signals of the connected nodes passing through a nonlinear transfer function.

Posterior Probability Distribution: It is the conditional probability distribution of a random variable after observing an other variable. It is computed from the prior and the likelihood function.

Prior Probability Distribution: It is the probability distribution of a random variable before any data has been observed. It expresses information about a variable obtain beforehand. Often it is called prior.

Markov Assumption: The conditional probability distribution of the current state is independent of all non-parents. It means for a dynamical system that given the present state, all following states are independent of all past states.

Associated Network: Each arc of this network is associated with an similarity measure between values of the nodes. This measure can be Pearson correlation, mutual information, or others. The node values are vectors of real numbers.

Bayesian Network: This refers to a probabilistic graphical network model defined by a set of random variables and a set of conditional probability distributions. These can be multinomial for discrete variables, Gaussian, for continuous variables, or others.

MCMC: Short for Markov Chain Monte Carlo. It is a class of algorithms for sampling from a probability distribution. This distribution is simulated with a Markov Chain whose equilibrium distribution is the desired probability distribution.

Boolean network: This refers to a graphical structure with nodes that can have two discrete states. A state of a node is determined by the state of other connected nodes. The state of the network is determined by the state of each node.

Mutual Information: It is the information of one random variable in another one. In other words, it is the reduction of uncertainty about one variable after observing the other. It is symmetric in respect to the variables.

Likelihood Function: It is the probability of the occurrence of a sample configuration. The conditional probability distribution of a random variable given the parameters of the distribution has to be known. L ( q | X = x ) = P ( X = x | q ) is a likelihood function, where X is a random variable, x is the observed value of X , and q is a parameter.

Ordinary Differential Equation: It is an equation that contains exactly one real variable and its derivatives.

Reverse Engineering: In general, it is the reconstruction of a system by analyzing of its structure, functions, and operations. Reverse engineering of gene regulatory networks is the process of revealing the underlying structure of gene regulation from biological measurements, such as gene and protein expression, or others.

Complete Chapter List

Search this Book:
Editorial Advisory Board
Table of Contents
Ralf Herwig
Andriani Daskalaki
Andriani Daskalaki
Chapter 1
Peter Ghazal
An increasing number of biological experiments and more recently clinical based studies are being conducted using large-scale genomic, proteomic and... Sample PDF
Pathway Biology Approach to Medicine
Chapter 2
Peter Wellstead, Sree Sreenath, Kwang-Hyun Cho
In this chapter the authors describe systems and control theory concepts for systems biology and the corresponding implications for medicine. The... Sample PDF
Systems and Control Theory for Medical Systems Biology
Chapter 3
S. Nikolov
In this chapter we investigate how the inclusion of time delay alters the dynamic properties of (a) delayed protein cross talk model, (b) time delay... Sample PDF
Mathematical Description of Time Delays in Pathways Cross Talk
Chapter 4
Elisabeth Maschke-Dutz
In this chapter basic mathematical methods for the deterministic kinetic modeling of biochemical systems are described. Mathematical analysis... Sample PDF
Deterministic Modeling in Medicine
Chapter 5
Andrew Kuznetsov
Biologists have used a reductionist approach to investigate the essence of life. In the last years, scientific disciplines have merged with the aim... Sample PDF
Synthetic Biology as a Proof of Systems Biology
Chapter 6
Tuan D. Pham
Computational models have been playing a significant role for the computer-based analysis of biological and biomedical data. Given the recent... Sample PDF
Computational Models for the Analysis of Modern Biological Data
Chapter 7
Vanathi Gopalakrishnan
This chapter provides a perspective on 3 important collaborative areas in systems biology research. These areas represent biological problems of... Sample PDF
Computer Aided Knowledge Discovery in Biomedicine
Chapter 8
Thomas Meinel
The function of proteins is a main subject of research in systems biology. Inference of function is now, more than ever, required by the upcoming of... Sample PDF
Function and Homology of Proteins Similar in Sequence: Phylogenetic Profiling
Chapter 9
Nikolaos G. Sgourakis, Pantelis G. Bagos, Stavros J. Hamodrakas
GPCRs comprise a wide and diverse class of eukaryotic transmembrane proteins with well-established pharmacological significance. As a consequence of... Sample PDF
Computational Methods for the Prediction of GPCRs Coupling Selectivity
Chapter 10
Pantelis G. Bagos, Stavros J. Hamodrakas
ß-barrel outer membrane proteins constitute the second and less well-studied class of transmembrane proteins. They are present exclusively in the... Sample PDF
Bacterial ß-Barrel Outer Membrane Proteins: A Common Structural Theme Implicated in a Wide Variety of Functional Roles
Chapter 11
L.K. Flack
Clustering methods are used to place items in natural patterns or convenient groups. They can be used to place genes into clusters to have similar... Sample PDF
Clustering Methods for Gene-Expression Data
Chapter 12
George Sakellaropoulos, Antonis Daskalakis, George Nikiforidis, Christos Argyropoulos
The presentation and interpretation of microarray-based genome-wide gene expression profiles as complex biological entities are considered to be... Sample PDF
Uncovering Fine Structure in Gene Expression Profile by Maximum Entropy Modeling of cDNA Microarray Images and Kernel Density Methods
Chapter 13
Wasco Wruck
This chapter describes the application of the BeadArrayTM technology for gene expression profiling. It introduces the BeadArrayTM technology, shows... Sample PDF
Gene Expression Profiling with the BeadArrayTM Platform
Chapter 14
Djork-Arné Clevert, Axel Rasche
Readers shall find a quick introduction with recommendations into the preprocessing of Affymetrix GeneChip® microarrays. In the rapidly growing... Sample PDF
The Affymetrix GeneChip® Microarray Platform
Chapter 15
Jacek Majewski
Eukaryotic genes have the ability to produce several distinct products from a single genomic locus. Recent developments in microarray technology... Sample PDF
Alternative Isoform Detection Using Exon Arrays
Chapter 16
Prerak Desai
The use of systems biology to study complex biological questions is gaining ground due to the ever-increasing amount of genetic tools and genome... Sample PDF
Gene Expression in Microbial Systems for Growth and Metabolism
Chapter 17
Heike Stier
Alternative splicing is an important part of the regular process of gene expression. It controls time and tissue dependent expression of specific... Sample PDF
Alternative Splicing and Disease
Chapter 18
Axel Kowald
Aging is a complex biological phenomenon that practically affects all multicellular eukaryotes. It is manifested by an ever increasing mortality... Sample PDF
Mathematical Modeling of the Aging Process
Chapter 19
Evgenia Makrantonaki
This chapter introduces an in vitro model as a means of studying human hormonal aging. For this purpose, human sebaceous gland cells were maintained... Sample PDF
The Sebaceous Gland: A Model of Hormonal Aging
Chapter 20
R. Seigneuric, N.A.W. van Riel, M.H.W. Starmans, A. van Erk
Complex diseases such as cancer have multiple origins and are therefore difficult to understand and cure. Highly parallel technologies such as DNA... Sample PDF
Systems Biology Applied to Cancer Research
Chapter 21
Matej Orešic, Antonio Vidal-Puig
In this chapter the authors report on their experience with the analysis and modeling of data obtained from studies of animal models related to... Sample PDF
Systems Biology Strategies in Studies of Energy Homeostasis In Vivo
Chapter 22
Axel Rasche
We acquired new computational and experimental prospects to seek insight and cure for millions of afflicted persons with an ancient malady. Type 2... Sample PDF
Approaching Type 2 Diabetes Mellitus by Systems Biology
Chapter 23
Alia Benkahla, Lamia Guizani-Tabbane, Ines Abdeljaoued-Tej, Slimane Ben Miled, Koussay Dellagi
This chapter reports a variety of molecular biology informatics and mathematical methods that model the cell response to pathogens. The authors... Sample PDF
Systems Biology and Infectious Diseases
Chapter 24
Daniela Albrecht, Reinhard Guthke
This chapter describes a holistic approach to understand the molecular biology and infection process of human-pathogenic fungi. It comprises the... Sample PDF
Systems Biology of Human-Pathogenic Fungi
Chapter 25
Jessica Ahmed
Secretases are aspartic proteases, which specifically trim important, medically relevant targets such as the amyloid-precursor protein (APP) or the... Sample PDF
Development of Specific Gamma Secretase Inhibitors
Chapter 26
Paul Wrede
Peptides fulfill many tasks in controlling and regulating cellular functions and are key molecules in systems biology. There is a great demand in... Sample PDF
In Machina Systems for the Rational De Novo Peptide Design
Chapter 27
Ferda Mavituna, Raul Munoz-Hernandez, Ana Katerine de Carvalho Lima Lobato
This chapter summarizes the fundamentals of metabolic flux balancing as a computational tool of metabolic engineering and systems biology. It also... Sample PDF
Applications of Metabolic Flux Balancing in Medicine
Chapter 28
Roberta Alfieri, Luciano Milanesi
This chapter aims to describe data integration and data mining techniques in the context of systems biology studies. It argues that the different... Sample PDF
Multi-Level Data Integration and Data Mining in Systems Biology
Chapter 29
Hendrik Hache
In this chapter, different methods and applications for reverse engineering of gene regulatory networks that have been developed in recent years are... Sample PDF
Methods for Reverse Engineering of Gene Regulatory Networks
Chapter 30
Alok Mishra
This chapter introduces the techniques that have been used to identify the genetic regulatory modules by integrating data from various sources. Data... Sample PDF
Data Integration for Regulatory Gene Module Discovery
Chapter 31
Elizabeth Santiago-Cortés
Biological systems are composed of multiple interacting elements; in particular, genetic regulatory networks are formed by genes and their... Sample PDF
Discrete Networks as a Suitable Approach for the Analysis of Genetic Regulation
Chapter 32
A. Maffezzoli
In this chapter, authors review main methods, approaches, and models for the analysis of neuronal network data. In particular, the analysis concerns... Sample PDF
Investigating the Collective Behavior of Neural Networks: A Review of Signal Processing Approaches
Chapter 33
Paolo Vicini
This chapter describes the System for Population Kinetics (SPK), a novel Web service for performing population kinetic analysis. Population kinetic... Sample PDF
The System for Population Kinetics: Open Source Software for Population Analysis
Chapter 34
Julia Adolphs
This chapter introduces the theory of optical spectra and excitation energy transfer of light harvesting complexes in photosynthesis. The light... Sample PDF
Photosynthesis: How Proteins Control Excitation Energy Transfer
Chapter 35
Michael R. Hamblin
Photodynamic therapy (PDT) is a rapidly advancing treatment for multiple diseases. PDT involves the administration of a nontoxic drug or dye known... Sample PDF
Photodynamic Therapy: A Systems Biology Approach
Chapter 36
Andriani Daskalaki
Photodynamic Therapy (PDT) involves administration of a photosensitizer (PS) either systemically or locally, followed by illumination of the lesion... Sample PDF
Modeling of Porphyrin Metabolism with PyBioS
Chapter 37
Alexey R. Brazhe, Nadezda A. Brazhe, Alexey N. Pavlov, Georgy V. Maksimov
This chapter describes the application of interference microscopy and double-wavelet analysis to noninvasive study of cell structure and function.... Sample PDF
Interference Microscopy for Cellular Studies
Chapter 38
Cathrin Dressler, Olaf Minet, Urszula Zabarylo, Jürgen Beuthan
This chapter deals with the mitochondrias’ stress response to heat, which is the central agent of thermotherapy. Thermotherapies function by... Sample PDF
Fluorescence Imaging of Mitochondrial Long-Term Depolarization in Cancer Cells Exposed to Heat-Stress
Chapter 39
Athina Theodosiou, Charalampos Moschopoulos, Marc Baumann, Sophia Kossida
In previous years, scientists have begun understanding the significance of proteins and protein interactions. The direct connection of those with... Sample PDF
Protein Interactions and Diseases
Chapter 40
Bernard de Bono
From a genetic perspective, disease can be interpreted in terms of a variation in molecular sequence or expression (dose) that impairs normal... Sample PDF
The Breadth and Depth of BioMedical Molecular Networks: The Reactome Perspective
Chapter 41
Jorge Numata
Thermodynamics is one of the best established notions in science. Some recent work in biomolecular modeling has sacrificed its rigor in favor of... Sample PDF
Entropy and Thermodynamics in Biomolecular Simulation
Chapter 42
Isabel Reinecke, Peter Deuflhard
In this chapter some model development concepts can be used for the mathematical modeling in physiology as well as a graph theoretical approach for... Sample PDF
Model Development and Decomposition in Physiology
Chapter 43
Mohamed Derouich
Throughout the world, seasonal outbreaks of influenza affect millions of people, killing about 500,000 individuals every year. Human influenza... Sample PDF
A Pandemic Avian Influenza Mathematical Model
Chapter 44
Mohamed Derouich
Dengue fever is a re-emergent disease affecting more than 100 countries. Its incidence rate has increased fourfold since 1970 with nearly half the... Sample PDF
Dengue Fever: A Mathematical Model with Immunization Program
Chapter 45
Ross Foley
The field of histopathology has encountered a key transition point with the progressive move towards use of digital slides and automated image... Sample PDF
Automated Image Analysis Approaches in Histopathology
About the Contributors