Scaling Behavior of Maximal Repeat Distributions in Genomic Sequences

Scaling Behavior of Maximal Repeat Distributions in Genomic Sequences

J.D. Wang (Asia University, Taiwan), Hsiang-Chuan Liu (University of Illinois, USA), Jeffrey J.P. Tsai (Asia University, Taiwan) and Ka-Lok Ng (Asia University, Taiwan)
DOI: 10.4018/jcini.2008070103
OnDemand PDF Download:
No Current Special Offers


The genome sequences data from various organisms were analyzed, and it is found that the relative frequency distributions of maximal repeat sequences P(k) verses the frequency of appearance k exhibits scaling behavior (P(k) ~ k-?). Correlation analysis provides very good evidence (with a coefficient of determination r2 > 0.875 for every case studied case, and the scaling relation is valid over three orders of magnitude of k) supporting that the distributions are well described by the power-law. It is found that the scaling behavior holds at the chromosome level, for different organelles (nucleus, chloroplast and mitochondria) and for a very wide range of taxa, such as Fungi, Algea, Protozoa, Archaea, bacteria, Plants, Nematode. This result is quite surprise as it suggests that (1) the scaling behavior seems to be universal and probably independent of the organisms, and (2) genomic sequences have features resembles natural languages.

Complete Article List

Search this Journal:
Volume 16: 1 Issue (2022)
Volume 15: 4 Issues (2021)
Volume 14: 4 Issues (2020)
Volume 13: 4 Issues (2019)
Volume 12: 4 Issues (2018)
Volume 11: 4 Issues (2017)
Volume 10: 4 Issues (2016)
Volume 9: 4 Issues (2015)
Volume 8: 4 Issues (2014)
Volume 7: 4 Issues (2013)
Volume 6: 4 Issues (2012)
Volume 5: 4 Issues (2011)
Volume 4: 4 Issues (2010)
Volume 3: 4 Issues (2009)
Volume 2: 4 Issues (2008)
Volume 1: 4 Issues (2007)
View Complete Journal Contents Listing