Abstract
The statistical correlation of nucleotides in a DNA sequence is described by a set of redundancies D_1, D_2, D_3, ... Calculating {D_n} for 2341 coding regions of nucleic acid sequences shows that about 2/3 of the sequences have correlation length ≤2, 10% show correlation with 3-periodicity, and the remainder show long-range aperiodic correlations. The implications of these results are discussed briefly in terms of the interaction between random mutation and natural selection.
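The abstract does not give the definition of the redundancies, so the following is only a minimal sketch under an assumed, Gatlin-style convention: D_1 = 2 - H_1 measures the departure of base composition from equiprobability, and for n ≥ 2, D_n = h_{n-1} - h_n, where h_n is the entropy of a base conditional on the n - 1 preceding bases (estimated here as a difference of block entropies). The function names `block_entropy` and `redundancies` are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import log2


def block_entropy(seq, k):
    """Shannon entropy in bits of the overlapping k-mers of seq (0 for k == 0)."""
    if k == 0:
        return 0.0
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return -sum(c / total * log2(c / total) for c in counts.values())


def redundancies(seq, n_max):
    """Return [D_1, ..., D_n_max] under the ASSUMED definition D_n = h_{n-1} - h_n.

    h[0] = log2(4) = 2 bits (four equiprobable, independent bases);
    h[n] = H(n) - H(n-1) estimates the entropy of a base given the
    n - 1 bases that precede it.
    """
    h = [2.0]
    for n in range(1, n_max + 1):
        h.append(block_entropy(seq, n) - block_entropy(seq, n - 1))
    return [h[n - 1] - h[n] for n in range(1, n_max + 1)]


if __name__ == "__main__":
    # A strictly periodic toy sequence: uniform composition, so D_1 is 0,
    # but each base is fixed by its predecessor, so D_2 is close to 2 bits.
    print(redundancies("ACGT" * 50, 3))
```

Under this convention, a sequence whose D_n are negligible for n > 2 would correspond to the "correlation length ≤2" class mentioned in the abstract. Estimates from short sequences are biased because the number of possible k-mers grows as 4^k, so higher-order values should be read with caution.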
Additional information
Project supported by National Science Foundation of China.
About this article
Cite this article
Luo, L., Li, H. The statistical correlation of nucleotides in protein-coding DNA sequences. Bltn Mathcal Biology 53, 345–353 (1991). https://doi.org/10.1007/BF02460722
DOI: https://doi.org/10.1007/BF02460722