Statistical analysis and prediction of the exonic structure of human genes
- 1 September 1992
- journal article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 35 (3) , 239-252
- https://doi.org/10.1007/bf00178600
Abstract
Nonhomologous fully sequenced human protein-coding genes were studied. Three sets of exon-exon junctions were formed defined by the intron (shadow) position relative to the reading frame. For the analysis of intron shadow signals in exons, information content and discrimination energy approaches were used with the correction allowing one to ignore the influence of a protein-coding message. The corrected formulas allow one to define the consensuses for the three types of intron shadow signals as aAG/guwn, cAG/GUnn, and cAG/gunU, and provide better recognition than the original formulas. The analysis of the codon usage in the signal positions leads to the conclusion that the prevalence of some amino acids in corresponding protein sites is caused by the signal requirements and not vice versa. The distribution of potential intron shadow signals in exons contradicts the hypothesis of intron insertion into suitable preexisting sites. There exists a correlation between the intron types and/or the exon length modulo 3.Keywords
This publication has 42 references indexed in Scilit:
- Evolution of collagen IV genes from a 54-base pair exon: A role for introns in gene evolutionJournal of Molecular Evolution, 1990
- A general model for the evolution of nuclear pre-mRNA intronsJournal of Theoretical Biology, 1989
- Intron Existence Predated the Divergence of Eukaryotes and ProkaryotesScience, 1988
- Structure of vertebrate genes: A statistical analysis implicating selectionJournal of Molecular Evolution, 1988
- Structural organization of the 5′ region of the thyroglobulin geneJournal of Molecular Biology, 1987
- Intron‐dependent evolution: Preferred types of exons and intronsFEBS Letters, 1987
- Selection of DNA binding sites by regulatory proteinsJournal of Molecular Biology, 1987
- Introns as relict retrotransposons: Implications for the evolutionary origin of eukaryotic mRNA splicing mechanismsJournal of Theoretical Biology, 1986
- Information content of binding sites on nucleotide sequencesJournal of Molecular Biology, 1986
- Evolution of the proteases of blood coagulation and fibrinolysis by assembly from modulesCell, 1985