Intron distribution difference for 276 ancient and 131 modern genes suggests the existence of ancient introns
Open Access
- 30 October 2001
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 98 (23) , 13177-13182
- https://doi.org/10.1073/pnas.231491498
Abstract
Do introns delineate elements of protein tertiary structure? This issue is crucial to the debate about the role and origin of introns. We present an analysis of the full set of proteins with known three-dimensional structures that have homologs with intron positions recorded in GenBank. A computer program was generated that maps on a reference sequence the positions of all introns in homologous genes. We have applied this program to a set of 665 nonredundant protein sequences with defined three-dimensional structures in the Protein Data Bank (PDB), which yielded 8,217 introns in 407 proteins. For the subset of proteins corresponding to ancient conserved regions (ACR), we find that there is a correlation of phase-zero introns with the boundary regions of modules and no correlation for the phase-one and phase-two positions. However, for a subset of proteins without prokaryotic counterparts (131 non-ACR proteins), a set of presumably modern proteins (or proteins that have diverged extremely far from any ancestral form), we do not find any correlation of phase-zero intron positions with three-dimensional structure. Furthermore, we find an anticorrelation of phase-one intron positions with module boundaries: they actually have a preference for the interior of modules. This finding is explicable as a preference for phase-one introns to lie in glycines, between G|G sequences, the preference for glycines being anticorrelated with the three-dimensional modules. We interpret this anticorrelation as a sign that a number of phase-one introns, and hence many modern introns, have been inserted into G|G “protosplice” sequences.Keywords
This publication has 22 references indexed in Scilit:
- Toward a resolution of the introns early/late debate: Only phase zero introns are correlated with the structure of ancient proteinsProceedings of the National Academy of Sciences, 1998
- Intron positions correlate with module boundaries in ancient proteinsProceedings of the National Academy of Sciences, 1996
- A novel intron site in the triosephosphate isomerase gene from the mosquito Culex tarsalisNature, 1993
- The Exon Theory of GenesCold Spring Harbor Symposia on Quantitative Biology, 1987
- The triosephosphate isomerase gene from maize introns antedate the plant-animal divergenceCell, 1986
- Intron-dependent evolution of the nucleotide-binding domains within alcohol dehydrogenase and related enzymesNucleic Acids Research, 1986
- Structure of the human phosphoglycerate kinase gene and the intron-mediated evolution and dispersal of the nucleotide-binding domain.Proceedings of the National Academy of Sciences, 1985
- Intron-dependent evolution of chicken glyceraldehyde phosphate dehydrogenase geneNature, 1985
- Correlation of DNA exonic regions with protein structural units in haemoglobinNature, 1981
- Genes in pieces: were they ever together?Nature, 1978