In silico detection of control signals: mRNA 3′-end-processing sequences in diverse species
- 23 November 1999
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 96 (24) , 14055-14060
- https://doi.org/10.1073/pnas.96.24.14055
Abstract
We have investigated mRNA 3′-end-processing signals in each of six eukaryotic species (yeast, rice, arabidopsis, fruitfly, mouse, and human) through the analysis of more than 20,000 3′-expressed sequence tags. The use and conservation of the canonical AAUAAA element vary widely among the six species and are especially weak in plants and yeast. Even in the animal species, the AAUAAA signal does not appear to be as universal as indicated by previous studies. The abundance of single-base variants of AAUAAA correlates with their measured processing efficiencies. As found previously, the plant polyadenylation signals are more similar to those of yeast than to those of animals, with both common content and arrangement of the signal elements. In all species examined, the complete polyadenylation signal appears to consist of an aggregate of multiple elements. In light of these and previous results, we present a broadened concept of 3′-end-processing signals in which no single exact sequence element is universally required for processing. Rather, the total efficiency is a function of all elements and, importantly, an inefficient word in one element can be compensated for by strong words in other elements. These complex patterns indicate that effective tools to identify 3′-end-processing signals will require more than consensus sequence identification.
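The abstract's remark on single-base variants of AAUAAA can be illustrated with a small sketch. The code below is not the authors' pipeline; it only shows, under stated assumptions, how one might enumerate the 18 single-base variants of the canonical hexamer and tally how many upstream regions contain each word. The function names and the toy sequences are hypothetical and carry no biological meaning.

```python
from collections import Counter

CANONICAL = "AAUAAA"
BASES = "ACGU"


def single_base_variants(word=CANONICAL):
    """Return every word that differs from `word` at exactly one position."""
    variants = []
    for i, original in enumerate(word):
        for base in BASES:
            if base != original:
                variants.append(word[:i] + base + word[i + 1:])
    return variants


def count_signal_words(upstream_regions, words):
    """Count how many regions contain each word at least once.

    Sequences may be given as DNA or RNA, in any case; T is read as U.
    """
    counts = Counter()
    for region in upstream_regions:
        rna = region.upper().replace("T", "U")
        for word in words:
            if word in rna:
                counts[word] += 1
    return counts


# Hypothetical toy regions (illustration only, not real EST data).
toy_regions = [
    "GCGCAAUAAAGCGCGCGCGC",  # canonical AAUAAA
    "GCGCAUUAAAGCGCGCGCGC",  # single-base variant AUUAAA
    "ccgcaataaagcgcgcgcgc",  # canonical, written as lowercase DNA
    "GCGCGCGCGCGCGCGCGCGC",  # no hexamer signal
]

words = [CANONICAL] + single_base_variants()
counts = count_signal_words(toy_regions, words)

for word, n in counts.most_common():
    print(word, n)
```

A tally of this kind only addresses one element. The abstract argues that the complete signal is an aggregate of multiple elements, so a consensus search like this one would be a starting point rather than a detector.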