Patterns of Variant Polyadenylation Signal Usage in Human Genes
Top Cited Papers
Open Access
- 1 July 2000
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (7) , 1001-1010
- https://doi.org/10.1101/gr.10.7.1001
Abstract
The formation of mature mRNAs in vertebrates involves the cleavage and polyadenylation of the pre-mRNA, 10–30 nt downstream of an AAUAAA or AUUAAA signal sequence. The extensive cDNA data now available shows that these hexamers are not strictly conserved. In order to identify variant polyadenylation signals on a large scale, we compared over 8700 human 3′ untranslated sequences to 157,775 polyadenylated expressed sequence tags (ESTs), used as markers of actual mRNA 3′ ends. About 5600 EST-supported putative mRNA 3′ ends were collected and analyzed for significant hexameric sequences. Known polyadenylation signals were found in only 73% of the 3′ fragments. Ten single-base variants of the AAUAAA sequence were identified with a highly significant occurrence rate, potentially representing 14.9% of the actual polyadenylation signals. Of the mRNAs, 28.6% displayed two or more polyadenylation sites. In these mRNAs, the poly(A) sites proximal to the coding sequence tend to use variant signals more often, while the 3′-most site tends to use a canonical signal. The average number of ESTs associated with each signal type suggests that variant signals (including the common AUUAAA) are processed less efficiently than the canonical signal and could therefore be selected for regulatory purposes. However, the position of the site in the untranslated region may also play a role in polyadenylation rate.Keywords
This publication has 67 references indexed in Scilit:
- Genomic organization of four β-1,4-endoglucanase genes in plant-parasitic cyst nematodes and its evolutionary implicationsGene, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Multiple transcripts of the murine immunoglobulin ϵ membrane locus are generated by alternative splicing and differential usage of two polyadenylation sitesMolecular Immunology, 1997
- Clinical Expression of a Rare β-Globin Gene Mutation Co-Inherited with Haemoglobin E-Diseasecclm, 1996
- The Drosophila melanogaster homolog of the mammalian MAPK-activated protein kinase-2 (MAPKAPK-2) lacks a proline-rich N terminusGene, 1995
- 3′‐end processing of the maize 27 kDa zein mRNAThe Plant Journal, 1993
- dbEST — database for “expressed sequence tags”Nature Genetics, 1993
- Silent carrier β-thalassaemia due to a severe β-globin mutation interacting with other genetic elementsEuropean Journal of Pediatrics, 1993
- Structure and expression of the human θl globin geneNature, 1988
- Sequence determination of the 3? end of mouse mammary tumor virus RNAMolecular Biology Reports, 1981