Dichotomous splicing signals in exon flanks
Open Access
- 1 June 2005
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 15 (6) , 768-779
- https://doi.org/10.1101/gr.3217705
Abstract
Intronic elements flanking the splice-site consensus sequences are thought to play a role in pre-mRNA splicing. However, the generality of this role, the catalog of effective sequences, and the mechanisms involved are still lacking. Using molecular genetic tests, we first showed that the ∼50-nt intronic flanking sequences of exons beyond the splice-site consensus are generally important for splicing. We then went on to characterize exon flank sequences on a genomic scale. The G+C content of flanks displayed a bimodal distribution reflecting an exaggeration of this base composition in flanks relative to the gene as a whole. We divided all exons into two classes according to their flank G+C content and used computational and statistical methods to define pentamers of high relative abundance and phylogenetic conservation in exon flanks. Upstream pentamers were often common to the two classes, whereas downstream pentamers were totally different. Upstream and downstream pentamers were often identical around low G+C exons, and in contrast, were often complementary around high G+C exons. In agreement with this complementarity, predicted base pairing was more frequent between the flanks of high G+C exons. Pseudo exons did not exhibit this behavior, but rather tended to form base pairs between flanks and exon bodies. We conclude that most exons require signals in their immediate flanks for efficient splicing. G+C content is a sequence feature correlated with many genetic and genomic attributes. We speculate that there may be different mechanisms for splice site recognition depending on G+C content.Keywords
This publication has 79 references indexed in Scilit:
- hnRNP A1 and the SR Proteins ASF/SF2 and SC35 Have Antagonistic Functions in Splicing of β-Tropomyosin Exon 6BJournal of Biological Chemistry, 2004
- Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine ClassificationGenome Research, 2003
- Determination of the RNA Binding Specificity of the Heterogeneous Nuclear Ribonucleoprotein (hnRNP) H/H′/F/2H9 FamilyJournal of Biological Chemistry, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Processing of Endogenous Pre-mRNAs in Association with SC-35 Domains Is Gene SpecificThe Journal of cell biology, 1999
- What Drives Codon Choices in Human Genes?Journal of Molecular Biology, 1996
- Features of spliceosome evolution and function inferred from an analysis of the information at human splice sitesJournal of Molecular Biology, 1992
- G + C-rich tract in 5′ end of human intronsJournal of Molecular Biology, 1992
- Human pre-mRNA splicing signalsJournal of Theoretical Biology, 1991
- Effects of RNA secondary structure on alternative splicing of Pre-mRNA: Is folding limited to a region behind the transcribing RNA polymerase?Cell, 1988