Diversity of preferred nucleotide sequences around the translation initiation codon in eukaryote genomes
Open Access
- 21 November 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (3) , 861-871
- https://doi.org/10.1093/nar/gkm1102
Abstract
Understanding regulatory mechanisms of protein synthesis in eukaryotes is essential for the accurate annotation of genome sequences. Kozak reported that the nucleotide sequence GCCGCC(A/G)CCAUGG (AUG is the initiation codon) was frequently observed in vertebrate genes and that this ‘consensus’ sequence enhanced translation initiation. However, later studies using invertebrate, fungal and plant genes reported different ‘consensus’ sequences. In this study, we conducted extensive comparative analyses of nucleotide sequences around the initiation codon by using genomic data from 47 eukaryote species including animals, fungi, plants and protists. The analyses revealed that preferred nucleotide sequences are quite diverse among different species, but differences between patterns of nucleotide bias roughly reflect the evolutionary relationships of the species. We also found strong biases of A/G at position −3, A/C at position −2 and C at position +5 that were commonly observed in all species examined. Genes with higher expression levels showed stronger signals, suggesting that these nucleotides are responsible for the regulation of translation initiation. The diversity of preferred nucleotide sequences around the initiation codon might be explained by differences in relative contributions from two distinct patterns, GCCGCCAUG and AAAAAAAUG, which implies the presence of multiple molecular mechanisms for controlling translation initiation.
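The position-specific biases described in the abstract (for example A/G at position −3) can be illustrated with a minimal sketch that tallies nucleotide frequencies at each position around the initiation codon. This is not the authors' pipeline: the function names, the window layout and the four toy sequences are assumptions made up for illustration. Positions follow the usual convention in which the A of AUG is +1, the preceding nucleotide is −1, and there is no position 0.

```python
from collections import Counter


def kozak_position(offset):
    """Convert a 0-based offset from the A of AUG into +1/-1 numbering (no position 0)."""
    return offset + 1 if offset >= 0 else offset


def position_frequencies(windows, upstream):
    """Return {position: {nucleotide: frequency}} for aligned windows.

    `windows` are equal-length RNA strings aligned so that the AUG
    starts at index `upstream` in every string (an assumed layout).
    """
    n = len(windows)
    length = len(windows[0])
    if any(len(w) != length for w in windows):
        raise ValueError("all windows must have the same length")
    freqs = {}
    for i in range(length):
        counts = Counter(w[i] for w in windows)
        freqs[kozak_position(i - upstream)] = {
            nt: c / n for nt, c in counts.items()
        }
    return freqs


# Hypothetical toy windows: 6 nt upstream, AUG, 2 nt downstream.
windows = [
    "GCCACCAUGGC",
    "GCCGCCAUGGC",
    "AAAAAAAUGGC",
    "UUCAACAUGUC",
]
freqs = position_frequencies(windows, upstream=6)
print(freqs[-3])  # frequencies of each nucleotide at position -3
```

With real data, the windows would be extracted from annotated coding sequences for each species, and the observed frequencies would typically be compared against a background composition before calling a position biased.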