Interpreting cDNA sequences: Some insights from studies on translation
- 1 August 1996
- journal article
- review article
- Published by Springer Nature in Mammalian Genome
- Vol. 7 (8) , 563-574
- https://doi.org/10.1007/s003359900171
Abstract
This review discusses some rules for assessing the completeness of a cDNA sequence and identifying the start site for translation. Features commonly invoked—such as an ATG codon in a favorable context for initiation, or the presence of an upstream in-frame terminator codon, or the prediction of a signal peptide-like sequence at the amino terminus—have some validity; but examples drawn from the literature illustrate limitations to each of these criteria. The best advice is to inspect a cDNA sequence not only for these positive features but also for the absence of certain negative indicators. Three specific warning signs are discussed and documented: (i) The presence of numerous ATG codons upstream from the presumptive start site for translation often indicates an aberration (sometimes a retained intron) at the 5′ end of the cDNA. (ii) Even one strong, upstream, out-of-frame ATG codon poses a problem if the reading frame set by the upstream ATG overlaps the presumptive start of the major open reading frame. Many cDNAs that display this arrangement turn out to be incomplete; that is, the out-of-frame ATG codon is within, rather than upstream from, the protein coding domain. (iii) A very weak context at the putative start site for translation often means that the cDNA lacks the authentic initiator codon. In addition to presenting some criteria that may aid in recognizing incomplete cDNA sequences, the review includes some advice for using in vitro translation systems for the expression of cDNAs. Some unresolved questions about translational regulation are discussed by way of illustrating the importance of verifying mRNA structures before making deductions about translation.Keywords
This publication has 124 references indexed in Scilit:
- Human Decorin Gene: Intron-Exon Junctions and Chromosomal LocalizationGenomics, 1993
- Genomic organization, sequence analysis, and chromosomal localization of the human car☐yl ester lipase (CEL) gene and a CEL-like (CELL) geneGenomics, 1992
- cDNA that encodes active agrinNeuron, 1992
- The long leader sequence of the mouse ornithine decarboxylase mRNA, previously suspected to be a cloning artifact, is probably a product of recombination with a MuLV-like retrovirusGene, 1991
- Sequence of the cDNA encoding ovine tumor necrosis factor-α: problems with cloning by inverse PCRGene, 1991
- Cloning and characterization of the gene encoding rabbit cardiac calsequestrinGene, 1991
- Mutation in the Gene Encoding the Stimulatory G Protein of Adenylate Cyclase in Albright's Hereditary OsteodystrophyNew England Journal of Medicine, 1990
- Complete primary structure of the transacylase (E2b) subunit of the human branched chain α-keto acid dehydrogenase complexBiochemical and Biophysical Research Communications, 1989
- Structure of the 5' flanking region of the gene encoding human parathyroid-hormone-related protein (PTHrP)Gene, 1989
- Structural organization of the mouse cytosolic malate dehydrogenase gene: Comparison with that of the mouse mitochondrial malate dehydrogenase geneJournal of Molecular Biology, 1988