RevTrans: multiple alignment of coding DNA from aligned amino acid sequences
Top Cited Papers
Open Access
- 1 July 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (13) , 3537-3539
- https://doi.org/10.1093/nar/gkg609
Abstract
The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit from the information that is implicit in empirical substitution matrices such as BLOSUM-62. Taken together with the generally higher rate of synonymous mutations over non-synonymous ones, this means that the phylogenetic signal disappears much more rapidly from DNA sequences than from the encoded proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA alignment by 'reverse translation' of the aligned protein sequences. In the resulting DNA alignment, gaps occur in groups of three corresponding to entire codons, and analogous codon positions are therefore always lined up. These features are useful when constructing multiple DNA alignments for phylogenetic analysis. RevTrans also accepts user-provided protein alignments for greater control of the alignment process. The RevTrans web server is freely available at http://www.cbs.dtu.dk/services/RevTrans/.Keywords
This publication has 12 references indexed in Scilit:
- Statistical methods for detecting molecular adaptationPublished by Elsevier ,2000
- Estimating Synonymous and Nonsynonymous Substitution Rates Under Realistic Evolutionary ModelsMolecular Biology and Evolution, 2000
- DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment.Bioinformatics, 1999
- Synonymous and nonsynonymous rate variation in nuclear genes of mammalsJournal of Molecular Evolution, 1998
- [15] Effective protein sequence comparisonPublished by Elsevier ,1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- An Algorithm Combining DNA and Protein AlignmentJournal of Theoretical Biology, 1994
- Genomic alignmentJournal of Molecular Evolution, 1994
- Amino acid substitution matrices from protein blocks.Proceedings of the National Academy of Sciences, 1992
- Relationships of Human Protein Sequences to Those of Other OrganismsPublished by Cold Spring Harbor Laboratory ,1986