TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations
Top Cited Papers
Open Access
- 30 April 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 38 (suppl_2) , W7-W13
- https://doi.org/10.1093/nar/gkq291
Abstract
We present TranslatorX, a web server designed to align protein-coding nucleotide sequences based on their corresponding amino acid translations. Many comparisons between biological sequences (nucleic acids and proteins) involve the construction of multiple alignments. Alignments represent a statement regarding the homology between individual nucleotides or amino acids within homologous genes. As protein-coding DNA sequences evolve as triplets of nucleotides (codons) and it is known that sequence similarity degrades more rapidly at the DNA than at the amino acid level, alignments are generally more accurate when based on amino acids than on their corresponding nucleotides. TranslatorX novelties include: (i) use of all documented genetic codes and the possibility of assigning different genetic codes for each sequence; (ii) a battery of different multiple alignment programs; (iii) translation of ambiguous codons when possible; (iv) an innovative criterion to clean nucleotide alignments with GBlocks based on protein information; and (v) a rich output, including Jalview-powered graphical visualization of the alignments, codon-based alignments coloured according to the corresponding amino acids, measures of compositional bias and first, second and third codon position specific alignments. The TranslatorX server is freely available at http://translatorx.co.uk.Keywords
This publication has 20 references indexed in Scilit:
- The Phylogenetic Informativeness of Nucleotide and Amino Acid Sequences for Reconstructing the Vertebrate TreeJournal of Molecular Evolution, 2008
- Relative character-state space, amount of potential phylogenetic information, and heterogeneity of nucleotide and amino acid charactersMolecular Phylogenetics and Evolution, 2004
- MUSCLE: multiple sequence alignment with high accuracy and high throughputNucleic Acids Research, 2004
- The Jalview Java alignment editorBioinformatics, 2004
- A Simple, Fast, and Accurate Algorithm to Estimate Large Phylogenies by Maximum LikelihoodSystematic Biology, 2003
- RevTrans: multiple alignment of coding DNA from aligned amino acid sequencesNucleic Acids Research, 2003
- Amino acid vs. nucleotide characters: challenging preconceived notionsMolecular Phylogenetics and Evolution, 2002
- T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- Selection of Conserved Blocks from Multiple Alignments for Their Use in Phylogenetic AnalysisMolecular Biology and Evolution, 2000
- Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites.Molecular Biology and Evolution, 1993