PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments
Open Access
- 1 July 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (Web Server), W609-W612
- https://doi.org/10.1093/nar/gkl315
Abstract
PAL2NAL is a web server that constructs a multiple codon alignment from the corresponding aligned protein sequences. Such codon alignments can be used to evaluate the type and rate of nucleotide substitutions in coding DNA for a wide range of evolutionary analyses, such as identifying the level of selective constraint acting on genes, or to perform DNA-based phylogenetic studies. The server takes a protein sequence alignment and the corresponding DNA sequences as input. In contrast to other existing applications, it can construct codon alignments even if the input DNA sequence has mismatches with the input protein sequence, or contains untranslated regions and polyA tails. The server can also deal with frameshifts and in-frame stop codons in the input models, and is thus suitable for the analysis of pseudogenes. Another distinct feature is that the user can specify a subregion of the input alignment in order to analyze specific functional domains or exons of interest. The PAL2NAL server is available at http://www.bork.embl.de/pal2nal.
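The core idea of converting a protein alignment into a codon alignment can be illustrated with a minimal Python sketch. This is not the PAL2NAL implementation: it assumes each DNA sequence is an exact, in-frame coding sequence for its protein, so it does not handle the mismatches, untranslated regions, polyA tails, frameshifts or in-frame stop codons described above. The function names `thread_codons` and `codon_alignment` and the example sequences are hypothetical, chosen only for illustration.

```python
def thread_codons(aligned_protein: str, cds: str, gap: str = "-") -> str:
    """Map an ungapped, in-frame coding sequence onto a gapped protein sequence.

    Assumption: cds starts at the first codon of the protein and contains
    no frameshifts; any trailing nucleotides (e.g. a stop codon) are ignored.
    """
    residues = sum(1 for aa in aligned_protein if aa != gap)
    if len(cds) < 3 * residues:
        raise ValueError("coding sequence is shorter than 3 x number of residues")

    pieces = []
    pos = 0  # current offset into the coding sequence
    for aa in aligned_protein:
        if aa == gap:
            # One gap column in the protein alignment becomes three gap columns.
            pieces.append(gap * 3)
        else:
            # One residue consumes the next codon from the coding sequence.
            pieces.append(cds[pos:pos + 3])
            pos += 3
    return "".join(pieces)


def codon_alignment(protein_alignment: dict, coding_sequences: dict) -> dict:
    """Build a codon alignment for every sequence name in the protein alignment."""
    return {
        name: thread_codons(aligned, coding_sequences[name])
        for name, aligned in protein_alignment.items()
    }


# Hypothetical toy input: two aligned proteins and their coding sequences.
protein_alignment = {"seqA": "MK-L", "seqB": "MKAL"}
coding_sequences = {"seqA": "ATGAAACTG", "seqB": "ATGAAGGCTCTG"}

result = codon_alignment(protein_alignment, coding_sequences)
for name, row in result.items():
    print(name, row)
```

In this sketch every protein column maps to exactly three nucleotide columns, so the reading frame is preserved across the whole alignment, which is the property that codon-based substitution analyses rely on.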