Automated Whole-Genome Multiple Alignment of Rat, Mouse, and Human
Open Access
- 1 April 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (4) , 685-692
- https://doi.org/10.1101/gr.2067704
Abstract
We have built a whole-genome multiple alignment of the three currently available mammalian genomes using a fully automated pipeline that combines the local/global approach of the Berkeley Genome Pipeline and the LAGAN program. The strategy is based on progressive alignment and consists of two main steps: (1) alignment of the mouse and rat genomes, and (2) alignment of human to either the mouse-rat alignments from step 1, or the remaining unaligned mouse and rat sequences. The resulting alignments demonstrate high sensitivity, with 87% of all human gene-coding areas aligned in both mouse and rat. The specificity is also high: 100 kb agree with a three-way synteny map built independently, using predicted exons in the three genomes. At the nucleotide level <1% of the rat nucleotides are mapped to multiple places in the human sequence in the alignment, and 96.5% of human nucleotides within all alignments agree with the synteny map. The alignments are publicly available online, with visualization through the novel Multi-VISTA browser that we also present.Keywords
This publication has 28 references indexed in Scilit:
- Characterization of Evolutionary Rates and Constraints in Three Mammalian GenomesGenome Research, 2004
- Genome sequence of the Brown Norway rat yields insights into mammalian evolutionNature, 2004
- Sequencing and comparison of yeast species to identify genes and regulatory elementsNature, 2003
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- The Human Genome Browser at UCSCGenome Research, 2002
- Transcriptional Regulation of the Stem Cell Leukemia Gene (SCL) — Comparative Analysis of Five Vertebrate SCL LociGenome Research, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Sequence alignment and penalty choiceJournal of Molecular Biology, 1994