Measuring the accuracy of genome-size multiple alignments
Open Access
- 26 June 2007
- journal article
- Published by Springer Nature in Genome Biology
- Vol. 8 (6), R124
- https://doi.org/10.1186/gb-2007-8-6-r124
Abstract
Whole-genome alignments are invaluable for comparative genomics. Before doing any comparative analysis on a region of interest, one must have confidence in that region's alignment. We provide a methodology to measure the accuracy of arbitrary regions of these alignments, and apply it to the UCSC Genome Browser's 17-vertebrate alignment. We identify 9.7% (21 Mbp) of the human chromosome 1 alignment as suspiciously aligned. We present independent evidence that many of these suspicious regions represent misalignments.