Quantitative Estimates of Sequence Divergence for Comparative Analyses of Mammalian Genomes
Open Access
- 1 May 2003
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 13 (5) , 813-820
- https://doi.org/10.1101/gr.1064503
Abstract
Comparative sequence analyses on a collection of carefully chosen mammalian genomes could facilitate identification of functional elements within the human genome and allow quantification of evolutionary constraint at the single nucleotide level. High-resolution quantification would be informative for determining the distribution of important positions within functional elements and for evaluating the relative importance of nucleotide sites that carry single nucleotide polymorphisms (SNPs). Because the level of resolution in comparative sequence analyses is a direct function of sequence diversity, we propose that the information content of a candidate mammalian genome be defined as the sequence divergence it would add relative to already-sequenced genomes. We show that reliable estimates of genomic sequence divergence can be obtained from small genomic regions. On the basis of a multiple sequence alignment of ∼1.4 megabases each from eight mammals, we generate such estimates for five unsequenced mammals. Estimates of the neutral divergence in these data suggest that a small number of diverse mammalian genomes in addition to human, mouse, and rat would allow single nucleotide resolution in comparative sequence analyses.[The multiple sequence alignment of theCFTR region and a spreadsheet with the calculations performed, will be available as supplementary information online atwww.genome.org.]Keywords
This publication has 19 references indexed in Scilit:
- LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNAGenome Research, 2003
- Sequence First. Ask Questions Later.Cell, 2002
- A Comparison of Whole-Genome Shotgun-Derived Mouse Chromosome 16 and the Human GenomeScience, 2002
- The Human Genome Browser at UCSCGenome Research, 2002
- Transcriptional Regulation of the Stem Cell Leukemia Gene (SCL) — Comparative Analysis of Five Vertebrate SCL LociGenome Research, 2002
- Resolution of the Early Placental Mammal Radiation Using Bayesian PhylogeneticsScience, 2001
- Human and Mouse ABCA1 Comparative Sequencing and Transgenesis Studies Revealing Novel Regulatory SequencesGenomics, 2001
- An Efficient Cis-Element Discovery Method Using Multiple Sequence Comparisons Based on Evolutionary RelationshipsGenomics, 2001
- Synonymous and nonsynonymous substitutions in mammalian genes and the nearly neutral theoryJournal of Molecular Evolution, 1995
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994