Overlapping Genomic Sequences: A Treasure Trove of Single-Nucleotide Polymorphisms
Open Access
- 1 July 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (7) , 748-754
- https://doi.org/10.1101/gr.8.7.748
Abstract
An efficient strategy to develop a dense set of single-nucleotide polymorphism (SNP) markers is to take advantage of the human genome sequencing effort currently under way. Our approach is based on the fact that bacterial artificial chromosomes (BACs) and P1-based artificial chromosomes (PACs) used in long-range sequencing projects come from diploid libraries. If the overlapping clones sequenced are from different lineages, one is comparing the sequences from 2 homologous chromosomes in the overlapping region. We have analyzed in detail every SNP identified while sequencing three sets of overlapping clones found on chromosome 5p15.2, 7q21–7q22, and 13q12–13q13. In the 200.6 kb of DNA sequence analyzed in these overlaps, 153 SNPs were identified. Computer analysis for repetitive elements and suitability for STS development yielded 44 STSs containing 68 SNPs for further study. All 68 SNPs were confirmed to be present in at least one of the three (Caucasian, African-American, Hispanic) populations studied. Furthermore, 42 of the SNPs tested (62%) were informative in at least one population, 32 (47%) were informative in two or more populations, and 23 (34%) were informative in all three populations. These results clearly indicate that developing SNP markers from overlapping genomic sequence is highly efficient and cost effective, requiring only the two simple steps of developing STSs around the known SNPs and characterizing them in the appropriate populations. [The sequence data described in this paper have been submitted to the GenBank data library under accession nos. AC003015 (for GS113423),AC002380 (GS330J10), AC000066 (RG293F11), AC003086 (RG104F04), AC002525(257C22A), and U73331 (96A18A).]Keywords
This publication has 13 references indexed in Scilit:
- Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy AssessmentGenome Research, 1998
- Variations on a Theme: Cataloging Human DNA Sequence VariationScience, 1997
- The use of a genetic map of biallelic markers in linkage studiesNature Genetics, 1997
- Increasing the Information Content of STS-Based Genome Maps: Identifying Polymorphisms in Mapped STSsGenomics, 1996
- A new DNA sequence assembly programNucleic Acids Research, 1995
- Comparative Analysis of Human DNA Variations by Fluorescence-Based Sequencing of PCR ProductsGenomics, 1994
- A New Five-Year Plan for the U.S. Human Genome ProjectScience, 1993
- OSP: a computer program for choosing PCR and DNA sequencing primers.Genome Research, 1991
- An estimate of unique DNA sequence heterozygosity in the human genomeHuman Genetics, 1985
- The Neutral Theory of Molecular EvolutionPublished by Cambridge University Press (CUP) ,1983