Sequence Diversity and Large-Scale Typing of SNPs in the Human Apolipoprotein E Gene
Open Access
- 1 October 2000
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 10 (10) , 1532-1545
- https://doi.org/10.1101/gr.146900
Abstract
A common strategy for genotyping large samples begins with the characterization of human single nucleotide polymorphisms (SNPs) by sequencing candidate regions in a small sample for SNP discovery. This is usually followed by typing in a large sample those sites observed to vary in a smaller sample. We present results from a systematic investigation of variation at the human apolipoprotein E locus (APOE), as well as the evaluation of the two-tiered sampling strategy based on these data. We sequenced 5.5 kb spanning the entireAPOE genomic region in a core sample of 72 individuals, including 24 each of African-Americans from Jackson, Mississippi; European-Americans from Rochester, Minnesota; and Europeans from North Karelia, Finland. This sequence survey detected 21 SNPs and 1 multiallelic indel, 14 of which had not been previously reported. Alleles varied in relative frequency among the populations, and 10 sites were polymorphic in only a single population sample. Oligonucleotide ligation assays (OLA) were developed for 20 of these sites (omitting the indel and a closely-linked SNP). These were then scored in 2179 individuals sampled from the same three populations (n = 843, 884, and 452, respectively). Relative allele frequencies were generally consistent with estimates from the core sample, although variation was found in some populations in the larger sample at SNPs that were monomorphic in the corresponding smaller core sample. Site variation in the larger samples showed no systematic deviation from Hardy-Weinberg expectation. The large OLA sample clearly showed that variation in many, but not all, of OLA-typed SNPs is significantly correlated with the classical protein-coding variants, implying that there may be important substructure within the classical ɛ2, ɛ3, and ɛ4 alleles. Comparison of the levels and patterns of polymorphism in the core samples with those estimated for the OLA-typed samples shows how nucleotide diversity is underestimated when only a subset of sites are typed and underscores the importance of adequate population sampling at the polymorphism discovery stage. [The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF261279.]Keywords
This publication has 63 references indexed in Scilit:
- Circumventing multiple testing: A multilocus Monte Carlo approach to testing for associationGenetic Epidemiology, 2000
- PipMaker—A Web Server for Aligning Two Genomic DNA SequencesGenome Research, 2000
- A 4-Mb High-Density Single Nucleotide Polymorphism-Based Map around Human APOEGenomics, 1998
- Allelic polymorphisms in the transcriptional regulatory region of apolipoprotein E geneFEBS Letters, 1998
- A polymorphism in the regulatory region of APOE associated with risk for Alzheimer's dementiaNature Genetics, 1998
- Nuclear DNA diversity in worldwide distributed human populationsGene, 1997
- Genetic Dissection of Complex TraitsScience, 1994
- Genetic associations with human longevity at the APOE and ACE lociNature Genetics, 1994
- Gene Dose of Apolipoprotein E Type 4 Allele and the Risk of Alzheimer's Disease in Late Onset FamiliesScience, 1993
- Type III hyperlipoproteinemia associated with apolipoprotein E phenotype E3/3. Structure and genetics of an apolipoprotein E3 variant.Journal of Clinical Investigation, 1989