Combining Microarray‐based Genomic Selection (MGS) with the Illumina Genome Analyzer Platform to Sequence Diploid Target Regions
Open Access
- 6 August 2009
- journal article
- research article
- Published by Wiley in Annals of Human Genetics
- Vol. 73 (5) , 502-513
- https://doi.org/10.1111/j.1469-1809.2009.00530.x
Abstract
Novel methods of targeted sequencing of unique regions from complex eukaryotic genomes have generated a great deal of excitement, but critical demonstrations of these methods' efficacy with respect to diploid genotype calling and experimental variation are lacking. To address this issue, we optimized microarray‐based genomic selection (MGS) for use with the Illumina Genome Analyzer (IGA). A set of 202 fragments (304 kb total) contained within a 1.7 Mb genomic region on human chromosome X were MGS/IGA sequenced in ten female HapMap samples, generating a total of 2.4 GB of DNA sequence. At a minimum coverage threshold of 5X, 93.9% of all bases and 94.9% of segregating sites were called, while 57.7% of bases (57.4% of segregating sites) were called at a 50X threshold. Data accuracy at known segregating sites was 98.9% at 5X coverage, rising to 99.6% at 50X coverage. Accuracy at homozygous sites was 98.7% at 5X sequence coverage and 99.5% at 50X coverage. Although accuracy at heterozygous sites was modestly lower, it was still over 92% at 5X coverage and increased to nearly 97% at 50X coverage. These data provide the first demonstration that MGS/IGA sequencing can generate the very high quality sequence data necessary for human genetics research. All sequences generated in this study have been deposited in the NCBI Short Read Archive (http://www.ncbi.nlm.nih.gov/Traces/sra, Accession # SRA007913).
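The two quantities reported above, the fraction of positions called at a minimum coverage threshold and the accuracy of the calls that pass it, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names `call_rate` and `concordance`, and the toy depths and genotypes, are hypothetical and chosen only to show the arithmetic.

```python
from typing import Optional, Sequence


def call_rate(depths: Sequence[int], min_coverage: int) -> float:
    """Fraction of positions whose read depth meets the coverage threshold."""
    if not depths:
        raise ValueError("depths must be non-empty")
    called = sum(1 for d in depths if d >= min_coverage)
    return called / len(depths)


def concordance(
    calls: Sequence[str],
    truth: Sequence[str],
    depths: Sequence[int],
    min_coverage: int,
) -> Optional[float]:
    """Fraction of called sites whose genotype matches a trusted genotype.

    Only sites passing the coverage threshold are compared.
    Returns None when no site passes.
    """
    pairs = [(c, t) for c, t, d in zip(calls, truth, depths) if d >= min_coverage]
    if not pairs:
        return None
    return sum(1 for c, t in pairs if c == t) / len(pairs)


# Toy data (invented for illustration): per-site depth, called genotype,
# and a trusted genotype such as one from an independent genotyping source.
depths = [3, 7, 60, 55, 12, 4, 80, 5]
calls = ["AA", "AG", "GG", "AG", "AA", "GG", "AG", "AA"]
truth = ["AA", "AG", "GG", "AA", "AA", "GG", "AG", "AG"]

for threshold in (5, 50):
    rate = call_rate(depths, threshold)
    acc = concordance(calls, truth, depths, threshold)
    print(f"{threshold}X: call rate {rate:.3f}, accuracy {acc:.3f}")
```

Raising the threshold lowers the call rate because fewer sites qualify, which is the same trade-off the abstract reports between 5X and 50X. With real data, accuracy would typically be evaluated separately for homozygous and heterozygous sites, as the abstract does.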