A highly annotated whole-genome sequence of a Korean individual
Open Access
- 8 July 2009
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 460 (7258) , 1011-1015
- https://doi.org/10.1038/nature08211
Abstract
The genome of an anonymous Korean male has been sequenced using a broad spread of genomic techniques. This combinatorial approach allows for detailed characterization of sequence and structural variation. The first four individual genomes to have been determined spanned three distinct ethnic groups: a Yoruba African, northwest European (Craig Venter and James Watson) and Han Chinese. This new work, together with another Korean sequence reported elsewhere, adds the Altaic ethnic grouping to the list. Human genome sequences have so far been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here, using a combination of methods, a highly annotated, whole-genome sequence is provided for a Korean male. Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China1,2,3,4. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shotgun sequencing (27.8× coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades5,6, disclosed nearly 3.45 million single nucleotide polymorphisms (SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.Keywords
This publication has 21 references indexed in Scilit:
- Genetic risk factors for rheumatoid arthritis differ in caucasian and Korean populationsArthritis & Rheumatism, 2009
- Accurate whole human genome sequencing using reversible terminator chemistryNature, 2008
- The diploid genome sequence of an Asian individualNature, 2008
- Genomic Convergence Analysis of Schizophrenia: mRNA Sequencing Reveals Altered Synaptic Vesicular Transport in Post-Mortem CerebellumPLOS ONE, 2008
- Evidence for Natural Selection on Leukocyte Immunoglobulin-like Receptors for HLA Class I in Northeast AsiansAmerican Journal of Human Genetics, 2008
- The complete genome of an individual by massively parallel DNA sequencingNature, 2008
- Transcriptome sequencing of malignant pleural mesothelioma tumorsProceedings of the National Academy of Sciences, 2008
- The Diploid Genome Sequence of an Individual HumanPLoS Biology, 2007
- Dynamic Building of a BAC Clone Tiling Path for the Rat Genome Sequencing ProjectGenome Research, 2004
- Initial sequencing and analysis of the human genomeNature, 2001