A highly annotated whole-genome sequence of a Korean individual by Kim Jong-Il, Ju Young Seok, Park Hansoo, Kim Sheehyun, Lee Seonwook, Yi Jae-Hyuk, Mudge Joann, Miller Neil A, Hong Dongwan, Bell Callum J, Kim Hye-Sun, Chung In-Soon, Lee Woo-Chung, Lee Ji-Sun, Seo Seung-Hyun, Yun Ji-Young, Woo Hyun Nyun, Lee Heewook, Suh Dongwhan, Lee Seungbok, Kim Hyun-Jin, Yavartanoo Maryam, Kwak Minhye, Zheng Ying, Lee Mi Kyeong, Park Hyunjun, Kim Jeong Yeon, Gokcumen Omer, Mills Ryan E, Zaranek Alexander Wait, Thakuria Joseph, Wu Xiaodi, Kim Ryan W, Huntley Jim J, Luo Shujun, Schroth Gary P, Wu Thomas D, Kim HyeRan, Yang Kap-Seok, Park Woong-Yang, Kim Hyungtae, Church George M, Lee Charles, Kingsmore Stephen F, Seo Jeong-Sun in Nature (2009). PubMed

Abstract

Recent advances in sequencing technologies have initiated an era of personal genome sequences. To date, human genome sequences have been reported for individuals with ancestry in three distinct geographical regions: a Yoruba African, two individuals of northwest European origin, and a person from China. Here we provide a highly annotated, whole-genome sequence for a Korean individual, known as AK1. The genome of AK1 was determined by an exacting, combined approach that included whole-genome shotgun sequencing (27.8x coverage), targeted bacterial artificial chromosome sequencing, and high-resolution comparative genomic hybridization using custom microarrays featuring more than 24 million probes. Alignment to the NCBI reference, a composite of several ethnic clades, disclosed nearly 3.45 million single nucleotide polymorphisms (SNPs), including 10,162 non-synonymous SNPs, and 170,202 deletion or insertion polymorphisms (indels). SNP and indel densities were strongly correlated genome-wide. Applying very conservative criteria yielded highly reliable copy number variants for clinical considerations. Potential medical phenotypes were annotated for non-synonymous SNPs, coding domain indels, and structural variants. The integration of several human whole-genome sequences derived from several ethnic groups will assist in understanding genetic ancestry, migration patterns and population bottlenecks.

[ hide abstract ]

Discussed In Paper