A draft annotation and overview of the human genome
Open Access
- 4 July 2001
- journal article
- Published by Springer Nature in Genome Biology
Abstract
The recent draft assembly of the human genome provides a unified basis for describing genomic structure and function. The draft is sufficiently accurate to provide useful annotation, enabling direct observations of previously inferred biological phenomena. We report here a functionally annotated human gene index placed directly on the genome. The index is based on the integration of public transcript, protein, and mapping information, supplemented with computational prediction. We describe numerous global features of the genome and examine the relationship of various genetic maps with the assembly. In addition, initial sequence analysis reveals highly ordered chromosomal landscapes associated with paralogous gene clusters and distinct functional compartments. Finally, these annotation data were synthesized to produce observations of gene density and number that accord well with historical estimates. Such a global approach had previously been described only for chromosomes 21 and 22, which together account for 2.2% of the genome. We estimate that the genome contains 65,000-75,000 transcriptional units, with exon sequences comprising 4%. The creation of a comprehensive gene index requires the synthesis of all available computational and experimental evidence.Keywords
This publication has 59 references indexed in Scilit:
- Assembly, Annotation, and Integration of UNIGENE Clusters into the Human Genome DraftGenome Research, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Genetic variation in the gene encoding calpain-10 is associated with type 2 diabetes mellitusNature Genetics, 2000
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Analysis of Distribution in the Human, Pig, and Rat Genomes Points toward a General Subtelomeric Origin of Minisatellite StructuresGenomics, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Intelligent linkage analysis using gene density estimatesNature Genetics, 1997
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997
- The distribution of CpG islands in mammalian chromosomesNature Genetics, 1994
- An analysis of eukaryotic genomes by density gradient centrifugationJournal of Molecular Biology, 1976