Gene Annotation: Prediction and Testing
- 1 September 2003
- journal article
- review article
- Published by Annual Reviews in Annual Review of Genomics and Human Genetics
- Vol. 4 (1) , 69-88
- https://doi.org/10.1146/annurev.genom.4.070802.110300
Abstract
Fifty years after the publication of DNA structure, the whole human genome sequence will be officially finished. This achievement marks the beginning of the task to catalogue every human gene and identify each of their function expression patterns. Currently, researchers estimate that there are about 30,000 human genes and approximately 70% of these can be automatically predicted using a combination of ab initio and similarity-based programs. However, to experimentally investigate every gene's function, the research community requires a high-quality annotation of alternative splicing, pseudogenes, and promoter regions that can only be provided by manual intervention. Manual curation of the human genome will be a long-term project as experimental data are continually produced to confirm or refine the predictions, and new features such as noncoding RNAs and enhancers have not been fully identified. Such a highly curated human gene-set made publicly available will be a great asset for the experimental community and for future comparative genome projects.Keywords
This publication has 98 references indexed in Scilit:
- Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humansProceedings of the National Academy of Sciences, 2002
- Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAsNature, 2002
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Identification and Analysis of Over 2000 Ribosomal Protein Pseudogenes in the Human GenomeGenome Research, 2002
- The Human Genome Browser at UCSCGenome Research, 2002
- Non–coding RNA genes and the modern RNA worldNature Reviews Genetics, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- The human ribosomal protein L6 gene in a critical region for Noonan syndromeJournal of Human Genetics, 2000
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997