Large-scale identification of novel transcripts in the human genome
- 31 January 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (3) , 287-292
- https://doi.org/10.1101/gr.5486607
Abstract
Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85% were differentially expressed in various tissue types and that fewer than 20% would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.Keywords
This publication has 29 references indexed in Scilit:
- Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA ClonesPLoS Biology, 2004
- Using the transcriptome to annotate the genomeNature Biotechnology, 2002
- Integrating genomic homology into gene structure predictionBioinformatics, 2001
- Evaluation of Gene-Finding Programs on Mammalian SequencesGenome Research, 2001
- The Human Transcriptome Map: Clustering of Highly Expressed Genes in Chromosomal DomainsScience, 2001
- The Sequence of the Human GenomeScience, 2001
- Experimental annotation of the human genome using microarray technologyNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- cDNA Cloning by Amplification of Circularized First Strand cDNAs Reveals Non-IRE-Regulated Iron-Responsive mRNAsBiochemical and Biophysical Research Communications, 2000
- Analysis of human transcriptomesNature Genetics, 1999