A comprehensive transcript index of the human genome generated using microarrays and computational approaches
Open Access
- 23 September 2004
- journal article
- research article
- Published by Springer Nature in Genome Biology
- Vol. 5 (10) , R73
- https://doi.org/10.1186/gb-2004-5-10-r73
Abstract
Background: Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results: The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions: These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized.Keywords
This publication has 51 references indexed in Scilit:
- The Pfam protein families databaseNucleic Acids Research, 2004
- Complete sequencing and characterization of 21,243 full-length human cDNAsNature Genetics, 2003
- The transcriptional activity of human Chromosome 22Genes & Development, 2003
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- rVistafor Comparative Sequence-Based Discovery of Functional Transcription Factor Binding SitesGenome Research, 2002
- The DNA sequence and comparative analysis of human chromosome 20Nature, 2001
- Evaluation of Gene-Finding Programs on Mammalian SequencesGenome Research, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997