Comprehensive sampling of gene expression in human cell lines with massively parallel signature sequencing
Open Access
- 15 April 2003
- journal article
- Published in Proceedings of the National Academy of Sciences
- Vol. 100 (8) , 4702-4705
- https://doi.org/10.1073/pnas.0831040100
Abstract
Whereas information is rapidly accumulating about the structure and position of genes encoded in the human genome, less is known about the complexity and relative abundance of their expression in individual human cells and tissues. Here, we describe the characteristics of the transcriptomes of two cultured cell lines, HB4a (normal breast epithelium) and HCT-116 (colon adenocarcinoma), using massively parallel signature sequencing (MPSS). We generated in excess of 10^7 short signature sequences per cell line, thus providing a comprehensive snapshot of gene expression, within the technical limitations of the method. The number of genes expressed at one copy per cell or more in either of the lines was estimated to be between 10,000 and 15,000. The vast majority of the transcripts found in these cells can be mapped to known genes and their polyadenylation variants. Among the genes that could be identified from their signature sequences, ≈8,500 were expressed by both cell lines, whereas 6,000 showed cellular specificity. Taking into account sequence tags that map uniquely to the genome but not to known transcripts, overall the data are consistent with an upper limit of 17,000 for the total number of genes expressed at more than one copy per cell in one or both of the two cell lines examined.