Congruence of tissue expression profiles from Gene Expression Atlas, SAGEmap and TissueInfo databases
Open Access
- 29 July 2003
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 4 (1) , 31
- https://doi.org/10.1186/1471-2164-4-31
Abstract
Extracting biological knowledge from large amounts of gene expression information deposited in public databases is a major challenge of the postgenomic era. Additional insights may be derived by data integration and cross-platform comparisons of expression profiles. However, database meta-analysis is complicated by differences in experimental technologies, data post-processing, database formats, and inconsistent gene and sample annotation. We have analysed expression profiles from three public databases: Gene Expression Atlas, SAGEmap and TissueInfo. These are repositories of oligonucleotide microarray, Serial Analysis of Gene Expression and Expressed Sequence Tag human gene expression data respectively. We devised a method, Preferential Expression Measure, to identify genes that are significantly over- or under-expressed in any given tissue. We examined intra- and inter-database consistency of Preferential Expression Measures. There was good correlation between replicate experiments of oligonucleotide microarray data, but there was less coherence in expression profiles as measured by Serial Analysis of Gene Expression and Expressed Sequence Tag counts. We investigated inter-database correlations for six tissue categories, for which data were present in the three databases. Significant positive correlations were found for brain, prostate and vascular endothelium but not for ovary, kidney, and pancreas. We show that data from Gene Expression Atlas, SAGEmap and TissueInfo can be integrated using the UniGene gene index, and that expression profiles correlate relatively well when large numbers of tags are available or when tissue cellular composition is simple. Finally, in the case of brain, we demonstrate that when PEM values show good correlation, predictions of tissue-specific expression based on integrated data are very accurate.
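The abstract names the Preferential Expression Measure (PEM) and the inter-database comparison without giving formulas, so the following Python sketch is an illustration under stated assumptions rather than the authors' implementation. It assumes a PEM-like score of the form log10(observed / expected), where the expected value for a gene in a tissue is the gene's total signal scaled by that tissue's share of all signal, and it assumes that per-database scores are compared with a Pearson correlation over genes that share a UniGene cluster identifier. The function names `pem_scores`, `pearson` and `cross_database_correlation` are hypothetical.

```python
import math


def pem_scores(counts):
    """Return a PEM-like score per gene and tissue.

    counts maps gene ID -> {tissue: count or signal}.
    ASSUMPTION: score = log10(observed / expected), with
    expected = gene_total * tissue_total / grand_total.
    The abstract does not state this formula.
    """
    tissue_totals = {}
    for per_tissue in counts.values():
        for tissue, value in per_tissue.items():
            tissue_totals[tissue] = tissue_totals.get(tissue, 0) + value
    grand_total = sum(tissue_totals.values())

    scores = {}
    for gene, per_tissue in counts.items():
        gene_total = sum(per_tissue.values())
        scores[gene] = {}
        for tissue, observed in per_tissue.items():
            expected = gene_total * tissue_totals[tissue] / grand_total
            # Zero counts have no finite log ratio, so they are skipped here.
            if observed > 0 and expected > 0:
                scores[gene][tissue] = math.log10(observed / expected)
    return scores


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant input")
    return sxy / math.sqrt(sxx * syy)


def cross_database_correlation(scores_a, scores_b, tissue):
    """Correlate scores for one tissue across two databases.

    ASSUMPTION: both score tables are keyed by the same gene index
    (for example UniGene cluster IDs), so shared keys denote the same gene.
    Returns (correlation, number of shared genes).
    """
    shared = sorted(
        gene for gene in scores_a
        if gene in scores_b
        and tissue in scores_a[gene]
        and tissue in scores_b[gene]
    )
    xs = [scores_a[gene][tissue] for gene in shared]
    ys = [scores_b[gene][tissue] for gene in shared]
    return pearson(xs, ys), len(shared)
```

Under these assumptions, a positive score marks a gene as over-represented in a tissue relative to a uniform expectation and a negative score marks it as under-represented. Calling `cross_database_correlation` once per tissue category would give a per-tissue comparison in the spirit of the one the abstract reports.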