Clustering of housekeeping genes provides a unified model of gene order in the human genome
Top Cited Papers
- 6 May 2002
- journal article
- research article
- Published by Springer Nature in Nature Genetics
- Vol. 31 (2) , 180-183
- https://doi.org/10.1038/ng887
Abstract
It is often supposed that, except for tandem duplicates, genes are randomly distributed throughout the human genome. However, recent analyses suggest that when all the genes expressed in a given tissue (notably placenta1 and skeletal muscle2) are examined, these genes do not map to random locations but instead resolve to clusters. We have asked three questions: (i) is this clustering true for most tissues, or are these the exceptions; (ii) is any clustering simply the result of the expression of tandem duplicates and (iii) how, if at all, does this relate to the observed clustering of genes with high expression rates3? We provide a unified model of gene clustering that explains the previous observations. We examined Serial Analysis of Gene Expression (SAGE)4 data for 14 tissues and found significant clustering, in each tissue, that persists even after the removal of tandem duplicates. We confirmed clustering by analysis of independent expressed-sequence tag (EST) data. We then tested the possibility that the human genome is organized into subregions, each specializing in genes needed in a given tissue. By comparing genes expressed in different tissues, we show that this is not the case: those genes that seem to be tissue-specific in their expression do not, as a rule, cluster. We report that genes that are expressed in most tissues (housekeeping genes) show strong clustering. In addition, we show that the apparent clustering of genes with high expression rates3 is a consequence of the clustering of housekeeping genes.Keywords
This publication has 16 references indexed in Scilit:
- The Human Transcriptome Map: Clustering of Highly Expressed Genes in Chromosomal DomainsScience, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Major factors influencing linkage disequilibrium by analysis of different chromosome regions in distinct populations: demography, chromosome recombination frequency and selectionHuman Molecular Genetics, 2000
- SAGEmap: A Public Gene Expression ResourceGenome Research, 2000
- Complete sequence and gene map of a human major histocompatibility complexNature, 1999
- Higher-order chromatin structure: looping long molecules.Plant Molecular Biology, 1999
- Conservation of gene order: a fingerprint of proteins that physically interactPublished by Elsevier ,1998
- Imprinted genes have few and small intronsNature Genetics, 1996
- Serial Analysis of Gene ExpressionScience, 1995
- HOVERGEN: a database of homologous vertebrate genesNucleic Acids Research, 1994