Local Coexpression Domains of Two to Four Genes in the Genome of Arabidopsis
Open Access
- 27 May 2005
- journal article
- Published by Oxford University Press (OUP) in Plant Physiology
- Vol. 138 (2) , 923-934
- https://doi.org/10.1104/pp.104.055673
Abstract
Expression of genes in eukaryotic genomes is known to cluster, but cluster size is generally loosely defined and highly variable. We have here taken a very strict definition of cluster as sets of physically adjacent genes that are highly coexpressed and form so-called local coexpression domains. The Arabidopsis (Arabidopsis thaliana) genome was analyzed for the presence of such local coexpression domains to elucidate its functional characteristics. We used expression data sets that cover different experimental conditions, organs, tissues, and cells from the Massively Parallel Signature Sequencing repository and microarray data (Affymetrix) from a detailed root analysis. With these expression data, we identified 689 and 1,481 local coexpression domains, respectively, consisting of two to four genes with a pairwise Pearson's correlation coefficient larger than 0.7. This number is approximately 1- to 5-fold higher than the numbers expected by chance. A small (5%–10%) yet significant fraction of genes in the Arabidopsis genome is therefore organized into local coexpression domains. These local coexpression domains were distributed over the genome. Genes in such local domains were for the major part not categorized in the same functional category (GOslim). Neither tandemly duplicated genes nor shared promoter sequence nor gene distance explained the occurrence of coexpression of genes in such chromosomal domains. This indicates that other parameters in genes or gene positions are important to establish coexpression in local domains of Arabidopsis chromosomes.
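The domain definition in the abstract (two to four physically adjacent genes whose pairwise Pearson's correlation coefficient exceeds 0.7) can be illustrated with a minimal sketch. The abstract does not say how overlapping candidate windows were resolved, so the greedy left-to-right scan below is an assumption, as are the function name `local_coexpression_domains`, its parameters, and the input layout (one row per gene, rows already sorted by chromosomal position). It is not the authors' implementation.

```python
import numpy as np


def local_coexpression_domains(expr, threshold=0.7, min_size=2, max_size=4):
    """Find runs of adjacent genes that are all pairwise coexpressed.

    expr: 2D array-like, one row per gene (in chromosomal order, an
          assumed input layout), one column per sample or condition.
    Returns a list of (first_index, last_index) tuples, inclusive.
    """
    expr = np.asarray(expr, dtype=float)
    corr = np.corrcoef(expr)  # gene-by-gene Pearson correlation matrix
    n_genes = corr.shape[0]

    domains = []
    start = 0
    while start < n_genes - 1:
        size = 1
        # Grow the window while the next gene correlates above the
        # threshold with every gene already in the window. Genes with
        # constant expression give NaN correlations, which compare as
        # False here and therefore never join a domain.
        while (
            size < max_size
            and start + size < n_genes
            and np.all(corr[start + size, start:start + size] > threshold)
        ):
            size += 1
        if size >= min_size:
            domains.append((start, start + size - 1))
            start += size  # assumed: domains do not overlap
        else:
            start += 1
    return domains


if __name__ == "__main__":
    # Toy data (invented for illustration): genes 0-2 rise together,
    # gene 3 falls, genes 4-5 share a zigzag pattern.
    toy = [
        [1, 2, 3, 4, 5, 6],
        [3, 5, 7, 9, 11, 13],
        [1, 2, 3, 4, 5, 7],
        [6, 5, 4, 3, 2, 1],
        [1, 5, 2, 6, 1, 5],
        [2, 10, 4, 12, 2, 10],
    ]
    print(local_coexpression_domains(toy))
```

To apply this per chromosome, the scan would be run separately on each chromosome's genes so that a window never spans two chromosomes. The comparison against chance mentioned in the abstract could be approximated by repeating the scan on randomly permuted gene orders, although the abstract does not specify the randomization scheme used.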