Inferring Hypotheses on Functional Relationships of Genes: Analysis of the Arabidopsis thaliana Subtilase Gene Family
Open Access
- 23 September 2005
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 1 (4) , e40-312
- https://doi.org/10.1371/journal.pcbi.0010040
Abstract
The gene family of subtilisin-like serine proteases (subtilases) in Arabidopsis thaliana comprises 56 members, divided into six distinct subfamilies. Whereas the members of five subfamilies are similar to pyrolysins, two genes share stronger similarity to animal kexins. Mutant screens confirmed 144 T-DNA insertion lines with knockouts for 55 out of the 56 subtilases. Apart from SDD1, none of the confirmed homozygous mutants revealed any obvious visible phenotypic alteration during growth under standard conditions. Apart from this specific case, forward genetics gave us no hints about the function of the individual 54 non-characterized subtilase genes. Therefore, the main objective of our work was to overcome the shortcomings of the forward genetic approach and to infer alternative experimental approaches by using an integrative bioinformatics and biological approach. Computational analyses based on transcriptional co-expression and co-response pattern revealed at least two expression networks, suggesting that functional redundancy may exist among subtilases with limited similarity. Furthermore, two hubs were identified, which may be involved in signalling or may represent higher-order regulatory factors involved in responses to environmental cues. A particular enrichment of co-regulated genes with metabolic functions was observed for four subtilases possibly representing late responsive elements of environmental stress. The kexin homologs show stronger associations with genes of transcriptional regulation context. Based on the analyses presented here and in accordance with previously characterized subtilases, we propose three main functions of subtilases: involvement in (i) control of development, (ii) protein turnover, and (iii) action as downstream components of signalling cascades. Supplemental material is available in the Plant Subtilase Database (PSDB) (http://csbdb.mpimp-golm.mpg.de/psdb.html) , as well as from the CSB.DB (http://csbdb.mpimp-golm.mpg.de). The first complete plant genome sequence was available for Arabidopsis thaliana, a common weed. The number of genes in the Arabidopsis genome is estimated to be around 25,000. The functions of most of these gene are, however, still unknown. Many genes are grouped into gene families due to conserved sequences and predicted protein structures. In this article, the large subtilisin-like serine protease (subtilase) family of Arabidopsis is analysed. Although 56 subtilase genes have been identified in Arabidopsis, the function of only two subtilases is known. Analysis of mutants has revealed no further hints about the function of the other 54 subtilases. Here the authors present a novel approach to infer hypotheses about functions of the subtilase genes using computational analysis. Based on the analyses presented here and in accordance with previously characterized subtilases, they propose three main functions of subtilases: involvement in (i) control of development, (ii) protein degradation, and (iii) signalling. The results presented can be used to direct further analysis to elucidate functions of subtilases in plants.Keywords
This publication has 65 references indexed in Scilit:
- A gene expression map of Arabidopsis thaliana developmentNature Genetics, 2005
- NASCArrays: a repository for microarray data generated by NASC's transcriptomics serviceNucleic Acids Research, 2004
- Genome-Wide Insertional Mutagenesis of Arabidopsis thalianaScience, 2003
- The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and communityNucleic Acids Research, 2003
- Receptor-like kinases from Arabidopsis form a monophyletic gene family related to animal receptor kinasesProceedings of the National Academy of Sciences, 2001
- Predicting Subcellular Localization of Proteins Based on their N-terminal Amino Acid SequenceJournal of Molecular Biology, 2000
- Taraxalisin – a serine proteinase from dandelion Taraxacum officinale Webb s.lFEBS Letters, 1998
- Mammalian subtilisins: The long-sought dibasic processing endoproteasesCell, 1991
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Metric and Euclidean properties of dissimilarity coefficientsJournal of Classification, 1986