Predicted role for the archease protein family based on structural and sequence analysis of TM1083 and MTH1598, two proteins structurally characterized through structural genomics efforts
- 28 April 2004
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 56 (1) , 19-27
- https://doi.org/10.1002/prot.20141
Abstract
Recently, the structures of two proteins belonging to the archease family, TM1083 from Thermotoga maritima and MTH1598 from Methanobacterium thermoautotrophicum, have been solved independently by two Protein Structure Initiative structural genomics pilot centers using X‐ray crystallography and NMR, respectively. The archease protein family is a good example of one of the paradoxes of structural genomics: Approximately one third of protein structures produced by structural genomics centers have no known function and are still annotated as “hypothetical proteins” in the Protein Data Bank. In the case of archeases, despite the existence of two protein structures and abundant sequence information, there is still no function assigned to this protein family. Here, our group predicts, based on structural similarity, sequence conservation, and gene context analyses, that members of this protein family might function as chaperones or modulators of proteins involved in DNA/RNA processing. The conservation of genomic context for this protein family is constant from Archaea and Bacteria to humans, and suggests that unannotated open reading frames contiguous to them could be novel RNA/DNA binding proteins. Proteins 2004.Keywords
This publication has 31 references indexed in Scilit:
- SCOP: A structural classification of proteins database for the investigation of sequences and structuresPublished by Elsevier ,2006
- The ERGOTM genome analysis and discovery systemNucleic Acids Research, 2003
- Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipelineProceedings of the National Academy of Sciences, 2002
- An NMR approach to structural proteomicsProceedings of the National Academy of Sciences, 2002
- The Pfam Protein Families DatabaseNucleic Acids Research, 2002
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal pathProtein Engineering, Design and Selection, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Analysis of topological and nontopological structural similarities in the PDB: New examples with old structuresProteins-Structure Function and Bioinformatics, 1996
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Protein Structure Comparison by Alignment of Distance MatricesJournal of Molecular Biology, 1993