Knowledge-based selection of targets for structural genomics
Open Access
- 1 March 2002
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 15 (3) , 169-183
- https://doi.org/10.1093/protein/15.3.169
Abstract
The problem of rational target selection for protein structure determination in structural genomics projects on microbes is addressed. A flexible computational procedure is described that directly incorporates the whole body of annotation available in the PEDANT genome database into the sequence clustering and selection process in order to identify proteins that are likely to possess currently unknown structural domains. Filtering out gene products based on predicted structural features, such as known three-dimensional structures and transmembrane regions, allows one to reduce the complexity of neighbor relationships between sequences and all but eliminates the need for further partitioning of single-linkage clusters into disjoint protein groups corresponding to homologous families. The results of a large-scale computation experiment in which exemplary target selection for 32 prokaryotic genomes was conducted are presented.Keywords
This publication has 43 references indexed in Scilit:
- Structural proteomics of an archaeon.Nature Structural & Molecular Biology, 2000
- GeneRAGE: a robust algorithm for sequence clustering and domain detectionBioinformatics, 2000
- The ASTRAL compendium for protein structure and sequence analysisNucleic Acids Research, 2000
- The Pfam Protein Families DatabaseNucleic Acids Research, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequenceNature, 1998
- Profile hidden Markov models.Bioinformatics, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Cytoplasmic signalling domains: the next generationTrends in Biochemical Sciences, 1997
- The Solution Structure of the S1 RNA Binding Domain: A Member of an Ancient Nucleic Acid–Binding FoldCell, 1997