Completeness in structural genomics
Top Cited Papers
- 1 June 2001
- journal article
- research article
- Published by Springer Nature in Nature Structural & Molecular Biology
- Vol. 8 (6) , 559-566
- https://doi.org/10.1038/88640
Abstract
Structural genomics has the goal of obtaining useful, three-dimensional models of all proteins by a combination of experimental structure determination and comparative model building. We evaluate different strategies for optimizing information return on effort. The strategy that maximizes structural coverage requires about seven times fewer structure determinations compared with the strategy in which targets are selected at random, With a choice of reasonable model quality and the goal of 90% coverage, we extrapolate the estimate of the total effort of structural genomics. It would take similar to 16,000 carefully selected structure determinations to construct useful atomic models for the vast majority of all proteins. In practice, unless there is global coordination of target selection, the total effort will likely increase by a factor of three. The task can be accomplished within a decade provided that selection of targets is highly coordinated and significant funding is available.Keywords
This publication has 9 references indexed in Scilit:
- The Sequence of the Human GenomeScience, 2001
- Initial sequencing and analysis of the human genomeNature, 2001
- Towards a covering set of protein family profilesProgress in Biophysics and Molecular Biology, 2000
- Protein modelling for allTrends in Biochemical Sciences, 1999
- Patterns of protein-fold usage in eight microbial genomes: A comprehensive structural censusProteins-Structure Function and Bioinformatics, 1998
- Large-scale protein structure modeling of the Saccharomyces cerevisiae genomeProceedings of the National Academy of Sciences, 1998
- Class‐directed structure determination: Foundation for a protein structure initiativeProtein Science, 1998
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Assessment of comparative modeling in CASP2Proteins-Structure Function and Bioinformatics, 1997