Analyzing the simplicial decomposition of spatial protein structures
Open Access
- 13 February 2008
- journal article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 9 (S1) , S11
- https://doi.org/10.1186/1471-2105-9-s1-s11
Abstract
Background: The fast-growing Protein Data Bank (PDB) today contains three-dimensional descriptions of more than 45000 protein and nucleic-acid structures. The large majority of the data in the PDB were obtained by X-ray crystallography, by thousands of researchers over millions of work-hours. Unfortunately, numerous structural errors, bad labels, missing atoms, and falsely identified chains and groups make automated processing of this treasury of structural biological data difficult.
Results: After performing a rigorous restructuring of the whole PDB on a graph-theoretical basis, we created the RS-PDB (Rich-Structure PDB) database. Using this cleaned and repaired database, we defined simplicial complexes on the heavy atoms of the PDB and analyzed the tetrahedra for geometric properties.
Conclusion: We found surprisingly characteristic differences between simplices with atomic vertices of different types, and between the atomic neighborhoods (also described by simplices) of different ligand atoms in proteins.
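As a rough illustration of what "analyzing the tetrahedra for geometric properties" can involve, the sketch below computes the volume and the sorted edge lengths of a single tetrahedron whose vertices are four heavy-atom positions. The abstract does not say which properties were measured or how the simplices were constructed, so the choice of volume and edge lengths, the function names, and the coordinates are all assumptions made for illustration only.

```python
from itertools import combinations
from math import dist


def tetrahedron_volume(a, b, c, d):
    """Volume of the tetrahedron with 3D vertices a, b, c, d."""
    # Edge vectors from vertex a to the other three vertices.
    u = [b[i] - a[i] for i in range(3)]
    v = [c[i] - a[i] for i in range(3)]
    w = [d[i] - a[i] for i in range(3)]
    # Scalar triple product u . (v x w); its absolute value is 6x the volume.
    triple = (
        u[0] * (v[1] * w[2] - v[2] * w[1])
        - u[1] * (v[0] * w[2] - v[2] * w[0])
        + u[2] * (v[0] * w[1] - v[1] * w[0])
    )
    return abs(triple) / 6.0


def edge_lengths(vertices):
    """Sorted lengths of the six edges of a tetrahedron."""
    return sorted(dist(p, q) for p, q in combinations(vertices, 2))


# Hypothetical heavy-atom coordinates in angstroms (not taken from any PDB entry).
atoms = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (0.0, 1.5, 0.0), (0.0, 0.0, 1.5)]

volume = tetrahedron_volume(*atoms)
edges = edge_lengths(atoms)
print(f"volume = {volume:.4f} cubic angstroms")
print("edge lengths:", [round(e, 4) for e in edges])
```

In a full analysis, each tetrahedron of the simplicial complex would be processed this way and the results grouped by the atom types at its vertices; that grouping step is omitted here because the abstract gives no detail about it.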