Delaunay Tessellation of Proteins: Four Body Nearest-Neighbor Propensities of Amino Acid Residues
- 1 January 1996
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 3 (2), 213-221
- https://doi.org/10.1089/cmb.1996.3.213
Abstract
Delaunay tessellation is applied for the first time in the analysis of protein structure. By representing the amino acid residues in protein chains by their Cα atoms, the protein is described as a set of points in three-dimensional space. Delaunay tessellation of a protein structure generates an aggregate of space-filling irregular tetrahedra, or Delaunay simplices. The vertices of each simplex objectively define four nearest-neighbor Cα atoms, i.e., four nearest-neighbor residues. A simplex classification scheme is introduced in which simplices are divided into five classes based on the relative positions of vertex residues in the protein primary sequence. Statistical analysis of the residue composition of Delaunay simplices reveals nonrandom preferences for certain quadruplets of amino acids to be clustered together. This nonrandom preference may be used to develop a four-body potential for evaluating sequence–structure compatibility for the purpose of inverted structure prediction.
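The classification and composition analysis described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on two labeled assumptions. First, the five classes are taken to be the five ways four vertices can group into runs of sequence-consecutive residues: (4), (3, 1), (2, 2), (2, 1, 1), and (1, 1, 1, 1). Second, nonrandom preference is scored as a log-odds ratio of observed quadruplet frequency to the frequency expected from independent residue frequencies. The function names `sequence_class` and `quadruplet_log_odds` are hypothetical. The simplices are assumed to be given as 4-tuples of residue positions in a single chain, for example as produced by a Delaunay routine such as `scipy.spatial.Delaunay(points).simplices` applied to Cα coordinates.

```python
from collections import Counter
from math import factorial, log10, prod


def sequence_class(simplex):
    """Return the run lengths of sequence-consecutive vertices, largest first.

    Assumption: the five classes are the five integer partitions of 4,
    e.g. (4,) for four consecutive residues and (1, 1, 1, 1) when no two
    vertices are adjacent in the primary sequence.
    """
    idx = sorted(simplex)
    runs = [1]
    for prev, cur in zip(idx, idx[1:]):
        if cur == prev + 1:
            runs[-1] += 1      # extend the current run of neighbors
        else:
            runs.append(1)     # a gap in sequence starts a new run
    return tuple(sorted(runs, reverse=True))


def quadruplet_log_odds(sequence, simplices):
    """Log10 ratio of observed to expected frequency per residue quadruplet.

    Assumption: the expected frequency treats the four residues as
    independent draws from the overall residue composition.
    """
    n = len(sequence)
    freq = {aa: count / n for aa, count in Counter(sequence).items()}
    # Quadruplets are unordered, so sort the residue letters of each simplex.
    observed = Counter(tuple(sorted(sequence[i] for i in s)) for s in simplices)
    total = sum(observed.values())
    scores = {}
    for quad, count in observed.items():
        # Number of distinct orderings of the multiset: 4! / prod(t_i!)
        orderings = factorial(4) // prod(factorial(t) for t in Counter(quad).values())
        expected = orderings * prod(freq[aa] for aa in quad)
        scores[quad] = log10((count / total) / expected)
    return scores


# Toy input: a hypothetical 7-residue chain and hand-written simplices.
sequence = "ACDAACD"
simplices = [(0, 1, 2, 3), (0, 1, 2, 5), (0, 1, 3, 4), (0, 1, 3, 5), (0, 2, 4, 6)]
class_counts = Counter(sequence_class(s) for s in simplices)
scores = quadruplet_log_odds(sequence, simplices)
```

A positive score marks a quadruplet that occurs in simplices more often than its residue frequencies alone would suggest, and a negative score marks one that occurs less often. Real structures would also need handling for chain breaks and multiple chains, which this sketch ignores.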