PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence
Top Cited Papers
Open Access
- 1 July 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (Web Server) , W32-W37
- https://doi.org/10.1093/nar/gkl305
Abstract
Sequence-derived structural and physicochemical features have frequently been used in the development of statistical learning models for predicting proteins and peptides of different structural, functional and interaction profiles. PROFEAT ( Pro tein Feat ures) is a web server for computing commonly-used structural and physicochemical features of proteins and peptides from amino acid sequence. It computes six feature groups composed of ten features that include 51 descriptors and 1447 descriptor values. The computed features include amino acid composition, dipeptide composition, normalized Moreau–Broto autocorrelation, Moran autocorrelation, Geary autocorrelation, sequence-order-coupling number, quasi-sequence-order descriptors and the composition, transition and distribution of various structural and physicochemical properties. In addition, it can also compute previous autocorrelations descriptors based on user-defined properties. Our computational algorithms were extensively tested and the computed protein features have been used in a number of published works for predicting proteins of functional classes, protein–protein interactions and MHC-binding peptides. PROFEAT is accessible at http://jing.cz3.nus.edu.sg/cgi-bin/prof/prof.cgiKeywords
This publication has 43 references indexed in Scilit:
- Classification of Nuclear Receptors Based on Amino Acid Composition and Dipeptide CompositionJournal of Biological Chemistry, 2004
- Classifying G-protein coupled receptors with support vector machinesBioinformatics, 2002
- Support vector machine approach for protein subcellular localization predictionBioinformatics, 2001
- Predicting protein–protein interactions from primary structureBioinformatics, 2001
- Multi-class protein fold recognition using support vector machines and neural networksBioinformatics, 2001
- Accurate Prediction of Protein Secondary Structural ContentProtein Journal, 2001
- Prediction of Protein Subcellular Locations by Incorporating Quasi-Sequence-Order EffectBiochemical and Biophysical Research Communications, 2000
- AAindex: Amino Acid index databaseNucleic Acids Research, 2000
- Themes in RNA-protein recognitionJournal of Molecular Biology, 1999
- Molecular modeling study of diltiazem mimics at L-type calcium channels.Pharmaceutical Research, 1999