Derivation of an amino acid similarity matrix for peptide:MHC binding and its application as a Bayesian prior
Open Access
- 30 November 2009
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 10 (1) , 1-11
- https://doi.org/10.1186/1471-2105-10-394
Abstract
Background: Experts in peptide:MHC binding studies are often able to estimate the impact of a single residue substitution based on a heuristic understanding of amino acid similarity in an experimental context. Our aim is to quantify this measure of similarity to improve peptide:MHC binding prediction methods. This should help compensate for holes and bias in the sequence space coverage of existing peptide binding datasets. Results: Here, a novel amino acid similarity matrix (PMBEC) is directly derived from the binding affinity data of combinatorial peptide mixtures. Like BLOSUM62, this matrix captures well-known physicochemical properties of amino acid residues. However, PMBEC differs markedly from existing matrices in cases where residue substitution involves a reversal of electrostatic charge. To demonstrate its usefulness, we have developed a new peptide:MHC class I binding prediction method, using the matrix as a Bayesian prior. We show that the new method can compensate for missing information on specific residues in the training data. We also carried out a large-scale benchmark, and its results indicate that prediction performance of the new method is comparable to that of the best neural network based approaches for peptide:MHC class I binding. Conclusion: A novel amino acid similarity matrix has been derived for peptide:MHC binding interactions. One prominent feature of the matrix is that it disfavors substitution of residues with opposite charges. Given that the matrix was derived from experimentally determined peptide:MHC binding affinity measurements, this feature is likely shared by all peptide:protein interactions. In addition, we have demonstrated the usefulness of the matrix as a Bayesian prior in an improved scoring-matrix based peptide:MHC class I prediction method. A software implementation of the method is available at: http://www.mhc-pathway.net/smmpmbec.Keywords
This publication has 27 references indexed in Scilit:
- A Detailed Analysis of the Murine TAP Transporter Substrate SpecificityPLOS ONE, 2008
- NetMHC-3.0: accurate web accessible predictions of human, mouse and monkey MHC class I affinities for peptides of length 8–11Nucleic Acids Research, 2008
- Evaluation of MHC class I peptide binding prediction servers: Applications for vaccine researchBMC Immunology, 2008
- Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide librariesImmunome Research, 2008
- The validity of predicted T-cell epitopesTrends in Biotechnology, 2006
- A Community Resource Benchmarking Predictions of Peptide Binding to MHC-I MoleculesPLoS Computational Biology, 2006
- Reliable prediction of T‐cell epitopes using neural networks with novel sequence representationsProtein Science, 2003
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- An Assessment of Amino Acid Exchange Matrices in Aligning Protein Sequences: The Twilight Zone RevisitedJournal of Molecular Biology, 1995
- A Structural Basis for Sequence ComparisonsJournal of Molecular Biology, 1993