Toward Prediction of Class II Mouse Major Histocompatibility Complex Peptide Binding Affinity: in Silico Bioinformatic Evaluation Using Partial Least Squares, a Robust Multivariate Statistical Technique
- 18 December 2005
- journal article
- Published by American Chemical Society (ACS) in Journal of Chemical Information and Modeling
- Vol. 46 (3) , 1491-1502
- https://doi.org/10.1021/ci050380d
Abstract
The accurate identification of T-cell epitopes remains a principal goal of bioinformatics within immunology. As the immunogenicity of peptide epitopes is dependent on their binding to major histocompatibility complex (MHC) molecules, the prediction of binding affinity is a prerequisite to the reliable prediction of epitopes. The iterative self-consistent (ISC) partial-least-squares (PLS)-based additive method is a recently developed bioinformatic approach for predicting class II peptide-MHC binding affinity. The ISC-PLS method overcomes many of the conceptual difficulties inherent in the prediction of class II peptide-MHC affinity, such as the binding of a mixed population of peptide lengths due to the open-ended class II binding site. The method has applications in both the accurate prediction of class II epitopes and the manipulation of affinity for heteroclitic and competitor peptides. The method is applied here to six class II mouse alleles (I-Ab, I-Ad, I-Ak, I-As, I-Ed, and I-Ek) and included peptides up to 25 amino acids in length. A series of regression equations highlighting the quantitative contributions of individual amino acids at each peptide position was established. The initial model for each allele exhibited only moderate predictivity. Once the set of selected peptide subsequences had converged, the final models exhibited a satisfactory predictive power. Convergence was reached between the 4th and 17th iterations, and the leave-one-out cross-validation statistical terms--q2, SEP, and NC--ranged between 0.732 and 0.925, 0.418 and 0.816, and 1 and 6, respectively. The non-cross-validated statistical terms r2 and SEE ranged between 0.98 and 0.995 and 0.089 and 0.180, respectively. The peptides used in this study are available from the AntiJen database (http://www.jenner.ac.uk/AntiJen). The PLS method is available commercially in the SYBYL molecular modeling software package. The resulting models, which can be used for accurate T-cell epitope prediction, will be made freely available online (http://www.jenner.ac.uk/MHCPred).Keywords
This publication has 64 references indexed in Scilit:
- Identification of an I-Ed-Restricted T-Cell Epitope ofEscherichia coliOuter Membrane Protein FInfection and Immunity, 2004
- The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003Nucleic Acids Research, 2003
- Recognition of core and flanking amino acids of MHC class II-bound peptides by the T cell receptorEuropean Journal of Immunology, 2002
- Structure of a Complex of the Human α/β T Cell Receptor (TCR) HA1.7, Influenza Hemagglutinin Peptide, and Major Histocompatibility Complex Class II Molecule, HLA-DR4 (DRA0101 and DRB10401)The Journal of Experimental Medicine, 2002
- Hb(64–76) epitope binds in different registers and lengths to I-Ek and I-AkMolecular Immunology, 2000
- T-Cell Epitope Mapping the PorB Protein of Serogroup BNeisseria meningitidisin B10 Congenic Strains of MiceClinical Immunology and Immunopathology, 1997
- Human class II MHC molecule HLA-DR1: X-ray structure determined from three crystal formsActa Crystallographica Section D-Biological Crystallography, 1995
- Development of high potency universal DR-restricted helper epitopes by modification of high affinity DR-blocking peptidesImmunity, 1994
- Prominent role of secondary anchor residues in peptide binding to HLA-A2.1 moleculesCell, 1993
- Predominant naturally processed peptides bound to HLA-DR1 are derived from MHC-related molecules and are heterogeneous in sizeNature, 1992