Accurate prediction of HLA class II antigen presentation across all loci using tailored data acquisition and refined machine learning

Sci Adv. 2023 Nov 24;9(47):eadj6367. doi: 10.1126/sciadv.adj6367. Epub 2023 Nov 24.

Abstract

Accurate prediction of antigen presentation by human leukocyte antigen (HLA) class II molecules is crucial for rational development of immunotherapies and vaccines targeting CD4+ T cell activation. So far, most prediction methods for HLA class II antigen presentation have focused on HLA-DR because of limited availability of immunopeptidomics data for HLA-DQ and HLA-DP while not taking into account alternative peptide binding modes. We present an update to the NetMHCIIpan prediction method, which closes the performance gap between all three HLA class II loci. We accomplish this by first integrating large immunopeptidomics datasets describing the HLA class II specificity space across all loci using a refined machine learning framework that accommodates inverted peptide binders. Next, we apply targeted immunopeptidomics assays to generate data that covers additional HLA-DP specificities. The final method, NetMHCIIpan-4.3, achieves high accuracy and molecular coverage across all HLA class II allotypes.

MeSH terms

  • Antigen Presentation*
  • HLA-DP Antigens / chemistry
  • HLA-DQ Antigens / chemistry
  • HLA-DR Antigens* / metabolism
  • Humans
  • Peptides / chemistry

Substances

  • HLA-DR Antigens
  • HLA-DP Antigens
  • HLA-DQ Antigens
  • Peptides