Cancer mutational signatures identification in clinical assays using neural embedding-based representations

Cell Rep Med. 2024 Jun 18;5(6):101608. doi: 10.1016/j.xcrm.2024.101608. Epub 2024 Jun 11.

Abstract

While mutational signatures provide a plethora of prognostic and therapeutic insights, their application in clinical-setting, targeted gene panels is extremely limited. We develop a mutational representation model (which learns and embeds specific mutation signature connections) that enables prediction of dominant signatures with only a few mutations. We predict the dominant signatures across more than 60,000 tumors with gene panels, delineating their landscape across different cancers. Dominant signature predictions in gene panels are of clinical importance. These included UV, tobacco, and apolipoprotein B mRNA editing enzyme, catalytic polypeptide (APOBEC) signatures that are associated with better survival, independently from mutational burden. Further analyses reveal gene and mutation associations with signatures, such as SBS5 with TP53 and APOBEC with FGFR3S249C. In a clinical use case, APOBEC signature is a robust and specific predictor for resistance to epidermal growth factor receptor-tyrosine kinase inhibitors (EGFR-TKIs). Our model provides an easy-to-use way to detect signatures in clinical setting assays with many possible clinical implications for an unprecedented number of cancer patients.

Keywords: cancer genomics; gene panels; machine learning; mutational signatures; precision oncology.

MeSH terms

  • ErbB Receptors / genetics
  • Humans
  • Mutation* / genetics
  • Neoplasms* / genetics
  • Neural Networks, Computer
  • Protein Kinase Inhibitors / pharmacology
  • Receptor, Fibroblast Growth Factor, Type 3 / genetics
  • Tumor Suppressor Protein p53 / genetics

Substances

  • ErbB Receptors
  • Protein Kinase Inhibitors
  • Tumor Suppressor Protein p53
  • Receptor, Fibroblast Growth Factor, Type 3