Generalizable biomarker prediction from cancer pathology slides with self-supervised deep learning: A retrospective multi-centric study

Jan Moritz Niehues; Philip Quirke; Nicholas P West; Heike I Grabsch; Marko van Treeck; Yoni Schirris; Gregory P Veldhuizen; Gordon G A Hutchins; Susan D Richman; Sebastian Foersch; Titus J Brinker; Junya Fukuoka; Andrey Bychkov; Wataru Uegami; Daniel Truhn; Hermann Brenner; Alexander Brobeil; Michael Hoffmeister; Jakob Nikolas Kather

doi:10.1016/j.xcrm.2023.100980

Generalizable biomarker prediction from cancer pathology slides with self-supervised deep learning: A retrospective multi-centric study

Cell Rep Med. 2023 Apr 18;4(4):100980. doi: 10.1016/j.xcrm.2023.100980. Epub 2023 Mar 22.

Authors

Jan Moritz Niehues¹, Philip Quirke², Nicholas P West², Heike I Grabsch³, Marko van Treeck¹, Yoni Schirris⁴, Gregory P Veldhuizen¹, Gordon G A Hutchins², Susan D Richman², Sebastian Foersch⁵, Titus J Brinker⁶, Junya Fukuoka⁷, Andrey Bychkov⁸, Wataru Uegami⁸, Daniel Truhn⁹, Hermann Brenner¹⁰, Alexander Brobeil¹¹, Michael Hoffmeister¹², Jakob Nikolas Kather¹³

Affiliations

¹ Else Kroener Fresenius Center for Digital Health, Technical University Dresden, 01307 Dresden, Germany; Department of Medicine III, University Hospital RWTH Aachen, 52074 Aachen, Germany.
² Pathology & Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds LS9 7TF, UK.
³ Pathology & Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds LS9 7TF, UK; Department of Pathology, GROW School for Oncology and Reproduction, Maastricht University Medical Center+, 6229 HX Maastricht, the Netherlands.
⁴ Else Kroener Fresenius Center for Digital Health, Technical University Dresden, 01307 Dresden, Germany; Netherlands Cancer Institute, 1066 CX Amsterdam, the Netherlands; University of Amsterdam, 1012 WP Amsterdam, the Netherlands.
⁵ Institute of Pathology, University Medical Center Mainz, 55131 Mainz, Germany.
⁶ Digital Biomarkers for Oncology Group, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany.
⁷ Department of Pathology Informatics, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki 852-8523, Japan; Department of Pathology, Kameda Medical Center, Kamogawa 296-8602, Chiba, Japan.
⁸ Department of Pathology, Kameda Medical Center, Kamogawa 296-8602, Chiba, Japan.
⁹ Department of Diagnostic and Interventional Radiology, University Hospital RWTH Aachen, 52074 Aachen, Germany.
¹⁰ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany; Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), 69120 Heidelberg, Germany; German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany.
¹¹ Institute of Pathology, University Hospital Heidelberg, 69120 Heidelberg, Germany; Tissue Bank, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, 69120 Heidelberg, Germany.
¹² Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), 69120 Heidelberg, Germany.
¹³ Else Kroener Fresenius Center for Digital Health, Technical University Dresden, 01307 Dresden, Germany; Department of Medicine III, University Hospital RWTH Aachen, 52074 Aachen, Germany; Pathology & Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds LS9 7TF, UK; Department of Medicine I, University Hospital Dresden, 01307 Dresden, Germany; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, 69120 Heidelberg, Germany. Electronic address: jakob-nikolas.kather@alumni.dkfz.de.

Abstract

Deep learning (DL) can predict microsatellite instability (MSI) from routine histopathology slides of colorectal cancer (CRC). However, it is unclear whether DL can also predict other biomarkers with high performance and whether DL predictions generalize to external patient populations. Here, we acquire CRC tissue samples from two large multi-centric studies. We systematically compare six different state-of-the-art DL architectures to predict biomarkers from pathology slides, including MSI and mutations in BRAF, KRAS, NRAS, and PIK3CA. Using a large external validation cohort to provide a realistic evaluation setting, we show that models using self-supervised, attention-based multiple-instance learning consistently outperform previous approaches while offering explainable visualizations of the indicative regions and morphologies. While the prediction of MSI and BRAF mutations reaches a clinical-grade performance, mutation prediction of PIK3CA, KRAS, and NRAS was clinically insufficient.

Keywords: artificial intelligence; attention heatmaps; attention-based multiple-instance learning; biomarker; colorectal cancer; computational pathology; multi-input models; oncogenic mutation; self-supervised learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Biomarkers
Class I Phosphatidylinositol 3-Kinases / genetics
Colorectal Neoplasms* / genetics
Colorectal Neoplasms* / pathology
Deep Learning*
Humans
Microsatellite Instability
Proto-Oncogene Proteins B-raf / genetics
Proto-Oncogene Proteins p21(ras) / genetics
Retrospective Studies

Substances

Proto-Oncogene Proteins B-raf
Proto-Oncogene Proteins p21(ras)
Biomarkers
Class I Phosphatidylinositol 3-Kinases