Transformer-based biomarker prediction from colorectal cancer histology: A large-scale multicentric study

Sophia J Wagner; Daniel Reisenbüchler; Nicholas P West; Jan Moritz Niehues; Jiefu Zhu; Sebastian Foersch; Gregory Patrick Veldhuizen; Philip Quirke; Heike I Grabsch; Piet A van den Brandt; Gordon G A Hutchins; Susan D Richman; Tanwei Yuan; Rupert Langer; Josien C A Jenniskens; Kelly Offermans; Wolfram Mueller; Richard Gray; Stephen B Gruber; Joel K Greenson; Gad Rennert; Joseph D Bonner; Daniel Schmolze; Jitendra Jonnagaddala; Nicholas J Hawkins; Robyn L Ward; Dion Morton; Matthew Seymour; Laura Magill; Marta Nowak; Jennifer Hay; Viktor H Koelzer; David N Church; TransSCOT consortium; Christian Matek; Carol Geppert; Chaolong Peng; Cheng Zhi; Xiaoming Ouyang; Jacqueline A James; Maurice B Loughrey; Manuel Salto-Tellez; Hermann Brenner; Michael Hoffmeister; Daniel Truhn; Julia A Schnabel; Melanie Boxberg; Tingying Peng; Jakob Nikolas Kather

doi:10.1016/j.ccell.2023.08.002

Transformer-based biomarker prediction from colorectal cancer histology: A large-scale multicentric study

Cancer Cell. 2023 Sep 11;41(9):1650-1661.e4. doi: 10.1016/j.ccell.2023.08.002. Epub 2023 Aug 30.

Authors

Sophia J Wagner¹, Daniel Reisenbüchler², Nicholas P West³, Jan Moritz Niehues⁴, Jiefu Zhu⁴, Sebastian Foersch³, Gregory Patrick Veldhuizen⁴, Philip Quirke⁵, Heike I Grabsch⁶, Piet A van den Brandt⁷, Gordon G A Hutchins⁵, Susan D Richman⁵, Tanwei Yuan⁸, Rupert Langer⁹, Josien C A Jenniskens⁷, Kelly Offermans⁷, Wolfram Mueller¹⁰, Richard Gray¹¹, Stephen B Gruber¹², Joel K Greenson¹³, Gad Rennert¹⁴, Joseph D Bonner¹⁵, Daniel Schmolze¹², Jitendra Jonnagaddala¹⁶, Nicholas J Hawkins¹⁷, Robyn L Ward¹⁸, Dion Morton¹⁹, Matthew Seymour²⁰, Laura Magill²¹, Marta Nowak²², Jennifer Hay²³, Viktor H Koelzer²⁴, David N Church²⁵; TransSCOT consortium; Christian Matek²⁶, Carol Geppert²⁷, Chaolong Peng²⁸, Cheng Zhi²⁹, Xiaoming Ouyang²⁹, Jacqueline A James³⁰, Maurice B Loughrey³¹, Manuel Salto-Tellez³², Hermann Brenner³³, Michael Hoffmeister⁸, Daniel Truhn³⁴, Julia A Schnabel³⁵, Melanie Boxberg³⁶, Tingying Peng³⁷, Jakob Nikolas Kather³⁸

Collaborators

TransSCOT consortium:
David Church, Enric Domingo, Joanne Edwards, Bengt Glimelius, Ismail Gogenur, Andrea Harkin, Jen Hay, Timothy Iveson, Emma Jaeger, Caroline Kelly, Rachel Kerr, Noori Maka, Hannah Morgan, Karin Oien, Clare Orange, Claire Palles, Campbell Roxburgh, Owen Sansom, Mark Saunders, Ian Tomlinson

Affiliations

¹ Helmholtz Munich - German Research Center for Environment and Health, Munich, Germany; School of Computation, Information and Technology, Technical University of Munich, Munich, Germany; Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany.
² Helmholtz Munich - German Research Center for Environment and Health, Munich, Germany.
³ Institute of Pathology, University Medical Center Mainz, Mainz, Germany.
⁴ Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany.
⁵ Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK.
⁶ Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK; Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, the Netherlands.
⁷ Department of Epidemiology, Maastricht University Medical Center+, Maastricht, the Netherlands.
⁸ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany.
⁹ Institute of Pathology und Molecular Pathology, Johannes Kepler University Hospital Linz, Linz, Österreich.
¹⁰ Gemeinschaftspraxis Pathologie, Starnberg, Germany.
¹¹ Nuffield Department of Population Health, University of Oxford, Oxford, UK.
¹² Center for Precision Medicine and Department of Medical Oncology, City of Hope National Medical Center, Duarte, CA, USA.
¹³ Department of Pathology, City of Hope Comprehensive Cancer Center, Duarte, CA, USA.
¹⁴ Department of Community Medicine & Epidemiology, Lady Davis Carmel Medical Center, Ruth & Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel; Steve and Cindy Rasmussen Institute for Genomic Medicine, Lady Davis Carmel Medical Center and Technion Faculty of Medicine, Clalit National Cancer Control Center, Haifa, Israel.
¹⁵ Department of Community Medicine & Epidemiology, Lady Davis Carmel Medical Center, Ruth & Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel.
¹⁶ School of Population Health, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW, Australia.
¹⁷ School of Medical Sciences, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW, Australia.
¹⁸ School of Medical Sciences, Faculty of Medicine and Health, UNSW Sydney, Sydney, NSW, Australia; Faculty of Medicine and Health, The University of Sydney, Sydney, NSW, Australia.
¹⁹ University Hospital Birmingham, Birmingham, UK.
²⁰ St James's University Hospital, Leeds, UK.
²¹ University of Birmingham Clinical Trials Unit, Birmingham, UK.
²² Department of Pathology and Molecular Pathology, University Hospital Zurich, University of Zurich, Zurich, Switzerland.
²³ Glasgow Tissue Research Facility, University of Glasgow, Queen Elizabeth University Hospital, Glasgow, UK.
²⁴ Department of Pathology and Molecular Pathology, University Hospital Zurich, University of Zurich, Zurich, Switzerland; Department of Oncology, University of Oxford, Oxford, UK; Nuffield Department of Medicine, University of Oxford, Roosevelt Drive, Oxford, UK.
²⁵ Nuffield Department of Medicine, University of Oxford, Roosevelt Drive, Oxford, UK; Oxford NIHR Comprehensive Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
²⁶ Helmholtz Munich - German Research Center for Environment and Health, Munich, Germany; Institute of Pathology, University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany; Comprehensive Cancer Center Erlangen-EMN (CCC), University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany.
²⁷ Institute of Pathology, University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany; Comprehensive Cancer Center Erlangen-EMN (CCC), University Hospital Erlangen, FAU Erlangen-Nuremberg, Erlangen, Germany.
²⁸ Medical School, Jianggang Shan University, Jiangxi, China.
²⁹ Department of Pathology, the Second Affiliated Hospital of Guangzhou Medical University, Guangzhou, China.
³⁰ Precision Medicine Centre of Excellence, Health Sciences Building, The Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK; Regional Molecular Diagnostic Service, Belfast Health and Social Care Trust, Belfast, UK; The Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK.
³¹ The Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK; Department of Cellular Pathology, Belfast Health and Social Care Trust, Belfast, UK; Centre for Public Health, Queen's University Belfast, Belfast, UK.
³² Precision Medicine Centre of Excellence, Health Sciences Building, The Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK; Regional Molecular Diagnostic Service, Belfast Health and Social Care Trust, Belfast, UK; Integrated Pathology Unit, Institute for Cancer Research and Royal Marsden Hospital, London, UK.
³³ Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany; Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany; German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany.
³⁴ Department of Diagnostic and Interventional Radiology, University Hospital RWTH Aachen, Aachen, Germany.
³⁵ Helmholtz Munich - German Research Center for Environment and Health, Munich, Germany; School of Computation, Information and Technology, Technical University of Munich, Munich, Germany; School of Biomedical Engineering and Imaging Sciences, King's College London, London, UK.
³⁶ Institute of Pathology, Technical University Munich, Munich, Germany; Institute of Pathology Munich-North, Munich, Germany.
³⁷ Helmholtz Munich - German Research Center for Environment and Health, Munich, Germany. Electronic address: tingying.peng@helmholtz-munich.de.
³⁸ Else Kroener Fresenius Center for Digital Health (EFFZ), Technical University Dresden, Dresden, Germany; Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, UK; Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg. Electronic address: jakob_nikolas.kather@tu-dresden.de.

Abstract

Deep learning (DL) can accelerate the prediction of prognostic biomarkers from routine pathology slides in colorectal cancer (CRC). However, current approaches rely on convolutional neural networks (CNNs) and have mostly been validated on small patient cohorts. Here, we develop a new transformer-based pipeline for end-to-end biomarker prediction from pathology slides by combining a pre-trained transformer encoder with a transformer network for patch aggregation. Our transformer-based approach substantially improves the performance, generalizability, data efficiency, and interpretability as compared with current state-of-the-art algorithms. After training and evaluating on a large multicenter cohort of over 13,000 patients from 16 colorectal cancer cohorts, we achieve a sensitivity of 0.99 with a negative predictive value of over 0.99 for prediction of microsatellite instability (MSI) on surgical resection specimens. We demonstrate that resection specimen-only training reaches clinical-grade performance on endoscopic biopsy tissue, solving a long-standing diagnostic problem.

Keywords: artificial intelligence; biomarker; colorectal cancer; deep learning; microsatellite instability; multiple instance learning; transformer.

Publication types

Multicenter Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Biomarkers
Biopsy
Colorectal Neoplasms* / genetics
Humans
Microsatellite Instability

Substances

Biomarkers

Abstract

Publication types

MeSH terms

Substances

Grants and funding