Machine learning-based lifetime breast cancer risk reclassification compared with the BOADICEA model: impact on screening recommendations

Br J Cancer. 2020 Sep;123(5):860-867. doi: 10.1038/s41416-020-0937-0. Epub 2020 Jun 22.

Abstract

Background: The clinical utility of machine-learning (ML) algorithms for breast cancer risk prediction and screening practices is unknown. We compared classification of lifetime breast cancer risk based on ML and the BOADICEA model. We explored the differences in risk classification and their clinical impact on screening practices.

Methods: We used three different ML algorithms and the BOADICEA model to estimate lifetime breast cancer risk in a sample of 112,587 individuals from 2481 families from the Oncogenetic Unit, Geneva University Hospitals. Performance of algorithms was evaluated using the area under the receiver operating characteristic (AU-ROC) curve. Risk reclassification was compared for 36,146 breast cancer-free women of ages 20-80. The impact on recommendations for mammography surveillance was based on the Swiss Surveillance Protocol.

Results: The predictive accuracy of ML-based algorithms (0.843 ≤ AU-ROC ≤ 0.889) was superior to BOADICEA (AU-ROC = 0.639) and reclassified 35.3% of women in different risk categories. The largest reclassification (20.8%) was observed in women characterised as 'near population' risk by BOADICEA. Reclassification had the largest impact on screening practices of women younger than 50.

Conclusion: ML-based reclassification of lifetime breast cancer risk occurred in approximately one in three women. Reclassification is important for younger women because it impacts clinical decision- making for the initiation of screening.

MeSH terms

  • Adult
  • Aged
  • Breast Neoplasms / classification
  • Breast Neoplasms / diagnosis
  • Breast Neoplasms / epidemiology*
  • Early Detection of Cancer
  • Female
  • Humans
  • Machine Learning*
  • Middle Aged
  • Retrospective Studies
  • Risk
  • Young Adult