Curated benchmark dataset for ultrasound based breast lesion analysis

Sci Data. 2024 Jan 31;11(1):148. doi: 10.1038/s41597-024-02984-z.

Abstract

A new, detailed dataset of breast ultrasound scans (BrEaST), containing images of benign and malignant lesions as well as examples of normal tissue, is presented. The dataset consists of 256 breast scans collected from 256 patients. Each scan was manually annotated and labeled by a radiologist experienced in breast ultrasound examination. In particular, each tumor was identified in the image using a freehand annotation and labeled according to the BI-RADS features and lexicon. The histopathological classification of the tumor is also provided for patients who underwent a biopsy. The BrEaST dataset is the first breast ultrasound dataset containing patient-level labels, image-level annotations, and tumor-level labels, with all cases confirmed by follow-up care or core needle biopsy results. To enable research into breast disease detection, tumor segmentation, and classification, the BrEaST dataset is made publicly available under the CC-BY 4.0 license.

Publication types

  • Dataset

MeSH terms

  • Benchmarking
  • Breast Neoplasms* / diagnostic imaging
  • Breast* / diagnostic imaging
  • Breast* / pathology
  • Female
  • Humans
  • Ultrasonography, Mammary