Consore: A Powerful Federated Data Mining Tool Driving a French Research Network to Accelerate Cancer Research

Int J Environ Res Public Health. 2024 Feb 7;21(2):189. doi: 10.3390/ijerph21020189.

Abstract

Background: Real-world data (RWD) related to the health status and care of cancer patients reflect the ongoing medical practice, and their analysis yields essential real-world evidence. Advanced information technologies are vital for their collection, qualification, and reuse in research projects.

Methods: UNICANCER, the French federation of comprehensive cancer centres, has innovated a unique research network: Consore. This potent federated tool enables the analysis of data from millions of cancer patients across eleven French hospitals.

Results: Currently operational within eleven French cancer centres, Consore employs natural language processing to structure the therapeutic management data of approximately 1.3 million cancer patients. These data originate from their electronic medical records, encompassing about 65 million medical records. Thanks to the structured data, which are harmonized within a common data model, and its federated search tool, Consore can create patient cohorts based on patient or tumor characteristics, and treatment modalities. This ability to derive larger cohorts is particularly attractive when studying rare cancers.

Conclusions: Consore serves as a tremendous data mining instrument that propels French cancer centres into the big data era. With its federated technical architecture and unique shared data model, Consore facilitates compliance with regulations and acceleration of cancer research projects.

Keywords: big data; cancer; cancer research; data mining; data warehouse; natural language processing.

MeSH terms

  • Biomedical Research*
  • Data Mining
  • Electronic Health Records
  • Humans
  • Language
  • Neoplasms* / therapy