High Accuracy Open-Source Clinical Data De-Identification: The CliniDeID Solution

Stud Health Technol Inform. 2024 Jan 25:310:1370-1371. doi: 10.3233/SHTI231199.

Abstract

De-identifying clinical data protects patient privacy and makes the data easier to reuse. CliniDeID is an open-source solution for de-identifying unstructured clinical text with high accuracy. It applies an ensemble method that combines deep and shallow machine learning with rule-based algorithms. In a recent evaluation on a selection of clinical text corpora, it reached high recall and precision.
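
As a rough illustration of the ensemble idea, the sketch below combines the output of several detectors by simple voting and then redacts the agreed spans. It is not CliniDeID's implementation: the abstract does not state how the system merges its components, so the voting rule, the two regular expressions, the label names, and the function names `rule_spans`, `ensemble`, and `redact` are all assumptions made for this example. The "deep" and "shallow" model outputs are hard-coded stand-ins rather than real model predictions.

```python
import re
from collections import Counter

# Hypothetical rule-based detector: regex patterns for two PHI types.
RULES = {
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}

def rule_spans(text):
    """Return the set of (start, end, label) spans matched by the rules."""
    return {
        (m.start(), m.end(), label)
        for label, pattern in RULES.items()
        for m in pattern.finditer(text)
    }

def ensemble(span_sets, min_votes=2):
    """Keep spans proposed by at least `min_votes` detectors (assumed voting rule)."""
    votes = Counter(span for spans in span_sets for span in spans)
    return sorted(span for span, n in votes.items() if n >= min_votes)

def redact(text, spans):
    """Replace each span with a [LABEL] placeholder, working right to left
    so that earlier offsets stay valid."""
    for start, end, label in sorted(spans, reverse=True):
        text = text[:start] + f"[{label}]" + text[end:]
    return text

text = "Seen on 03/14/2021, call 555-123-4567."
rules = rule_spans(text)
# Stand-ins for model output (assumption): the "deep" model agrees with the
# rules on every span, the "shallow" model only finds the date.
deep = set(rules)
shallow = {s for s in rules if s[2] == "DATE"}

print(redact(text, ensemble([rules, deep, shallow])))
# prints: Seen on [DATE], call [PHONE].
```

Lowering `min_votes` favors recall, since a span flagged by any single detector is removed, while raising it favors precision. De-identification systems usually weight recall heavily, because a missed identifier is a privacy leak.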

Keywords: AI; de-identification; natural language processing; privacy protection.

MeSH terms

  • Algorithms*
  • Humans
  • Machine Learning*