Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing

Yearb Med Inform. 2016 Nov 10:(1):224-233. doi: 10.15265/IY-2016-017.

Abstract

Objectives: This paper reviews work over the past two years in Natural Language Processing (NLP) applied to clinical and consumer-generated texts.

Methods: We included any application or methodological publication that leverages text to facilitate healthcare and address the health-related needs of consumers and populations.

Results: Many important developments in clinical text processing, both foundational and task-oriented, were addressed in community- wide evaluations and discussed in corresponding special issues that are referenced in this review. These focused issues and in-depth reviews of several other active research areas, such as pharmacovigilance and summarization, allowed us to discuss in greater depth disease modeling and predictive analytics using clinical texts, and text analysis in social media for healthcare quality assessment, trends towards online interventions based on rapid analysis of health-related posts, and consumer health question answering, among other issues.

Conclusions: Our analysis shows that although clinical NLP continues to advance towards practical applications and more NLP methods are used in large-scale live health information applications, more needs to be done to make NLP use in clinical applications a routine widespread reality. Progress in clinical NLP is mirrored by developments in social media text analysis: the research is moving from capturing trends to addressing individual health-related posts, thus showing potential to become a tool for precision medicine and a valuable addition to the standard healthcare quality evaluation tools.

Keywords: Natural Language Processing; computing methodologies; medical informatics applications; review; social media.

Publication types

  • Review

MeSH terms

  • Consumer Health Information
  • Electronic Health Records
  • Humans
  • Medical Informatics Applications*
  • Natural Language Processing*
  • Pharmacovigilance
  • Phenotype