Voice-dictated versus typed-in clinician notes: linguistic properties and the potential implications on natural language processing

AMIA Annu Symp Proc. 2011;2011:1630-8. Epub 2011 Oct 22.

Abstract

In this study, we compared the linguistic properties of narrative clinician notes created through voice dictation with those of notes entered directly by clinicians via a computer keyboard. Intuitively, voice-dictated notes would resemble natural language, while typed-in notes may show distinctive language features for reasons such as heavy use of acronyms. The analyses were based on an empirical dataset retrieved from our institutional electronic health records system. The dataset contains 30,000 voice-dictated notes and 30,000 notes that were entered manually; both sets were encounter notes generated in ambulatory care settings. The results suggest that there are considerable lexical and distributional differences between the narrative clinician notes created via these two methods. Such differences could have a significant impact on the performance of natural language processing tools, so the two types of documents may need to be treated differently.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Peripherals*
  • Electronic Health Records*
  • Humans
  • Linguistics*
  • Medical Records*
  • Narration
  • Natural Language Processing*
  • Speech Recognition Software*
  • User-Computer Interface