Few-Shot and Prompt Training for Text Classification in German Doctor's Letters

Stud Health Technol Inform. 2023 May 18:302:819-820. doi: 10.3233/SHTI230275.

Abstract

To classify sentences in cardiovascular German doctor's letters into eleven section categories, we used pattern-exploiting training, a prompt-based method for text classification in few-shot learning scenarios (20, 50 and 100 instances per class) using language models with various pre-training approaches evaluated on CARDIO:DE, a freely available German clinical routine corpus. Prompting improves results by 5-28% accuracy compared to traditional methods, reducing manual annotation efforts and computational costs in a clinical setting.

Keywords: cardiology; deep learning; language models; prompting.

MeSH terms

  • Language*
  • Learning
  • Machine Learning*
  • Natural Language Processing