Opportunities and Pitfalls with Large Language Models for Biomedical Annotation

Pac Symp Biocomput. 2025:30:706-710.

Abstract

Large language models (LLMs) and biomedical annotations have a symbiotic relationship. LLMs rely on high-quality annotations for training and/or fine-tuning for specific biomedical tasks. These annotations are traditionally generated through expensive and time-consuming human curation. Meanwhile, LLMs can themselves be used to accelerate and simplify curation, potentially creating a virtuous feedback loop. However, the use of LLMs also introduces new limitations and risks, which are as important to consider as the opportunities they offer. In this workshop, we will review the developments that have led to the current rise of LLMs in several fields, in particular biomedicine, and discuss the specific opportunities and pitfalls that arise when LLMs are applied to biomedical annotation and curation.

MeSH terms

  • Biomedical Research / statistics & numerical data
  • Computational Biology*
  • Data Curation / statistics & numerical data
  • Humans
  • Natural Language Processing
  • Programming Languages