AI-readiness for Biomedical Data: Bridge2AI Recommendations

Timothy Clark; Harry Caufield; Jillian A Parker; Sadnan Al Manir; Edilberto Amorim; James Eddy; Nayoon Gim; Brian Gow; Wesley Goar; Melissa Haendel; Jan N Hansen; Nomi Harris; Henning Hermjakob; Marcin Joachimiak; Gianna Jordan; In-Hee Lee; Shannon K McWeeney; Camille Nebeker; Milen Nikolov; Jamie Shaffer; Nathan Sheffield; Gloria Sheynkman; James Stevenson; Jake Y Chen; Chris Mungall; Alex Wagner; Sek Won Kong; Satrajit S Ghosh; Bhavesh Patel; Andrew Williams; Monica C Munoz-Torres

doi:10.1101/2024.10.23.619844

bioRxiv [Preprint]. 2024 Nov 24:2024.10.23.619844. doi: 10.1101/2024.10.23.619844.

Authors

Timothy Clark¹, Harry Caufield², Jillian A Parker³, Sadnan Al Manir¹, Edilberto Amorim⁴, James Eddy⁵, Nayoon Gim⁶, Brian Gow⁷, Wesley Goar⁸, Melissa Haendel⁹, Jan N Hansen¹⁰, Nomi Harris², Henning Hermjakob¹¹, Marcin Joachimiak², Gianna Jordan¹², In-Hee Lee¹³, Shannon K McWeeney¹⁴, Camille Nebeker³, Milen Nikolov¹², Jamie Shaffer⁶, Nathan Sheffield¹, Gloria Sheynkman¹, James Stevenson⁸, Jake Y Chen¹⁵, Chris Mungall², Alex Wagner⁸, Sek Won Kong¹³, Satrajit S Ghosh⁷, Bhavesh Patel¹⁶, Andrew Williams¹⁷, Monica C Munoz-Torres¹⁸

Affiliations

¹ University of Virginia.
² Lawrence Berkeley National Laboratory.
³ University of California San Diego.
⁴ University of California San Francisco.
⁵ Avantiqor.
⁶ University of Washington.
⁷ Massachusetts Institute of Technology.
⁸ Nationwide Children's Hospital.
⁹ University of North Carolina at Chapel Hill.
¹⁰ Stanford University.
¹¹ European Molecular Biology Laboratory - European Bioinformatics Institute.
¹² Sage Bionetworks.
¹³ Boston Children's Hospital.
¹⁴ Oregon Health and Science University.
¹⁵ University of Alabama at Birmingham.
¹⁶ California Medical Innovations Institute.
¹⁷ Tufts University.
¹⁸ University of Colorado Anschutz Medical Campus.

Abstract

Biomedical research and clinical practice are in the midst of a transition toward significantly increased use of artificial intelligence (AI) and machine learning (ML) methods. These advances promise to enable qualitatively deeper insight into complex challenges formerly beyond the reach of analytic methods and human intuition, while placing increased demands on ethical and explainable artificial intelligence (XAI), given the opaque nature of many deep learning methods. The U.S. National Institutes of Health (NIH) has initiated a significant research and development program, Bridge to Artificial Intelligence (Bridge2AI), aimed at producing new "flagship" datasets designed to support AI/ML analysis of complex biomedical challenges, elucidate best practices, develop tools and standards in AI/ML data science, and disseminate these datasets, tools, and methods broadly to the biomedical community. Alongside the data and tools it produces, the program is to develop and disseminate an essential set of concepts: criteria for the AI-readiness of data, including critical considerations for XAI and the ethical, legal, and social implications (ELSI) of AI technologies. Members of the NIH Bridge2AI Standards Working Group prepared this article to present methods for assessing the AI-readiness of biomedical data, together with the data standards perspectives and criteria we have developed throughout this program. While the field is rapidly evolving, these criteria are foundational for scientific rigor and the ethical design and application of biomedical AI methods.

Publication types

Preprint
