Assessment of supervised longitudinal learning methods: Insights from predicting low birth weight and very low birth weight using prenatal ultrasound measurements

Cancan Zhang; Xiufan Yu; Bo Zhang

doi:10.1016/j.compbiomed.2024.109084

Assessment of supervised longitudinal learning methods: Insights from predicting low birth weight and very low birth weight using prenatal ultrasound measurements

Comput Biol Med. 2024 Nov:182:109084. doi: 10.1016/j.compbiomed.2024.109084. Epub 2024 Sep 8.

Authors

Cancan Zhang¹, Xiufan Yu², Bo Zhang³

Affiliations

¹ Division of General Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
² Department of Applied and Computational Mathematics and Statistics, University of Notre Dame, Notre Dame, IN, USA. Electronic address: xiufan.yu@nd.edu.
³ Department of Neurology and Biostatistics and Research Design Center, Boston Children's Hospital, Harvard Medical School, Boston, Massachusetts, 02115, USA. Electronic address: bo.zhang@childrens.harvard.edu.

PMID: 39250874
DOI: 10.1016/j.compbiomed.2024.109084

Abstract

Background: This study aimed to assess the efficacy of various supervised longitudinal learning approaches, comparing traditional statistical models and machine learning algorithms for prediction with longitudinal data. The primary objectives were to evaluate the predictive performance of different supervised longitudinal learning methods for low birth weight (LBW) and very low birth weight (VLBW) based on prenatal ultrasound measurements. Additionally, the study sought to extract interpretable risk features for disease prediction.

Methods: The evaluation involved benchmarking the performance of longitudinal models against conventional machine learning methods. Classification accuracy for LBW and VLBW at birth, as well as prediction accuracy for birth weight using prenatal sonographic ultrasound measurements, were assessed.

Results: Among the learning approaches we investigated in this study, the longitudinal machine learning approach, specifically, the mixed effect random forest (MERF), delivered the overall best performance in predicting birthweights and classifying LBW/VLBW disease status.

Conclusion: The MERF combined the power of advanced machine learning algorithms to accommodate the inherent within-individual dependence in the observed data, delivering satisfactory performance in predicting the birthweight and classifying LBW/VLBW disease status. The study emphasized the importance of incorporating previous ultrasound measurements and considering correlations between repeated measurements for accurate prediction. The interpretable trees algorithm used for risk feature extraction proved reliable and applicable to other learning algorithms. These findings underscored the potential of longitudinal learning methods in improving birth weight prediction and highlighted the relevance of consistent risk features in line with established literature.

Keywords: Ensemble learning; Longitudinal data; Random effects; Sonographic ultrasound measurements; Supervised longitudinal learning; Within-subject dependence.

MeSH terms

Adult
Female
Humans
Infant, Low Birth Weight*
Infant, Newborn
Infant, Very Low Birth Weight*
Longitudinal Studies
Machine Learning
Male
Pregnancy
Supervised Machine Learning
Ultrasonography, Prenatal* / methods