Assessment of supervised longitudinal learning methods: Insights from predicting low birth weight and very low birth weight using prenatal ultrasound measurements

Comput Biol Med. 2024 Nov:182:109084. doi: 10.1016/j.compbiomed.2024.109084. Epub 2024 Sep 8.

Abstract

Background: This study aimed to assess the efficacy of various supervised longitudinal learning approaches, comparing traditional statistical models and machine learning algorithms for prediction with longitudinal data. The primary objectives were to evaluate the predictive performance of different supervised longitudinal learning methods for low birth weight (LBW) and very low birth weight (VLBW) based on prenatal ultrasound measurements. Additionally, the study sought to extract interpretable risk features for disease prediction.

Methods: The evaluation involved benchmarking the performance of longitudinal models against conventional machine learning methods. Classification accuracy for LBW and VLBW at birth, as well as prediction accuracy for birth weight using prenatal sonographic ultrasound measurements, were assessed.

Results: Among the learning approaches we investigated in this study, the longitudinal machine learning approach, specifically, the mixed effect random forest (MERF), delivered the overall best performance in predicting birthweights and classifying LBW/VLBW disease status.

Conclusion: The MERF combined the power of advanced machine learning algorithms to accommodate the inherent within-individual dependence in the observed data, delivering satisfactory performance in predicting the birthweight and classifying LBW/VLBW disease status. The study emphasized the importance of incorporating previous ultrasound measurements and considering correlations between repeated measurements for accurate prediction. The interpretable trees algorithm used for risk feature extraction proved reliable and applicable to other learning algorithms. These findings underscored the potential of longitudinal learning methods in improving birth weight prediction and highlighted the relevance of consistent risk features in line with established literature.

Keywords: Ensemble learning; Longitudinal data; Random effects; Sonographic ultrasound measurements; Supervised longitudinal learning; Within-subject dependence.

MeSH terms

  • Adult
  • Female
  • Humans
  • Infant, Low Birth Weight*
  • Infant, Newborn
  • Infant, Very Low Birth Weight*
  • Longitudinal Studies
  • Machine Learning
  • Male
  • Pregnancy
  • Supervised Machine Learning
  • Ultrasonography, Prenatal* / methods