Aims: Cardiopulmonary disease (CPD) is a leading cause of death worldwide. Increasing evidence shows that air pollution and exposure to weather conditions have important contributory roles. Understanding the interaction of these factors is difficult due to the complexity of the relationship between CPD, air pollution, and environmental factors.
Methods: This paper uses regression models and machine learning approaches to explore these relationships, and investigate whether meteorological factors and air pollution have a synergistic effect on CPD. We use daily data from 2009-2018 from four cities representing the heterogenous climate conditions in Norway: the far north, the west coast, mid-Norway, and the south-east.
Results: We demonstrate the importance of the interaction between weather and air pollution associated with higher CPD mortality, as is exposure to air pollution in the form of particulate matter. This impact is seasonal. Traffic is also positively related to CPD mortality, which may be caused indirectly through increased pollution. We demonstrate that machine learning outperforms regression models in terms of the accuracy of predicting CPD mortality.
Conclusions: The inclusion of rich lagged structures and interactions between environmental factors are both important but can lead to overfitting of traditional models; since these cities are not large cities by international standards, it is surprising that environmental factors have such obvious impacts on CPD mortality. CPD mortality shows a clear negative trend, implying an improvement in the public health situation.
Keywords: Machine learning; air pollution; health policy; mortality.