Background: Estimation of the clinical probability of malignancy in patients with pulmonary nodules will facilitate early diagnosis, determine optimum patient management strategies and reduce overall costs.
Methods: Data from the UK Lung Cancer Screening trial were analysed. Multivariable logistic regression models were used to identify independent predictors and to develop a parsimonious model to estimate the probability of lung cancer in lung nodules detected at baseline and at 3-month and 12-month repeat screening.
Results: Of 1994 participants who underwent CT scan, 1013 participants had a total of 5063 lung nodules and 52 (2.6%) of the participants developed lung cancer during a median follow-up of 4 years. Covariates that predict lung cancer in our model included female gender, asthma, bronchitis, asbestos exposure, history of cancer, early and late onset of family history of lung cancer, smoking duration, FVC, nodule type (pure ground-glass and part-solid) and volume as measured by semiautomated volumetry. The final model incorporating all predictors had excellent discrimination: area under the receiver operating characteristic curve (AUC 0.885, 95% CI 0.880 to 0.889). Internal validation suggested that the model will discriminate well when applied to new data (optimism-corrected AUC 0.882, 95% CI 0.848 to 0.907). The risk model had a good calibration (goodness-of-fit χ[8] 8.13, p=0.42).
Conclusions: Our model may be used in estimating the probability of lung cancer in nodules detected at baseline and at 3 months and 12 months from baseline, allowing more efficient stratification of follow-up in population-based lung cancer screening programmes.
Trial registration number: 78513845.
Keywords: CT Screening; lung cancer; risk prediction model; solitary pulmonary nodules.
© Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.