Computational deconvolution of cell type-specific gene expression in COPD and IPF lungs reveals disease severity associations

BMC Genomics. 2024 Dec 18;25(1):1192. doi: 10.1186/s12864-024-11031-5.

Abstract

Background: Chronic obstructive pulmonary disease (COPD) and idiopathic pulmonary fibrosis (IPF) are debilitating diseases associated with divergent histopathological changes in the lungs. At present, due to cost and technical limitations, profiling cell types is not practical in large epidemiology cohorts (n > 1000). Here, we used computational deconvolution to identify cell types in COPD and IPF lungs whose abundances and cell type-specific gene expression are associated with disease diagnosis and severity.

Results: We analyzed lung tissue RNA-seq data from 1026 subjects (COPD, n = 465; IPF, n = 213; control, n = 348) from the Lung Tissue Research Consortium. We performed RNA-seq deconvolution, querying thirty-eight discrete cell-type varieties in the lungs. We tested whether deconvoluted cell-type abundance and cell type-specific gene expression were associated with disease severity. The abundance score of twenty cell types significantly differed between IPF and control lungs. In IPF subjects, eleven and nine cell types were significantly associated with forced vital capacity (FVC) and diffusing capacity for carbon monoxide (DLCO), respectively. Aberrant basaloid cells, a rare cells found in fibrotic lungs, were associated with worse FVC and DLCO in IPF subjects, indicating that this aberrant epithelial population increased with disease severity. Alveolar type 1 and vascular endothelial (VE) capillary A were decreased in COPD lungs compared to controls. An increase in macrophages and classical monocytes was associated with lower DLCO in IPF and COPD subjects. In both diseases, lower non-classical monocytes and VE capillary A cells were associated with increased disease severity. Alveolar type 2 cells and alveolar macrophages had the highest number of genes with cell type-specific differential expression by disease severity in COPD and IPF. In IPF, genes implicated in the pathogenesis of IPF, such as matrix metallopeptidase 7, growth differentiation factor 15, and eph receptor B2, were associated with disease severity in a cell type-specific manner.

Conclusions: Utilization of RNA-seq deconvolution enabled us to pinpoint cell types present in the lungs that are associated with the severity of COPD and IPF. This knowledge offers valuable insight into the alterations within tissues in more advanced illness, ultimately providing a better understanding of the underlying pathological processes that drive disease progression.

Keywords: Cell type-specific gene expression.; Chronic obstructive pulmonary disease; Computational deconvolution; Idiopathic pulmonary fibrosis; Lung function tests; RNA sequencing.

MeSH terms

  • Aged
  • Computational Biology / methods
  • Female
  • Gene Expression Profiling
  • Humans
  • Idiopathic Pulmonary Fibrosis* / genetics
  • Idiopathic Pulmonary Fibrosis* / metabolism
  • Idiopathic Pulmonary Fibrosis* / pathology
  • Lung* / metabolism
  • Lung* / pathology
  • Male
  • Middle Aged
  • Pulmonary Disease, Chronic Obstructive* / genetics
  • Pulmonary Disease, Chronic Obstructive* / metabolism
  • Pulmonary Disease, Chronic Obstructive* / pathology
  • Severity of Illness Index*