Robust Identification of Differential Gene Expression Patterns from Multiple Transcriptomics Datasets for Early Diagnosis, Prognosis, and Therapies for Breast Cancer

Medicina (Kaunas). 2023 Sep 24;59(10):1705. doi: 10.3390/medicina59101705.

Abstract

Background and Objectives: Breast cancer (BC) is one of the major causes of cancer-related death in women globally. Proper identification of BC-causing hub genes (HubGs) for prognosis, diagnosis, and therapies at an earlier stage may reduce such death rates. However, most of the previous studies detected HubGs through non-robust statistical approaches that are sensitive to outlying observations. Therefore, the main objectives of this study were to explore BC-causing potential HubGs from robustness viewpoints, highlighting their early prognostic, diagnostic, and therapeutic performance. Materials and Methods: Integrated robust statistics and bioinformatics methods and databases were used to obtain the required results. Results: We robustly identified 46 common differentially expressed genes (cDEGs) between BC and control samples from three microarrays (GSE26910, GSE42568, and GSE65194) and one scRNA-seq (GSE235168) dataset. Then, we identified eight cDEGs (COL11A1, COL10A1, CD36, ACACB, CD24, PLK1, UBE2C, and PDK4) as the BC-causing HubGs by the protein-protein interaction (PPI) network analysis of cDEGs. The performance of BC and survival probability prediction models with the expressions of HubGs from two independent datasets (GSE45827 and GSE54002) and the TCGA (The Cancer Genome Atlas) database showed that our proposed HubGs might be considered as diagnostic and prognostic biomarkers, where two genes, COL11A1 and CD24, exhibit better performance. The expression analysis of HubGs by Box plots with the TCGA database in different stages of BC progression indicated their early diagnosis and prognosis ability. The HubGs set enrichment analysis with GO (Gene ontology) terms and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways disclosed some BC-causing biological processes, molecular functions, and pathways. Finally, we suggested the top-ranked six drug molecules (Suramin, Rifaximin, Telmisartan, Tukysa Tucatinib, Lynparza Olaparib, and TG.02) for the treatment of BC by molecular docking analysis with the proposed HubGs-mediated receptors. Molecular docking analysis results also showed that these drug molecules may inhibit cancer-related post-translational modification (PTM) sites (Succinylation, phosphorylation, and ubiquitination) of hub proteins. Conclusions: This study's findings might be valuable resources for diagnosis, prognosis, and therapies at an earlier stage of BC.

Keywords: breast cancer; early diagnosis; hub-genes; integrated robust statistics and bioinformatics approaches; prognosis and therapies; transcriptomics profiles.

MeSH terms

  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism
  • Breast Neoplasms* / diagnosis
  • Breast Neoplasms* / genetics
  • Breast Neoplasms* / therapy
  • Early Detection of Cancer
  • Female
  • Gene Expression Profiling / methods
  • Gene Expression Regulation, Neoplastic
  • Gene Regulatory Networks
  • Humans
  • Molecular Docking Simulation
  • Prognosis
  • Transcriptome / genetics

Substances

  • Biomarkers, Tumor