Identification of hub genes involved in the occurrence and development of hepatocellular carcinoma via bioinformatics analysis

Oncol Lett. 2020 Aug;20(2):1695-1708. doi: 10.3892/ol.2020.11752. Epub 2020 Jun 17.

Abstract

Hepatocellular carcinoma (HCC) is a heterogeneous malignancy, which is a major cause of cancer morbidity and mortality worldwide. Thus, the aim of the present study was to identify the hub genes and underlying pathways of HCC via bioinformatics analyses. The present study screened three datasets, including GSE112790, GSE84402 and GSE74656 from the Gene Expression Omnibus (GEO) database, and downloaded the RNA-sequencing of HCC from The Cancer Genome Atlas (TCGA) database. The differentially expressed genes (DEGs) in both the GEO and TCGA datasets were filtered, and the screened DEGs were subsequently analyzed for functional enrichment pathways. A protein-protein interaction (PPI) network was constructed, and hub genes were further screened to create the Kaplan-Meier curve using cBioPortal. The expression levels of hub genes were then validated in different datasets using the Oncomine database. In addition, associations between expression and tumor grade, hepatitis virus infection status, satellites and vascular invasion were assessed. A total of 126 DEGs were identified, containing 70 upregulated genes and 56 downregulated genes from the GEO and TCGA databases. By constructing the PPI network, the present study identified hub genes, including cyclin B1 (CCNB1), cell-division cycle protein 20 (CDC20), cyclin-dependent kinase 1, BUB1 mitotic checkpoint serine/threonine kinase β (BUB1B), cyclin A2, nucleolar and spindle associated protein 1, ubiquitin-conjugating enzyme E2 C (UBE2C) and ZW10 interactor. Furthermore, upregulated CCNB1, CDC20, BUB1B and UBE2C expression levels indicated worse disease-free and overall survival. Moreover, a meta-analysis of tumor and healthy tissues in the Oncomine database demonstrated that BUB1B and UBE2C were highly expressed in HCC. The present study also analyzed the data of HCC in TCGA database using univariate and multivariate Cox analyses, and demonstrated that BUB1B and UBE2C may be used as independent prognostic factors. In conclusion, the present study identified several genes and the signaling pathways that were associated with tumorigenesis using bioinformatics analyses, which could be potential targets for the diagnosis and treatment of HCC.

Keywords: bioinformatics; hepatocellular carcinoma; hub genes; microarray; prognosis.