Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Zi, Quana | Cui, Hanweib | Liang, Weic | Chi, Qingjiaa; *
Affiliations: [a] Department of Engineering Structure and Mechanics, Wuhan University of Technology, Wuhan, Hubei, China | [b] Department of Science and Education, Shenzhen Samii Medical Center, Shenzhen, Guangdong, China | [c] Division of Nephrology, Renmin Hospital of Wuhan University, Wuhan, Hubei, China
Correspondence: [*] Corresponding author: Qingjia Chi, Department of Engineering Structure and Mechanics, Wuhan University of Technology, Wuhan, Hubei, China. E-mail: qingjia@whut.edu.cn.
Abstract: BACKGROUND: Hepatocellular carcinoma (HCC) is one of the most common malignant tumors. Due to the lack of specific characteristics in the early stage of the disease, patients are usually diagnosed in the advanced stage of disease progression. OBJECTIVE: This study used machine learning algorithms to identify key genes in the progression of hepatocellular carcinoma and constructed a prediction model to predict the survival risk of HCC patients. METHODS: The transcriptome data and clinical information were downloaded from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). The differential expression analysis and COX proportional-hazards model participated in the identification of survival-related genes. K-Means, Random forests, and LASSO regression are involved in identifying novel subtypes of HCC and screening key genes. The prediction model was constructed by deep neural networks (DNN), and Gene Set Enrichment Analysis (GSEA) reveals the metabolic pathways where key genes are located. RESULTS: Two subtypes were identified with significantly different survival rates (p< 0.0001, AUC = 0.720) and 17 key genes associated with the subtypes. The accuracy rate of the deep neural network prediction model is greater than 93.3%. The GSEA analysis found that the survival-related genes were significantly enriched in hallmark gene sets in the MSigDB database. CONCLUSIONS: In this study, we used machine learning algorithms to screen out 17 genes related to the survival risk of HCC patients, and trained a DNN model based on them to predict the survival risk of HCC patients. The genes that make up the model are all key genes that affect the formation and development of cancer.
Keywords: Hepatocellular carcinoma, TCGA, machine learning, deep neural network, prediction model, survival
DOI: 10.3233/CBM-220147
Journal: Cancer Biomarkers, vol. 35, no. 3, pp. 305-320, 2022
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl