Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Luo, Yuana | Li, Yifana | Zhang, Yuweia | Zhang, Jianweib | Liang, Menga | Jiang, Lina; * | Guo, Lia; *
Affiliations: [a] School of Medical Imaging, Tianjin Medical University, Tianjin, China | [b] Department of Radiology, Tianjin Baodi Hospital, Tianjin, China
Correspondence: [*] Corresponding authors: Li Guo and Lin Jiang, School of Medical Imaging, Tianjin Medical University, Tianjin 300203, China. Tel.: +086 13612023224 / +086 13752061603; E-mails: yxgl@tmu.edu.cn (Li Guo) and yxxyjl@163.com (Lin Jiang)
Abstract: BACKGROUND:Lung cancer is one of the most common cancers, and early diagnosis and intervention can improve cancer cure rate. OBJECTIVE:To improve predictive performance of radiomics features for lung cancer by tuning the machine learning model parameters. METHODS:Using a dataset involving 263 cases (125 benign and 138 malignant) acquired from our hospital, each classifier model is trained and tested using 237 and 26 cases, respectively. We initially extract 867 radiomics features of CT images for model development and then test 10 feature selections and 7 models to determine the best method. We further tune the parameter of the final model to reach the best performance. The adjusted final model is then validated using 224 cases acquired from Lung Image Database Consortium (LIDC) dataset (64 benign and 160 malignant) with the same set of selected radiomics features. RESULTS:During model development, the feature selection via concave minimization method show the best performance of area under ROC curve (AUC = 0.765), followed by l0-norm regularization (AUC = 0.741) and Fisher discrimination criterion (AUC = 0.734). Support vector machine (SVM) and random forest (RF) are the top two machine learning algorithms showing the best performance (AUC = 0.765 and 0.734, respectively), using by the default parameter. After parameter tuning, SVM with linear kernel achieves the best performance (AUC = 0.837), whereas the best tuned RF with the number of trees is 510 and yields a slightly lower performance (AUC = 0.775) in 26 test samples data. During model validation, the SVM and RF models yield AUC = 0.78 and 0.77, respectively. CONCLUSION:Appropriate quantitative radiomics features and accurate parameters can improve the model’s performance to predict lung cancer.
Keywords: Lung neoplasms, machine learning, radiomics, parameter analysis, lung nodule classification
DOI: 10.3233/XST-211096
Journal: Journal of X-Ray Science and Technology, vol. 30, no. 3, pp. 477-490, 2022
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl