Diagnosing breast cancer tumors using stacked ensemble model
Article type: Research Article
Authors: Yurttakal, Ahmet Haşima; * | Erbay, Hasanb | İkizceli, Türkanc | Karaçavuş, Seyhand | Biçer, Cenkere
Affiliations: [a] Computer Engineering Department, EngineeringFaculty, Afyon Kocatepe University, Afyon-Turkey | [b] Computer Engineering Department, EngineeringFaculty, University of Turkish Aeronautical Association, 06790Etimesgut Ankara-Turkey | [c] Haseki Training and Research Hospital, Departmentof Radiology, University of Health Sciences, İstanbul-Turkey | [d] Kayseri Training and Research Hospital, Departmentof Nuclear Medicine, University of Health Sciences, Kayseri-Turkey | [e] Statistcs Department, Arts & Science Faculty, Kırıkkale University, Kırıkkale-Turkey
Correspondence: [*] Corresponding author. Ahmet Haşim Yurttakal, Computer Engineering Department, Engineering Faculty, Afyon Kocatepe University, Afyon-Turkey. Tel.: +90 2722182300; E-mail: ahyurttakal@aku.edu.tr.
Abstract: Breast cancer is the most common cancer that progresses from cells in the breast tissue among women. Early-stage detection could reduce death rates significantly, and the detection-stage determines the treatment process. Mammography is utilized to discover breast cancer at an early stage prior to any physical sign. However, mammography might return false-negative, in which case, if it is suspected that lesions might have cancer of chance greater than two percent, a biopsy is recommended. About 30 percent of biopsies result in malignancy that means the rate of unnecessary biopsies is high. So to reduce unnecessary biopsies, recently, due to its excellent capability in soft tissue imaging, Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI) has been utilized to detect breast cancer. Nowadays, DCE-MRI is a highly recommended method not only to identify breast cancer but also to monitor its development, and to interpret tumorous regions. However, in addition to being a time-consuming process, the accuracy depends on radiologists’ experience. Radiomic data, on the other hand, are used in medical imaging and have the potential to extract disease characteristics that can not be seen by the naked eye. Radiomics are hard-coded features and provide crucial information about the disease where it is imaged. Conversely, deep learning methods like convolutional neural networks(CNNs) learn features automatically from the dataset. Especially in medical imaging, CNNs’ performance is better than compared to hard-coded features-based methods. However, combining the power of these two types of features increases accuracy significantly, which is especially critical in medicine. Herein, a stacked ensemble of gradient boosting and deep learning models were developed to classify breast tumors using DCE-MRI images. The model makes use of radiomics acquired from pixel information in breast DCE-MRI images. Prior to train the model, radiomics had been applied to the factor analysis to refine the feature set and eliminate unuseful features. The performance metrics, as well as the comparisons to some well-known machine learning methods, state the ensemble model outperforms its counterparts. The ensembled model’s accuracy is 94.87% and its AUC value is 0.9728. The recall and precision are 1.0 and 0.9130, respectively, whereas F1-score is 0.9545.
Keywords: Stacked ensemble, radiomics, deep learning, gradient boosting, breast cancer
DOI: 10.3233/JIFS-219176
Journal: Journal of Intelligent & Fuzzy Systems, vol. 42, no. 1, pp. 77-85, 2022