Article type: Research Article
Authors: B, Nithya [a], [*] | V, Ilango [b]
Affiliations: [a] New Horizon College of Engineering, Bengaluru, India | [b] Department of MCA, CMR Institute of Technology, Bengaluru, India
Correspondence: [*] Corresponding author: Nithya B, New Horizon College of Engineering, Bengaluru, India. E-mail: nithya.boopalan@gmail.com.
Abstract: A dataset with a large number of features and imbalanced classes can make it difficult for Machine Learning (ML) classification approaches to achieve adequate accuracy. The purpose of this research is to find the optimal feature subset for cervical cancer diagnosis, together with an efficient classification approach, by estimating the performance of various ML predictive models. The filter-based feature selection techniques Relief and Information Gain are applied to compute a score for each feature; the scores are used to rank the features and select the highest-scoring ones. An optimal feature subset is then generated with a wrapper approach, using Recursive Feature Elimination (RFE) built on Random Forest and a Genetic Algorithm (GA) based on evolutionary principles. The predictive models are built with 10-fold cross-validation using the widely used classification algorithms Random Forest, C5.0, K-Nearest Neighbour and Naïve Bayes. The results show an improvement in the average performance of these classifiers and a substantial decrease in their classification error. The experiments also show that an optimal, reduced feature subset obtained with this approach improves classification accuracy at a lower computational cost. The features generated by the fused approach of Relief and the Genetic Algorithm predicted the results efficiently, and this feature subset was therefore selected as optimal. Most of the classifiers showed good performance. In addition, Random Forest showed a higher accuracy rate, with improved sensitivity and specificity. This work also establishes that selecting the optimal feature subset through the Fused Feature Selection (FFS) approach can reduce the complexity of the predictive model.
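The filter-based ranking step mentioned in the abstract can be illustrated with a minimal sketch of Information Gain scoring. This is not the authors' implementation: the function names (`entropy`, `information_gain`, `rank_features`), the feature names, and the toy data are all assumptions made for illustration, and the sketch handles only discrete feature values. Relief, RFE, and the Genetic Algorithm stage are not shown.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * log2(n / total) for n in counts.values())


def information_gain(feature_values, labels):
    """Reduction in label entropy obtained by splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the labels within each feature-value group.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(columns, labels, top_k=None):
    """Return (name, score) pairs sorted by descending information gain."""
    scores = {name: information_gain(vals, labels) for name, vals in columns.items()}
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked if top_k is None else ranked[:top_k]


# Hypothetical toy data (not from the paper): 8 samples, binary outcome.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_columns = {
    "feature_a": [1, 1, 1, 1, 0, 0, 0, 0],  # matches the label exactly
    "feature_b": [1, 1, 1, 0, 1, 0, 0, 0],  # partly predictive
    "feature_c": [1, 0, 1, 0, 1, 0, 1, 0],  # independent of the label
}
ranking = rank_features(toy_columns, toy_labels)
for name, score in ranking:
    print(f"{name}: {score:.4f}")
```

On this toy data the ranking places `feature_a` first and `feature_c` last; passing `top_k` keeps only the highest-scoring features, which is the selection step a filter method performs before any wrapper search is run.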
Keywords: Cervical cancer, fused feature selection (FFS), classification, Relief, Genetic Algorithm (GA), Recursive Feature Elimination (RFE)
DOI: 10.3233/KES-220009
Journal: International Journal of Knowledge-based and Intelligent Engineering Systems, vol. 26, no. 1, pp. 79-89, 2022