Article type: Research Article
Authors: Shu, Wenhao [a] | Li, Shipeng [a] | Qian, Wenbin [b], [*]
Affiliations: [a] School of Information Engineering, East China Jiaotong University, Nanchang, Jiangxi, China | [b] School of Software, Jiangxi Agriculture University, Nanchang, Jiangxi, China
Correspondence: [*] Corresponding author. W. Qian, School of Software, Jiangxi Agriculture University, Nanchang 330045, Jiangxi, China. E-mail: qianwenbin1027@126.com.
Abstract: In real-world scenarios, datasets often contain mixed-type attributes and imbalanced class distributions, and the minority classes are usually the primary research focus. Attribute reduction is a key step in data preprocessing, but traditional attribute reduction methods commonly overlook the significance of minority class samples, so the critical information carried by those samples is lost and classification performance decreases. To address this issue, we develop an attribute reduction algorithm based on a composite entropy-based uncertainty measure to handle imbalanced mixed-type data. First, we design a novel oversampling method based on the three-way decisions boundary region to synthesize minority class samples, so that the boundary region contains more high-quality samples. Then, we propose an attribute measure for selecting candidate attributes that considers the boundary entropy, the degree of dependency and the class weights. On this basis, an attribute reduction algorithm guided by the composite entropy-based uncertainty measure is developed to select an attribute subset for imbalanced mixed-type data. Experimental results on UCI imbalanced datasets indicate that the developed algorithm significantly outperforms other attribute reduction algorithms, especially in terms of total AUC, F1-Score and G-Mean.
Keywords: imbalanced data, three-way decisions, neighborhood rough set, uncertainty measure, attribute reduction
DOI: 10.3233/JIFS-237211
Journal: Journal of Intelligent & Fuzzy Systems, vol. 46, no. 3, pp. 7307-7325, 2024
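Note on the evaluation metrics: the abstract reports F1-Score and G-Mean, both of which are commonly used for imbalanced classification because they are not dominated by the majority class. The sketch below uses the standard textbook definitions for a binary problem, with the minority class treated as positive. The function name `f1_and_gmean` and the sample counts are illustrative assumptions and are not taken from the paper, which may compute its scores differently (for example, averaged over several classes).

```python
import math

def f1_and_gmean(tp, fp, fn, tn):
    """Return (F1, G-Mean) from binary confusion-matrix counts.

    Assumes the minority class is the positive class. The name and
    interface are hypothetical, chosen only for this illustration.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0        # sensitivity
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    gmean = math.sqrt(recall * specificity)
    return f1, gmean

# A classifier that labels every sample as majority reaches 90% accuracy
# on a 10/90 split, yet both metrics collapse to zero.
print(f1_and_gmean(tp=0, fp=0, fn=10, tn=90))

# A classifier that recovers most minority samples scores well on both.
print(f1_and_gmean(tp=8, fp=2, fn=2, tn=88))
```

The first call shows why plain accuracy is a poor guide on imbalanced data: ignoring the minority class entirely still looks accurate, while F1 and G-Mean both penalize it.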