Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Li, Yongjuna | Wei, Jianshuanga; * | Kang, Kaib | Wu, Zhouyanga
Affiliations: [a] School of Computer Science and Engineering, South China University of Technology, GuangZhou, China | [b] School of Information Science and Engineering, Central South University, ChangSha, China
Correspondence: [*] Corresponding author. Jianshuang Wei, School of Computer Science and Engineering, South China University of Technology, GuangZhou, 510006, China. E-mail: 201720139890@mail.scut.edu.cn.
Abstract: Aviation customer churn analysis is a difficult point, which has puzzled over airlines. The difficulties lie in the imbalance of customer churn data distribution and noisy data interference. Although some existing sampling techniques and ensemble models are good at dealing with class imbalance problem, noisy examples in dataset seriously affects the sampling quality and predictive accuracy of classifiers. Therefore, the purpose of our work is to effectively solve the problem of noise interference in imbalanced data classification and improve the effect of the ensemble classifier. In this paper, we propose a novel noise filtering algorithm that combined Tomek-link with distance weighted KNN (TWK), which can effectively filter the noise from both minority and majority class in the imbalanced dataset and prevent relative value samples from being rejected by mistake. We integrate TWK and feature sampling into EasyEnsemble to get a new ensemble model, named FSEE-TWK for short, for customer churn analysis. The introduction of feature sampling to FSEE-TWK accelerate the process of training and avoid model over-fitting. We obtained imbalanced customer data from a major Chinese airline to predict potential churn customers. We use F-Measure and G-Mean to evaluate the performance of the new ensemble model. The experimental results show that the proposed model can effectively improve the classification of datasets and significantly reduce the training time of the model.
Keywords: Aviation customer churn analysis, classification model, ensemble learning, noise filter, under-sampling
DOI: 10.3233/JIFS-182807
Journal: Journal of Intelligent & Fuzzy Systems, vol. 37, no. 2, pp. 2575-2585, 2019
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl