Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Business Analytics in Finance and Industry January 6-8, 2014, Santiago, Chile
Guest editors: Cristián Bravo, Matt Davison, Alejandro Jofré, Sebastián Maldonado and Richard Weber
Article type: Research Article
Authors: Stecking, Ralfa; * | Schebesch, Klaus B.b
Affiliations: [a] Department of Economics, Carl von Ossietzky University Oldenburg, Oldenburg, Germany | [b] Faculty of Economics, Vasile Goldiş Western University Arad, Arad, Romania
Correspondence: [*] Corresponding author: Ralf Stecking, Department of Economics, Carl von Ossietzky University Oldenburg, Ammerländer Heerstr. 114-118, D-26111 Oldenburg, Germany. Tel.: +49 0441 798 4840; Fax: +49 0441 798 4116; E-mail:ralf.w.stecking@uni-oldenburg.de
Abstract: Modern data collections create vast opportunities for detecting useful hidden relationships. Also, increasingly, they fuel data privacy concerns. A trade-off between privacy protection and data usefulness is by now widely acknowledged. Real world data classification tasks, as for example credit scoring applications have to deal with such data security limitations by finding a way to effectively incorporate privacy preserving procedures. To this end we propose as a first stage to use a microaggregation procedure in order to anonymize data over personal credit client feature information. In a second stage we examine the performance of support vector machines (SVM) on such anonymized data. SVM are powerful and robust machine learning methods, having superior credit scoring classification performance when applied to original, non-anonymized data. We first partition the original credit scoring data set and construct anonymized data representatives, which are then used for credit client behavior forecasting models constructed by SVM and other comparable learning methods. The validation procedure for such models is adapted to the two-stage modeling approach. In order to assess the loss owing to data anonymization, the different classification models are evaluated against models that are trained on the original data.
Keywords: Data privacy, microaggregation, credit scoring, support vector machines
DOI: 10.3233/IDA-150767
Journal: Intelligent Data Analysis, vol. 19, no. s1, pp. S3-S18, 2015
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl