Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Devi Arockia Vanitha, C.a; * | Devaraj, D.b | Venkatesulu, M.c
Affiliations: [a] Department of Computer Science, The S.F.R College for Women, Sivakasi, Tamil Nadu, India | [b] Department of Electrical and Electronics Engineering, Kalasalingam Academy of Research and Education, Krishnankoil, Tamil Nadu, India | [c] Department of Computer Applications, Kalasalingam Academy of Research and Education, Krishnankoil, Tamil Nadu, India
Correspondence: [*] Corresponding author: C. Devi Arockia Vanitha, Department of Computer Science, The S.F.R College for Women, Sivakasi 626123, Tamil Nadu, India. Tel.: +94 864 62030; E-mail:vanima_c@yahoo.co.in
Abstract: Gene expression profiles have been used for Cancer Classification recently. In this work, the multi-SVM (Support Vector Machine) approach with a novel Gene selection method using Mutual Information (MI) is developed for multi-class classification in the cancer diagnosis area. The mutual information between genes and class label is computed and used for identifying the discriminating genes in each category. All the genes are assigned rank based on their mutual information value and the optimal number of genes with the highest values are chosen and fed into the classifier. The multi-SVM classifier constructs separate classifier for each class and the combined multi-class classifier assigns a tissue sample to the class with the highest support. The performance of the proposed Multiclass Support Vector Machine (mSVM) with Gene Selection using the mutual information approach is evaluated on four benchmark gene expression datasets for cancer diagnosis, namely, the Leukemia dataset, the Lymphoma dataset, the NCI60 dataset and the GCM dataset. The multi-SVM approach develops the most effective classifier in achieving an accurate cancer diagnosis by analyzing gene expression data and it outperforms other popular machine learning algorithm like k-Nearest Neighbor. From the simulation study it is observed that the proposed approach reduces the dimension of the input features by identifying the most discriminating gene subset for each category and improves the predictive accuracy for multi-class cancer.
Keywords: Gene expression, gene selection, mutual information, multiclass classification, Support Vector Machine
DOI: 10.3233/IDA-150203
Journal: Intelligent Data Analysis, vol. 20, no. 6, pp. 1425-1439, 2016
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl