Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Qian, Yuronga; * | Shao, Jinxina | Zhang, Zhea | Leng, Hongyonga; b | Ma, Mengnana | Li, Zichenc
Affiliations: [a] School of Software, Xin Jiang University, Urumqi, China | [b] School of Computer Science & Technology, Beijing Institute of Technology, Beijing, China | [c] Big Data and Artificial Intelligence Academy, Guangdong Water Conservancy and Electricity Vocational and Technical College, Guangzhou, China
Correspondence: [*] Corresponding author. Yurong Qian, School of Software, XinJiang University, Urumqi, 830000, China. E-mail: qyr@xju.edu.cn.
Abstract: In traditional user portrait construction methods, static word vectors can extract only shallow semantic representations, which cannot manage word polysemy. Moreover, the common clustering algorithm K-means has the problems of initial K values and unstable initial centroid selection. A Bert-CK model based on Bert and CK-means+ is proposed. First, Bert is used to extract semantic and syntactic text features at various levels, and word vectors and sentence vectors are obtained according to the context. Then, the CK-means+ algorithm is improved based on canopy and mean calculation. Next, the K value and initial centroid are determined. The sentence vectors are input to CK-means+ to obtain user classification and topic features. Finally, semantic features and topic features are fused and classified. CK-means+ is evaluated on the Sogou user portrait dataset. The experimental results verify that Bert-CK is better than the baseline model.
Keywords: User profile, bert, canopy, K-means, text classification
DOI: 10.3233/JIFS-224531
Journal: Journal of Intelligent & Fuzzy Systems, vol. 45, no. 3, pp. 4585-4597, 2023
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl