Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Zhu, Linkaia; b | Wang, Wennanb; * | Huang, Maoyic | Chen, Maomaod | Wang, Yiyune; f; g | Cai, Zhimingb; *
Affiliations: [a] Institute of Software, Chinese Academy of Sciences, Beijing, China | [b] Institute of Data Science, City University of Macau, Macau, China | [c] Product Development, Ericsson, Gothenburg, Sweden | [d] Department of Computer Science and Engineering, University of Gothenburg, Gothenburg, Sweden | [e] Faculty of Education, University of Malaya, Kuala Lumpur, Malaysia | [f] Applied Psychology Program, BNU-HKBU United International College, China | [g] College of Education for the Future, Beijing Normal University, Zhuhai, PR China
Correspondence: [*] Corresponding authors. Zhiming Cai, E-mail: caizhiming@cityu.mo.; Wennan Wang, E-mail: d19092105106@cityu.mo.
Note: [1] This work has been posted as a preprint in arXiv. arXiv preprint arXiv:2110.11879.
Abstract: A lot of manual work goes into identifying a topic for an article. With a large volume of articles, the manual process can be exhausting. Our approach aims to address this issue by automatically extracting topics from the text of large numbers of articles. This approach takes into account the efficiency of the process. Based on existing N-gram analysis, our research examines how often certain words appear in documents in order to support automatic topic extraction. In order to improve efficiency, we apply custom filtering standards to our research. Additionally, delete as many noncritical or irrelevant phrases as possible. In this way, we can ensure we are selecting unique keyphrases for each article, which capture its core idea1. For our research, we chose to center on the autonomous vehicle domain, since the research is relevant to our daily lives. We have to convert the PDF versions of most of the research papers into editable types of files such as TXT. This is because most of the research papers are only in PDF format. To test our proposed idea of automating, numerous articles on robotics have been selected. Next, we evaluate our approach by comparing the result with other models.
Keywords: Automatic topic extraction, frequency statistic, keyphrase, N-gram
DOI: 10.3233/JIFS-220115
Journal: Journal of Intelligent & Fuzzy Systems, vol. 43, no. 5, pp. 6137-6146, 2022
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl