Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Issue title: Collective intelligent information and database systems
Guest editors: Ngoc-Thanh Nguyen, Manuel Núñez and Bogdan Trawiński
Article type: Research Article
Authors: Tran, Van Cuonga; b | Hoang, Dinh Tuyena; b | Nguyen, Ngoc Thanhc | Hwang, Dosama; *
Affiliations: [a] Department of Computer Engineering, Yeungnam University, Gyeongsan, South Korea | [b] Faculty of Engineering and Technology, Quang Binh University, Dong Hoi, Vietnam | [c] Faculty of Computer Science and Management, Wroclaw University of Science and Technology, Wroclaw, Poland
Correspondence: [*] Corresponding author. Dosam Hwang, Department ofComputer Engineering, Yeungnam University, Gyeongsan, South Korea. Tel.: +82 53 810 3515; E-mail: dosamhwang@gmail.com.
Abstract: In recent years, information extraction from tweets has been challenging for researchers in the fields of knowledge discovery and data mining. Unlike formal text, such as news articles and pieces of longer content, tweets are of a specific nature: short, noisy, and with dynamic content. Thus, it is difficult to apply the traditional natural language processing algorithms to analyze them. Active learning is well-suited to many problems in natural language processing, especially when unlabeled data may be abundant, but labeled data is limited. The method proposed here aims to minimize annotation costs while maximizing the desired performance from the model. The method recognizes named entities from tweet streams on Twitter by using an active learning method with different query strategies. The tweets are queried for labeling by a human annotator based on query-by-committee, uncertainty-based sampling, and diversity-based sampling. The experimental evaluations of the proposed method on tweet data achieved better results than random sampling.
Keywords: Named entity recognition, active learning, tweet streams, query strategy
DOI: 10.3233/JIFS-169126
Journal: Journal of Intelligent & Fuzzy Systems, vol. 32, no. 2, pp. 1277-1287, 2017
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl