Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Guo, Zhiqianga | Ling, Zhenhuaa; * | Li, Yunxiab; *
Affiliations: [a] National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, China | [b] Department of Neurology, Shanghai Tongji Hospital, Tongji University School of Medicine, Shanghai, China
Correspondence: [*] Correspondence to: Zhenhua Ling, National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China, Hefei, 230027, Anhui, China. E-mail: zhling@ustc.edu.cn and Yunxia Li, Department of Neurology, Shanghai Tongji Hospital, Tongji University School of Medicine, 389 Xincun Road, Shanghai 200065, China. E-mail: doctorliyunxia@163.com.
Abstract: Background:Recently, many studies have been carried out to detect Alzheimer’s disease (AD) from continuous speech by linguistic analysis and modeling. However, few of them utilize language models (LMs) to extract linguistic features and to investigate the lexical-level differences between AD and healthy speech. Objective:Our goals include obtaining state-of-art performance of automatic AD detection, emphasizing N-gram LMs as powerful tools for distinguishing AD patients’ narratives from those of healthy controls, and discovering the differences of lexical usages between AD patients and healthy people. Method:We utilize a subset of the DementiaBank corpus, including 242 control samples from 99 control participants and 256 AD samples from 169 “PossibleAD” or “ProbableAD” participants. Baseline models are built through area under curve-based feature selection and using five machine learning algorithms for comparison. Perplexity features are extracted using LMs to build enhanced detection models. Finally, the differences of lexical usages between AD patients and healthy people are investigated by a proportion test based on unigram probabilities. Results:Our baseline model obtains a detection accuracy of 80.7%. This accuracy increases to 85.4% after integrating the perplexity features derived from LMs. Further investigations show that AD patients tend to use more general, less informative, and less accurate words to describe characters and actions than healthy controls. Conclusion:The perplexity features extracted by LMs can benefit the automatic AD detection from continuous speech. There exist lexical-level differences between AD and healthy speech that can be captured by statistical N-gram LMs.
Keywords: Geriatric assessment, language, machine learning, statistical
DOI: 10.3233/JAD-190452
Journal: Journal of Alzheimer's Disease, vol. 70, no. 4, pp. 1163-1174, 2019
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
sales@iospress.com
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
info@iospress.nl
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office info@iospress.nl
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
china@iospress.cn
For editorial issues, like the status of your submitted paper or proposals, write to editorial@iospress.nl
如果您在出版方面需要帮助或有任何建, 件至: editorial@iospress.nl