Predicting Cognitive Impairment and Dementia: A Machine Learning Approach
Article type: Research Article
Authors: Aschwanden, Damaris [a, *] | Aichele, Stephen [b, c] | Ghisletta, Paolo [b, d, e] | Terracciano, Antonio [a] | Kliegel, Matthias [b, e] | Sutin, Angelina R. [a] | Brown, Justin [a] | Allemand, Mathias [f, g]
Affiliations: [a] Florida State University, Tallahassee, FL, USA | [b] Faculty of Psychology and Educational Sciences, University of Geneva, Switzerland | [c] Colorado State University, Fort Collins, CO, USA | [d] Swiss Distance University Institute, Switzerland | [e] Swiss National Centre of Competence in Research LIVES – Overcoming Vulnerability: Life Course Perspectives, Universities of Lausanne and of Geneva, Switzerland | [f] University of Zurich, Zurich, Switzerland | [g] University Research Priority Program Dynamics of Healthy Aging, University of Zurich, Switzerland
Correspondence: [*] Correspondence to: Damaris Aschwanden, Department of Geriatrics, College of Medicine, Florida State University, 1115 West Call Street, Tallahassee, FL 32306, USA. E-mail: damaris.aschwanden@med.fsu.edu.
Abstract: Background: Efforts to identify important risk factors for cognitive impairment and dementia have to date mostly relied on meta-analytic strategies. A comprehensive empirical evaluation of these risk factors within a single study is currently lacking. Objective: We combined machine learning with semi-parametric survival analysis to estimate the relative importance of 52 predictors in forecasting cognitive impairment and dementia in a large, population-representative sample of older adults. Methods: Participants from the Health and Retirement Study (N = 9,979; aged 50–98 years) were followed for up to 10 years (M = 6.85 for cognitive impairment; M = 7.67 for dementia). Using a split-sample methodology, we first estimated the relative importance of predictors using machine learning (random forest survival analysis), and we then used semi-parametric survival analysis (Cox proportional hazards) to estimate effect sizes for the most important variables. Results: African Americans and individuals who scored high on emotional distress were at the highest relative risk of developing cognitive impairment and dementia. Sociodemographic (lower education, Hispanic ethnicity) and health variables (worse subjective health, increasing BMI) were comparatively strong predictors of cognitive impairment. Cardiovascular factors (e.g., smoking, physical inactivity) and polygenic scores (with and without APOE ε4) appeared less important than expected. Post-hoc sensitivity analyses underscored the robustness of these results. Conclusions: Higher-order factors (e.g., emotional distress, subjective health), which reflect complex interactions between various aspects of an individual, were more important than narrowly defined factors (e.g., clinical and behavioral indicators) when evaluated concurrently to predict cognitive impairment and dementia.
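Illustrative sketch (not the authors' code): the Methods describe a split-sample design in which predictors are first ranked by a random forest survival analysis and effect sizes are then estimated with a Cox proportional hazards model. The minimal Python sketch below covers only the sample split and the Cox step, for a single covariate, using Newton-Raphson on the partial likelihood with the Breslow treatment of ties. The function names (`split_sample`, `cox_score_and_info`, `fit_cox`) and the toy data are hypothetical and chosen for illustration; the random forest stage is not shown, and a real analysis would rely on an established survival library and many covariates.

```python
import math
import random

# Invented toy data, NOT from the study: (follow-up time, event flag, covariate).
# event = 1 means the outcome was observed; event = 0 means censored.
TOY_ROWS = [
    (1, 1, 1), (2, 1, 1), (3, 1, 0), (4, 1, 1),
    (5, 1, 0), (6, 0, 0), (7, 1, 1), (8, 1, 0),
]


def split_sample(rows, frac=0.5, seed=0):
    """Shuffle rows and split them into two disjoint subsamples."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * frac)
    return shuffled[:cut], shuffled[cut:]


def cox_score_and_info(beta, rows):
    """Score (first derivative) and information (negative second derivative)
    of the Cox partial log-likelihood for one covariate."""
    score, info = 0.0, 0.0
    for t_i, event_i, x_i in rows:
        if not event_i:
            continue  # censored subjects contribute only through risk sets
        # Risk set: everyone still under observation at time t_i.
        risk_x = [x for t, _, x in rows if t >= t_i]
        weights = [math.exp(beta * x) for x in risk_x]
        s0 = sum(weights)
        s1 = sum(w * x for w, x in zip(weights, risk_x))
        s2 = sum(w * x * x for w, x in zip(weights, risk_x))
        score += x_i - s1 / s0
        info += s2 / s0 - (s1 / s0) ** 2
    return score, info


def fit_cox(rows, tol=1e-10, max_iter=50):
    """Estimate the log hazard ratio by Newton-Raphson, starting from 0."""
    beta = 0.0
    for _ in range(max_iter):
        score, info = cox_score_and_info(beta, rows)
        step = score / info
        beta += step
        if abs(step) < tol:
            break
    return beta


if __name__ == "__main__":
    beta_hat = fit_cox(TOY_ROWS)
    print("log hazard ratio:", beta_hat)
    print("hazard ratio:", math.exp(beta_hat))
```

In this sketch, `math.exp(beta_hat)` is the hazard ratio per one-unit increase in the covariate, which is the kind of effect size the Cox step reports. In a split-sample design, `split_sample` would be applied once so that variable ranking and effect-size estimation use different participants.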
Keywords: Aging, cognitive impairment, Cox proportional hazard survival analysis, dementia, machine learning, protective factors, random forest survival analysis, risk factors
DOI: 10.3233/JAD-190967
Journal: Journal of Alzheimer's Disease, vol. 75, no. 3, pp. 717-728, 2020