论文标题
预测口头话语的认知能力下降的早期指标
Predicting Early Indicators of Cognitive Decline from Verbal Utterances
论文作者
论文摘要
痴呆是一组不可逆,慢性和进行性神经退行性疾病,导致记忆,通信和思维过程受损。近年来,大脑衰老的临床研究进展集中在最早可检测到的初期痴呆症的阶段,通常称为轻度认知障碍(MCI)。目前,使用对神经心理学检查的手动分析来诊断这些疾病。我们衡量使用老年受试者神经心理学考试中引起的语言话语的语言特征的可行性,以区分老年对照组,患有MCI的人,被诊断出患有阿尔茨海默氏病(AD)的人和可能的AD。我们研究了理论驱动的心理语言特征和数据驱动的上下文语言嵌入的性能,以识别不同的临床诊断组。我们的实验表明,由支持向量机提取的上下文和心理语言特征的结合改善了区分老年人控制,MCI,可能的AD和可能的AD的口头话语。这是在高度不平衡的数据集中识别四个临床诊断组的第一项工作。我们的工作表明,基于上下文和心理语言特征建立的机器学习算法可以从口头话语中学习语言生物标志物,即使数据有限,也有助于不同阶段和痴呆症的临床诊断。
Dementia is a group of irreversible, chronic, and progressive neurodegenerative disorders resulting in impaired memory, communication, and thought processes. In recent years, clinical research advances in brain aging have focused on the earliest clinically detectable stage of incipient dementia, commonly known as mild cognitive impairment (MCI). Currently, these disorders are diagnosed using a manual analysis of neuropsychological examinations. We measure the feasibility of using the linguistic characteristics of verbal utterances elicited during neuropsychological exams of elderly subjects to distinguish between elderly control groups, people with MCI, people diagnosed with possible Alzheimer's disease (AD), and probable AD. We investigated the performance of both theory-driven psycholinguistic features and data-driven contextual language embeddings in identifying different clinically diagnosed groups. Our experiments show that a combination of contextual and psycholinguistic features extracted by a Support Vector Machine improved distinguishing the verbal utterances of elderly controls, people with MCI, possible AD, and probable AD. This is the first work to identify four clinical diagnosis groups of dementia in a highly imbalanced dataset. Our work shows that machine learning algorithms built on contextual and psycholinguistic features can learn the linguistic biomarkers from verbal utterances and assist clinical diagnosis of different stages and types of dementia, even with limited data.