论文标题
带有生存监督的神经主题模型:共同预测事件的时间结果并了解临床特征如何相关
Neural Topic Models with Survival Supervision: Jointly Predicting Time-to-Event Outcomes and Learning How Clinical Features Relate
论文作者
论文摘要
我们提出了一个神经网络框架,用于学习一个生存模型,以预测事件时间的结果,同时学习一个揭示特征关系的主题模型。特别是,我们将每个主题建模为“主题”的分布,例如,一个主题可以对应于年龄组,疾病或疾病。主题中主题的存在意味着特定的临床特征更有可能出现在该主题中。主题编码有关相关特征的信息,并以监督方式学习,以预测事件时间的结果。我们的框架支持结合许多不同的主题和生存模型;训练所得的关节生存主题模型很容易使用带有Minibatch梯度下降的标准神经净优化器到大型数据集。例如,一种特殊情况是将LDA与COX模型相结合,在这种情况下,主题对主题的分布是COX模型的输入特征向量。我们解释了如何解决将这些神经监督的主题模型应用于临床数据时出现的实际实施问题,包括如何可视化结果以帮助临床解释。我们研究了我们提出的框架在七个临床数据集上的有效性,以预测时间和医院ICU的住院时间,我们发现神经生存习惯的主题模型通过现有方法实现竞争精度,同时产生可解释的临床主题来解释特征关系。我们的代码可在以下网址找到:https://github.com/georgehc/survival-topics
We present a neural network framework for learning a survival model to predict a time-to-event outcome while simultaneously learning a topic model that reveals feature relationships. In particular, we model each subject as a distribution over "topics", where a topic could, for instance, correspond to an age group, a disorder, or a disease. The presence of a topic in a subject means that specific clinical features are more likely to appear for the subject. Topics encode information about related features and are learned in a supervised manner to predict a time-to-event outcome. Our framework supports combining many different topic and survival models; training the resulting joint survival-topic model readily scales to large datasets using standard neural net optimizers with minibatch gradient descent. For example, a special case is to combine LDA with a Cox model, in which case a subject's distribution over topics serves as the input feature vector to the Cox model. We explain how to address practical implementation issues that arise when applying these neural survival-supervised topic models to clinical data, including how to visualize results to assist clinical interpretation. We study the effectiveness of our proposed framework on seven clinical datasets on predicting time until death as well as hospital ICU length of stay, where we find that neural survival-supervised topic models achieve competitive accuracy with existing approaches while yielding interpretable clinical topics that explain feature relationships. Our code is available at: https://github.com/georgehc/survival-topics