论文标题
机器学习会在大型多中心队列中使用胸部CT自动检测COVID-19
Machine Learning Automatically Detects COVID-19 using Chest CTs in a Large Multicenter Cohort
论文作者
论文摘要
目的:使用胸部CT研究机器学习分类器和可解释的模型,以检测COVID-19,并与其他肺炎,ILD和正常CT进行分化。 方法:我们的回顾性多机构研究从16个机构(包括1077 Covid-19患者)获得了2096个胸部CT。训练/测试队列包括927/100 Covid-11、388/33 ILD,189/33其他肺炎和559/34正常(无病理)CTS。基于逻辑回归和随机森林的基于度量的COVID-19进行分类的方法,使用了可解释的特征。基于深度学习的分类器通过3D特征从CT衰减和空域不透明的概率分布中提取的COVID-19。 结果:COVID-19的大多数判别特征是空域不透明的百分比,外围和基础主要的不透明性,与文献中Covid-19的典型表征一致。无监督的分层聚类比较了跨越19和控制队列的特征分布。基于指标的分类器实现了AUC = 0.83,灵敏度= 0.74,并且特异性= 0.79 = 0.79,而基于DL的分类器分别为0.93、0.90和0.83。大多数歧义都来自非旋转19个肺炎,其表现与19例重叠,以及轻度的Covid-19病例。 ILD的非旋转19分类性能为91%,其他肺炎为64%,没有病理学为94%,这证明了我们方法对对照组的不同组成的鲁棒性。 结论:我们的新方法可以准确区分COVID-19与其他类型的肺炎,ILD和无病理性CT,使用定量成像特征,同时使用定量成像特征,同时平衡结果和分类性能,因此对于促进Covid-19的诊断可能很有用。
Objectives: To investigate machine-learning classifiers and interpretable models using chest CT for detection of COVID-19 and differentiation from other pneumonias, ILD and normal CTs. Methods: Our retrospective multi-institutional study obtained 2096 chest CTs from 16 institutions (including 1077 COVID-19 patients). Training/testing cohorts included 927/100 COVID-19, 388/33 ILD, 189/33 other pneumonias, and 559/34 normal (no pathologies) CTs. A metric-based approach for classification of COVID-19 used interpretable features, relying on logistic regression and random forests. A deep learning-based classifier differentiated COVID-19 via 3D features extracted directly from CT attenuation and probability distribution of airspace opacities. Results: Most discriminative features of COVID-19 are percentage of airspace opacity and peripheral and basal predominant opacities, concordant with the typical characterization of COVID-19 in the literature. Unsupervised hierarchical clustering compares feature distribution across COVID-19 and control cohorts. The metrics-based classifier achieved AUC=0.83, sensitivity=0.74, and specificity=0.79 of versus respectively 0.93, 0.90, and 0.83 for the DL-based classifier. Most of ambiguity comes from non-COVID-19 pneumonia with manifestations that overlap with COVID-19, as well as mild COVID-19 cases. Non-COVID-19 classification performance is 91% for ILD, 64% for other pneumonias and 94% for no pathologies, which demonstrates the robustness of our method against different compositions of control groups. Conclusions: Our new method accurately discriminates COVID-19 from other types of pneumonia, ILD, and no pathologies CTs, using quantitative imaging features derived from chest CT, while balancing interpretability of results and classification performance, and therefore may be useful to facilitate diagnosis of COVID-19.