论文标题
评估使用逐渐生长的gan产生的合成胸部X射线的临床现实主义
Evaluating the Clinical Realism of Synthetic Chest X-Rays Generated Using Progressively Growing GANs
论文作者
论文摘要
胸部X射线是许多患者检查的重要工具。与大多数医学成像方式相似,它们具有深远的多模式,并且能够可视化各种条件组合。不断迫切需要更多的标记数据来开发新的诊断工具,但是这直接反对对患者机密性的担忧,这会通过许可请求和道德批准来限制访问权限。先前的工作试图通过创建特定于班级的甘体来解决这些问题,以综合图像以增强培训数据。当它们引入模型大小和班级编号之间的计算交易折扣时,这些方法无法缩放,这对产生的质量固定限制了。我们通过引入潜在类优化来解决这一问题,该优化能够从gan中获得有效的多模式采样,并通过它合成大量标记的生成档案。我们将PGGAN应用于无监督的X射线合成的任务,并让放射科医生评估所得样品的临床现实主义。我们对生成的不同病理的特性以及模型捕获的疾病多样性程度的概述提供了深入的综述。我们验证了Fréchet成立距离(FID)的应用以测量X射线产生的质量,并发现它们与其他高分辨率任务相似。我们通过要求放射学家区分真实和假扫描来量化X射线临床现实主义,并发现生成更有可能被归类为真实而不是偶然地归类为真实,但是要实现真正的现实主义仍然需要进步。我们通过评估实际扫描中的合成分类模型性能来确认这些发现。我们通过讨论PGGAN生成的局限性以及如何实现可控的,现实的生成来得出结论。
Chest x-rays are a vital tool in the workup of many patients. Similar to most medical imaging modalities, they are profoundly multi-modal and are capable of visualising a variety of combinations of conditions. There is an ever pressing need for greater quantities of labelled data to develop new diagnostic tools, however this is in direct opposition to concerns regarding patient confidentiality which constrains access through permission requests and ethics approvals. Previous work has sought to address these concerns by creating class-specific GANs that synthesise images to augment training data. These approaches cannot be scaled as they introduce computational trade offs between model size and class number which places fixed limits on the quality that such generates can achieve. We address this concern by introducing latent class optimisation which enables efficient, multi-modal sampling from a GAN and with which we synthesise a large archive of labelled generates. We apply a PGGAN to the task of unsupervised x-ray synthesis and have radiologists evaluate the clinical realism of the resultant samples. We provide an in depth review of the properties of varying pathologies seen on generates as well as an overview of the extent of disease diversity captured by the model. We validate the application of the Fréchet Inception Distance (FID) to measure the quality of x-ray generates and find that they are similar to other high resolution tasks. We quantify x-ray clinical realism by asking radiologists to distinguish between real and fake scans and find that generates are more likely to be classed as real than by chance, but there is still progress required to achieve true realism. We confirm these findings by evaluating synthetic classification model performance on real scans. We conclude by discussing the limitations of PGGAN generates and how to achieve controllable, realistic generates.