Paper Title


Condensing Graphs via One-Step Gradient Matching

Paper Authors

Wei Jin, Xianfeng Tang, Haoming Jiang, Zheng Li, Danqing Zhang, Jiliang Tang, Bing Yin

Paper Abstract


As training deep learning models on large datasets takes a lot of time and resources, it is desirable to construct a small synthetic dataset with which we can train deep learning models sufficiently. Recent works have explored solutions for condensing image datasets through complex bi-level optimization. For instance, dataset condensation (DC) matches network gradients w.r.t. large real data and small synthetic data, where the network weights are optimized for multiple steps at each outer iteration. However, existing approaches have inherent limitations: (1) they are not directly applicable to graphs, where the data is discrete; and (2) the condensation process is computationally expensive due to the nested optimization involved. To bridge the gap, we investigate efficient dataset condensation tailored for graph datasets, where we model the discrete graph structure as a probabilistic model. We further propose a one-step gradient matching scheme, which performs gradient matching for only one single step without training the network weights. Our theoretical analysis shows this strategy can generate synthetic graphs that lead to lower classification loss on real graphs. Extensive experiments on various graph datasets demonstrate the effectiveness and efficiency of the proposed method. In particular, we are able to reduce the dataset size by 90% while approximating up to 98% of the original performance, and our method is significantly faster than multi-step gradient matching (e.g., 15x in CIFAR10 for synthesizing 500 graphs). Code is available at \url{https://github.com/amazon-research/DosCond}.
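To make the one-step gradient matching idea in the abstract concrete, below is a minimal sketch, not the authors' released code, of the general scheme: for each outer iteration a freshly initialized network computes gradients on the real data and on learnable synthetic data, the two gradient sets are matched for a single step, and only the synthetic data is updated while the network weights are never trained. The network (`SimpleNet`), the plain L2 matching distance, the feature dimensions, and the use of continuous features instead of a probabilistic discrete graph structure are all simplifying assumptions for illustration.

```python
# Hedged sketch of one-step gradient matching (assumed details noted above).
import torch
import torch.nn as nn
import torch.nn.functional as F

def gradient_matching_loss(real_grads, syn_grads):
    # Distance between the two gradient sets; a plain L2 sum is assumed here,
    # the paper may use a different matching distance.
    return sum(((gr - gs) ** 2).sum() for gr, gs in zip(real_grads, syn_grads))

class SimpleNet(nn.Module):
    # Hypothetical stand-in for the classifier (e.g., a GNN in the paper);
    # an MLP keeps the sketch self-contained.
    def __init__(self, d_in=32, d_hid=64, n_cls=2):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(d_in, d_hid), nn.ReLU(),
                                 nn.Linear(d_hid, n_cls))

    def forward(self, x):
        return self.net(x)

# Toy "real" data and learnable synthetic data (features only, for illustration).
x_real = torch.randn(500, 32)
y_real = torch.randint(0, 2, (500,))
x_syn = torch.randn(50, 32, requires_grad=True)   # learnable synthetic data
y_syn = torch.randint(0, 2, (50,))                 # fixed synthetic labels
opt_syn = torch.optim.Adam([x_syn], lr=0.01)

for it in range(100):
    model = SimpleNet()                 # fresh random weights each outer iteration
    params = list(model.parameters())

    # Gradients w.r.t. the network weights on real data (treated as constants).
    loss_real = F.cross_entropy(model(x_real), y_real)
    grads_real = [g.detach() for g in torch.autograd.grad(loss_real, params)]

    # Gradients on synthetic data; create_graph=True allows backprop into x_syn.
    loss_syn = F.cross_entropy(model(x_syn), y_syn)
    grads_syn = torch.autograd.grad(loss_syn, params, create_graph=True)

    # One-step matching: update only the synthetic data; weights are never trained.
    opt_syn.zero_grad()
    gradient_matching_loss(grads_real, grads_syn).backward()
    opt_syn.step()
```

In this sketch the nested optimization of standard dataset condensation collapses to a single gradient-matching step per freshly sampled initialization, which is the source of the speedup claimed in the abstract; condensing a discrete adjacency structure would additionally require the probabilistic (e.g., Bernoulli-parameterized) modeling the paper describes.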
