论文标题

懒惰对大型神经网络的重要性的懒惰估计

Lazy Estimation of Variable Importance for Large Neural Networks

论文作者

Gao, Yue, Stevens, Abby, Willet, Rebecca, Raskutti, Garvesh

论文摘要

随着不透明的预测模型越来越多地影响现代生活的许多领域,对量化给定输入变量进行特定预测的重要性的兴趣已经增长。最近,模型 - 不足的方法已经扩散,以测量可变重要性(VI),该方法分析了对所有变量训练的完整模型之间的预测能力差异,以及排除了感兴趣的变量的简化模型。这些方法共有的瓶颈是每个变量(或变量子集)的简化模型的估计,这是一个昂贵的过程,通常不带理论保证。在这项工作中,我们提出了一种快速,灵活的方法,用于近似于重要的推理保证。我们通过在完整模型参数上初始化的线性化来取代对宽神经网络进行完全检验的需求。通过添加类似山脊的惩罚来使问题凸出,我们证明,当山脊罚款参数足够大时,我们的方法估计了$ o(\ frac {1} {\ sqrt {n}}}} {n}})$ n $ n $ n $ n $ n $ n $是培训样品的数量。我们还表明,我们的估计器在渐近正常上是正常的,使我们能够为VI估计值提供置信界。我们通过模拟证明,在几个数据生成的制度下,我们的方法是快速准确的,我们在季节性气候预测示例中证明了其现实世界的适用性。

As opaque predictive models increasingly impact many areas of modern life, interest in quantifying the importance of a given input variable for making a specific prediction has grown. Recently, there has been a proliferation of model-agnostic methods to measure variable importance (VI) that analyze the difference in predictive power between a full model trained on all variables and a reduced model that excludes the variable(s) of interest. A bottleneck common to these methods is the estimation of the reduced model for each variable (or subset of variables), which is an expensive process that often does not come with theoretical guarantees. In this work, we propose a fast and flexible method for approximating the reduced model with important inferential guarantees. We replace the need for fully retraining a wide neural network by a linearization initialized at the full model parameters. By adding a ridge-like penalty to make the problem convex, we prove that when the ridge penalty parameter is sufficiently large, our method estimates the variable importance measure with an error rate of $O(\frac{1}{\sqrt{n}})$ where $n$ is the number of training samples. We also show that our estimator is asymptotically normal, enabling us to provide confidence bounds for the VI estimates. We demonstrate through simulations that our method is fast and accurate under several data-generating regimes, and we demonstrate its real-world applicability on a seasonal climate forecasting example.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源