将DNN压缩和合作培训与资源和数据可用性匹配

论文标题

将DNN压缩和合作培训与资源和数据可用性匹配

Matching DNN Compression and Cooperative Training with Resources and Data Availability

论文作者

Malandrino, Francesco, Di Giacomo, Giuseppe, Karamzade, Armin, Levorato, Marco, Chiasserini, Carla Fabiana

论文摘要

为了使机器学习（ML）可持续并易于在相关数据的不同设备上运行，必须根据需要压缩ML模型，同时仍然达到所需的学习质量和时间性能。但是，应压缩ML模型的数量以及何时应进行压缩，{\ em}应该执行其培训，这是艰难的决定，因为它们依赖于模型本身，可用节点的资源以及该节点拥有的数据。现有的研究重点是各个方面，但是，它们并不能说明如何共同做出这些决定并彼此适应。在这项工作中，我们对关注DNN的培训，正式化上述多维问题的网络系统进行建模，并鉴于其NP硬度，我们通过PACT算法框架提出了一个近似的动态编程问题。重要的是，协议利用了代表学习过程的时间扩展图，以及由于培训决策的结果，预测损失演变的数据驱动和理论方法。我们证明，PACT的解决方案可以根据需要获得接近最佳的最佳，以增加时间复杂性的成本，并且在任何情况下，这种复杂性都是多项式的。数值结果还表明，即使在最不利的设置下，契约都超过了最先进的替代方案，并且与最佳能源成本匹配。

To make machine learning (ML) sustainable and apt to run on the diverse devices where relevant data is, it is essential to compress ML models as needed, while still meeting the required learning quality and time performance. However, how much and when an ML model should be compressed, and {\em where} its training should be executed, are hard decisions to make, as they depend on the model itself, the resources of the available nodes, and the data such nodes own. Existing studies focus on each of those aspects individually, however, they do not account for how such decisions can be made jointly and adapted to one another. In this work, we model the network system focusing on the training of DNNs, formalize the above multi-dimensional problem, and, given its NP-hardness, formulate an approximate dynamic programming problem that we solve through the PACT algorithmic framework. Importantly, PACT leverages a time-expanded graph representing the learning process, and a data-driven and theoretical approach for the prediction of the loss evolution to be expected as a consequence of training decisions. We prove that PACT's solutions can get as close to the optimum as desired, at the cost of an increased time complexity, and that, in any case, such complexity is polynomial. Numerical results also show that, even under the most disadvantageous settings, PACT outperforms state-of-the-art alternatives and closely matches the optimal energy cost.

下载PDF全文

下载文献需遵守相关版权规定

论文标题