拆分场所：AI在移动边缘环境中增强大规模神经网络的分裂和放置

论文标题

拆分场所：AI在移动边缘环境中增强大规模神经网络的分裂和放置

SplitPlace: AI Augmented Splitting and Placement of Large-Scale Neural Networks in Mobile Edge Environments

论文作者

Tuli, Shreshth, Casale, Giuliano, Jennings, Nicholas R.

论文摘要

近年来，深度学习模型在行业和学术界都变得无处不在。深层神经网络可以解决当今最复杂的模式识别问题，但要带来庞大的计算和记忆要求的价格。这使得在资源受限的移动边缘计算平台中部署这样的大规模神经网络，特别是在关键任务领域（例如监视和医疗保健）中。为了解决这个问题，一个有前途的解决方案是将渴望资源的神经网络拆分为用于管道分布式处理的轻质分离较小的组件。目前，有两种主要方法：语义和层面分裂。前者将神经网络划分为平行的分离模型，该模型产生了结果的一部分，而后者将分区分为产生中间结果的顺序模型。但是，没有智能算法决定要使用哪种拆分策略，并将这种模块化拆分以边缘节点以获得最佳性能。为了解决这个问题，这项工作提出了一种新颖的AI驱动的在线政策Splitplace，该政策使用多军伴随的伴侣根据输入任务的服务截止日期的需求明智地在图层和语义分开策略之间智能决定。 Splitplace使用决策意识的增强学习在移动边缘设备上拆分碎片，以进行有效且可扩展的计算。此外，SplitPlace微调其位置引擎以适应挥发性环境。我们对具有实际工作量的物理移动边缘环境进行的实验表明，分裂场所可以显着改善最新响应时间，违规截止日期的率，推理准确性和总奖励，分别高达46％，69％，3％和12％。

In recent years, deep learning models have become ubiquitous in industry and academia alike. Deep neural networks can solve some of the most complex pattern-recognition problems today, but come with the price of massive compute and memory requirements. This makes the problem of deploying such large-scale neural networks challenging in resource-constrained mobile edge computing platforms, specifically in mission-critical domains like surveillance and healthcare. To solve this, a promising solution is to split resource-hungry neural networks into lightweight disjoint smaller components for pipelined distributed processing. At present, there are two main approaches to do this: semantic and layer-wise splitting. The former partitions a neural network into parallel disjoint models that produce a part of the result, whereas the latter partitions into sequential models that produce intermediate results. However, there is no intelligent algorithm that decides which splitting strategy to use and places such modular splits to edge nodes for optimal performance. To combat this, this work proposes a novel AI-driven online policy, SplitPlace, that uses Multi-Armed-Bandits to intelligently decide between layer and semantic splitting strategies based on the input task's service deadline demands. SplitPlace places such neural network split fragments on mobile edge devices using decision-aware reinforcement learning for efficient and scalable computing. Moreover, SplitPlace fine-tunes its placement engine to adapt to volatile environments. Our experiments on physical mobile-edge environments with real-world workloads show that SplitPlace can significantly improve the state-of-the-art in terms of average response time, deadline violation rate, inference accuracy, and total reward by up to 46, 69, 3 and 12 percent respectively.

下载PDF全文

下载文献需遵守相关版权规定

论文标题