论文标题
基于软演员评论的自适应设备边缘共同推动框架
An Adaptive Device-Edge Co-Inference Framework Based on Soft Actor-Critic
论文作者
论文摘要
最近,由于其出色的特征提取性能,深度神经网络(DNN)的应用非常突出,例如计算机视觉(CV)和自然语言处理(NLP)。但是,高维参数模型和大规模数学计算限制了执行效率,尤其是对于物联网(IoT)设备。与以前的云/边缘模式不同,它为上行链路通信和仅设备的时尚带来了巨大的压力,即实现无法承受的计算强度,我们重点介绍了DNN模型的设备和边缘之间的协作计算,这可以在通信负载和执行精度之间取得良好的平衡。具体而言,提出了一个系统的按需共同推动框架来利用多分支结构,其中预训练的Alexnet通过\ emph {早期 - exiT}正确大小,并在中间DNN层进行了分区。实施整数量化以进一步压缩传输位。结果,我们为离散(SAC-D)建立了一个新的深入强化学习(DRL)优化的演员评论家(SAC-D),该研究生成\ emph {exit Point},\ emph {partition Point}和\ emph {compressing Bits}。基于延迟和准确的意识奖励设计,这种优化器可以很好地适应复杂的环境,例如动态无线通道和任意CPU处理,并且能够支持5G URLLC。 Raspberry Pi 4和PC上的现实世界实验显示了所提出的解决方案的表现。
Recently, the applications of deep neural network (DNN) have been very prominent in many fields such as computer vision (CV) and natural language processing (NLP) due to its superior feature extraction performance. However, the high-dimension parameter model and large-scale mathematical calculation restrict the execution efficiency, especially for Internet of Things (IoT) devices. Different from the previous cloud/edge-only pattern that brings huge pressure for uplink communication and device-only fashion that undertakes unaffordable calculation strength, we highlight the collaborative computation between the device and edge for DNN models, which can achieve a good balance between the communication load and execution accuracy. Specifically, a systematic on-demand co-inference framework is proposed to exploit the multi-branch structure, in which the pre-trained Alexnet is right-sized through \emph{early-exit} and partitioned at an intermediate DNN layer. The integer quantization is enforced to further compress transmission bits. As a result, we establish a new Deep Reinforcement Learning (DRL) optimizer-Soft Actor Critic for discrete (SAC-d), which generates the \emph{exit point}, \emph{partition point}, and \emph{compressing bits} by soft policy iterations. Based on the latency and accuracy aware reward design, such an optimizer can well adapt to the complex environment like dynamic wireless channel and arbitrary CPU processing, and is capable of supporting the 5G URLLC. Real-world experiment on Raspberry Pi 4 and PC shows the outperformance of the proposed solution.