论文标题
PowerNet:可扩展的电动机控制的多代理深钢筋学习
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
论文作者
论文摘要
本文开发了一种有效的多代理深钢筋学习算法,用于PowerGrids中的合作控制。具体而言,我们考虑分散的发电机(DGS)中基于分散的逆变器的二级电压控制问题,该问题首先被称为合作的多代理增强学习(MARL)问题。然后,我们提出了一种新颖的polynet元素算法,每个代理商(DG)在其中基于(子)全球奖励但本地状态从其邻近的代理商那里学习了一个控制策略。由于一个代理商的当地控制对远离代理的影响有限的事实,我们利用了一种新型的空间折现因子来减少远程代理的影响,以加快训练过程并提高可扩展性。此外,还采用了一种基于学习的,基于学习的交流协议来促进相邻代理之间的合作。此外,为了减轻系统不确定性和在政策学习过程中引入的随机噪声的影响,我们利用动作平滑因子来稳定策略执行。为了促进培训和评估,我们开发了PGSIM,这是一个高效,高保真的动力格里德模拟平台。两个微电网设置的实验结果表明,开发的PowerNet优于常规模型的控制以及几种最先进的MARL算法。分散的学习方案和高样本效率也使大规模电网可行。
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generators (DGs), which is first formulated as a cooperative multi-agent reinforcement learning (MARL) problem. We then propose a novel on-policy MARL algorithm, PowerNet, in which each agent (DG) learns a control policy based on (sub-)global reward but local states from its neighboring agents. Motivated by the fact that a local control from one agent has limited impact on agents distant from it, we exploit a novel spatial discount factor to reduce the effect from remote agents, to expedite the training process and improve scalability. Furthermore, a differentiable, learning-based communication protocol is employed to foster the collaborations among neighboring agents. In addition, to mitigate the effects of system uncertainty and random noise introduced during on-policy learning, we utilize an action smoothing factor to stabilize the policy execution. To facilitate training and evaluation, we develop PGSim, an efficient, high-fidelity powergrid simulation platform. Experimental results in two microgrid setups show that the developed PowerNet outperforms a conventional model-based control, as well as several state-of-the-art MARL algorithms. The decentralized learning scheme and high sample efficiency also make it viable to large-scale power grids.