使用图形神经网络的近端策略优化，以实现最佳功率流

论文标题

使用图形神经网络的近端策略优化，以实现最佳功率流

Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow

论文作者

López-Cardona, Ángela, Bernárdez, Guillermo, Barlet-Ros, Pere, Cabellos-Aparicio, Albert

论文摘要

最佳功率流（OPF）是电力系统领域中非常传统的研究领域，它寻求发电厂的最佳操作点，并且需要在现实世界中每隔几分钟解决一次。但是，由于发电系统中出现的非转化性，目前还没有一种快速，强大的解决方案技术，用于整个交替的当前最佳功率流（ACOPF）。在过去的几十年中，电网已演变为典型的动态，非线性和大规模控制系统（称为电力系统），因此寻找更好，更快的ACOPF解决方案变得至关重要。图形神经网络（GNN）的外观允许在图数据（例如电源网络）上自然使用机器学习（ML）算法。另一方面，深入的增强学习（DRL）以其解决复杂决策问题的强大能力而闻名。尽管分别使用这两种方法的解决方案已开始出现在文献中，但没有一个尚未将两者的优势结合在一起。我们提出了一种基于图形神经网络的近端策略优化算法的新型体系结构，以解决最佳功率流。目的是设计一个学习如何解决优化问题的体系结构，同时也能够概括为看不见的场景。我们将解决方案与DCOPF在IEEE 30总线系统上培训我们的DRL代理后的成本方面将我们的解决方案进行比较

Optimal Power Flow (OPF) is a very traditional research area within the power systems field that seeks for the optimal operation point of electric power plants, and which needs to be solved every few minutes in real-world scenarios. However, due to the nonconvexities that arise in power generation systems, there is not yet a fast, robust solution technique for the full Alternating Current Optimal Power Flow (ACOPF). In the last decades, power grids have evolved into a typical dynamic, non-linear and large-scale control system, known as the power system, so searching for better and faster ACOPF solutions is becoming crucial. Appearance of Graph Neural Networks (GNN) has allowed the natural use of Machine Learning (ML) algorithms on graph data, such as power networks. On the other hand, Deep Reinforcement Learning (DRL) is known for its powerful capability to solve complex decision-making problems. Although solutions that use these two methods separately are beginning to appear in the literature, none has yet combined the advantages of both. We propose a novel architecture based on the Proximal Policy Optimization algorithm with Graph Neural Networks to solve the Optimal Power Flow. The objective is to design an architecture that learns how to solve the optimization problem and that is at the same time able to generalize to unseen scenarios. We compare our solution with the DCOPF in terms of cost after having trained our DRL agent on IEEE 30 bus system and then computing the OPF on that base network with topology changes

下载PDF全文

下载文献需遵守相关版权规定

论文标题