论文标题
自私驾驶有多糟糕?界定城市驾驶游戏中平衡的效率低下
How Bad is Selfish Driving? Bounding the Inefficiency of Equilibria in Urban Driving Games
论文作者
论文摘要
我们考虑从事驾驶任务的代理商之间的互动,并将其模拟为通用游戏。这类游戏表现出多种不同的平衡,提出了均衡选择的问题。从计算的角度来看,选择最有效的平衡(按社会成本)通常是不切实际的,但在这项工作中,我们研究了任何均衡参与者的(在)效率上可能同意玩。更具体地说,我们通过将驾驶游戏作为特定类型的拥堵游戏建模而不是时空资源来限制均衡效率的效率低下。我们获得新颖的保证,可以根据问题依赖的游戏参数的函数来完善现有的无政府状态(POA)价格。例如,接近成本与舒适性和进步等个人目标之间的相对权衡。尽管获得的保证涉及开环轨迹,但即使代理人采用通过分散的多机构增强学习训练的闭环政策,我们也会观察到有效的平衡。
We consider the interaction among agents engaging in a driving task and we model it as general-sum game. This class of games exhibits a plurality of different equilibria posing the issue of equilibrium selection. While selecting the most efficient equilibrium (in term of social cost) is often impractical from a computational standpoint, in this work we study the (in)efficiency of any equilibrium players might agree to play. More specifically, we bound the equilibrium inefficiency by modeling driving games as particular type of congestion games over spatio-temporal resources. We obtain novel guarantees that refine existing bounds on the Price of Anarchy (PoA) as a function of problem-dependent game parameters. For instance, the relative trade-off between proximity costs and personal objectives such as comfort and progress. Although the obtained guarantees concern open-loop trajectories, we observe efficient equilibria even when agents employ closed-loop policies trained via decentralized multi-agent reinforcement learning.