使用探索性计划对行动模型的自我指导学习

论文标题

使用探索性计划对行动模型的自我指导学习

Self-directed Learning of Action Models using Exploratory Planning

论文作者

Dannenhauer, Dustin, Molineaux, Matthew, Floyd, Michael W., Reifsnyder, Noah, Aha, David W.

论文摘要

复杂的现实世界域可能无法为代理完全建模，尤其是如果代理以前从未在域中运行。代理在这样一个领域中有效计划和行动的能力受到其何时可以执行特定行动和这些行动影响的知识。我们描述了一个新颖的探索计划代理，该计划能够学习行动前提和效果，而没有专家痕迹或给定的目标。代理商的体系结构允许其执行探索性动作和目标指导的行动，这为应如何控制探索性计划和目标计划以及如何向代理商的行为解释了可能具有的任何队友的重要考虑。这项工作的贡献包括一个新的代表，称为“提升链接的子句”，一种使用这些条款的新型探索行动选择方法，一种探索计划者，该探索计划者使用提起的链接条款作为目标，以便到达新状态，并在以探索视频为中心的视频游戏中，在探索的视频游戏中进行了反对互动的探索模型，以实现探索的方式进行经验评估。

Complex, real-world domains may not be fully modeled for an agent, especially if the agent has never operated in the domain before. The agent's ability to effectively plan and act in such a domain is influenced by its knowledge of when it can perform specific actions and the effects of those actions. We describe a novel exploratory planning agent that is capable of learning action preconditions and effects without expert traces or a given goal. The agent's architecture allows it to perform both exploratory actions as well as goal-directed actions, which opens up important considerations for how exploratory planning and goal planning should be controlled, as well as how the agent's behavior should be explained to any teammates it may have. The contributions of this work include a new representation for contexts called Lifted Linked Clauses, a novel exploration action selection approach using these clauses, an exploration planner that uses lifted linked clauses as goals in order to reach new states, and an empirical evaluation in a scenario from an exploration-focused video game demonstrating that lifted linked clauses improve exploration and action model learning against non-planning baseline agents.

下载PDF全文

下载文献需遵守相关版权规定

论文标题