论文标题
一般政策,序列化和计划宽度
General Policies, Serializations, and Planning Width
论文作者
论文摘要
已经观察到,在许多基准规划域中,可以通过一个称为IW的简单多项式探索程序来达到原子目标,该程序在问题宽度中时及时运行。这些问题确实具有有界的宽度:与问题变量的数量相关的宽度,通常不超过两个。然而,尽管宽度的概念已成为BFW等最先进的计划算法的一部分,但仍然没有很好的解释说明为什么如此多的基准域界限宽度。在这项工作中,我们通过将有限的宽度和序列化宽度与广义计划的思想联系起来来解决这个问题,其中一般政策旨在一次解决计划问题的多个实例。我们表明,有限的宽度是计划域的属性,该域是根据在域编码中明确或隐式表示的特征来接受最佳一般策略的属性。结果将扩展到具有有限的序列化宽度的较大类别的域,其中一般策略不必是最佳的。该研究还导致了一种新的简单,有意义且表现力的语言,用于以策略草图的形式指定域序列化,可用于通过手动编码域控制知识或从痕迹中学习它。草图的使用和理论结果的含义均通过许多示例说明。
It has been observed that in many of the benchmark planning domains, atomic goals can be reached with a simple polynomial exploration procedure, called IW, that runs in time exponential in the problem width. Such problems have indeed a bounded width: a width that does not grow with the number of problem variables and is often no greater than two. Yet, while the notion of width has become part of the state-of-the-art planning algorithms like BFWS, there is still no good explanation for why so many benchmark domains have bounded width. In this work, we address this question by relating bounded width and serialized width to ideas of generalized planning, where general policies aim to solve multiple instances of a planning problem all at once. We show that bounded width is a property of planning domains that admit optimal general policies in terms of features that are explicitly or implicitly represented in the domain encoding. The results are extended to much larger class of domains with bounded serialized width where the general policies do not have to be optimal. The study leads also to a new simple, meaningful, and expressive language for specifying domain serializations in the form of policy sketches which can be used for encoding domain control knowledge by hand or for learning it from traces. The use of sketches and the meaning of the theoretical results are all illustrated through a number of examples.