在神经网络中将最佳路径搜索与任务依赖性学习相结合

论文标题

在神经网络中将最佳路径搜索与任务依赖性学习相结合

Combining Optimal Path Search With Task-Dependent Learning in a Neural Network

论文作者

Kulvicius, Tomas, Tamosiunaite, Minija, Wörgötter, Florentin

论文摘要

在连接图中找到最佳路径需要确定沿图的边缘行驶的最小总成本。可以通过几种经典算法来解决此问题，在这些算法中通常，所有边缘的成本均已预定。因此，常规规划方法通常在想要按照某些任务要求以自适应方式更改成本时通常不使用。在这里，我们表明可以通过将成本值转换为突触权重，从而定义路径问题的神经网络表示，从而可以使用网络学习机制进行在线权重适应。从一个最初的活动值开始时，该网络中的活动传播将导致解决方案，这些解决方案与Bellman-Ford算法相同。神经网络具有与Bellman-Ford相同的算法复杂性，此外，我们可以证明网络学习机制（例如HEBBIAN学习）可以根据手头的某些任务来适应网络中的权重，从而增强所得路径。我们通过学习在具有障碍的环境以及学习遵循路径节点序列的环境中进行导航来证明这一点。因此，这里出现的新颖算法可能打开不同的应用程序，在这些应用程序中，道路启发（通过学习）直接与自然方式相结合的路径发现。

Finding optimal paths in connected graphs requires determining the smallest total cost for traveling along the graph's edges. This problem can be solved by several classical algorithms where, usually, costs are predefined for all edges. Conventional planning methods can, thus, normally not be used when wanting to change costs in an adaptive way following the requirements of some task. Here we show that one can define a neural network representation of path finding problems by transforming cost values into synaptic weights, which allows for online weight adaptation using network learning mechanisms. When starting with an initial activity value of one, activity propagation in this network will lead to solutions, which are identical to those found by the Bellman-Ford algorithm. The neural network has the same algorithmic complexity as Bellman-Ford and, in addition, we can show that network learning mechanisms (such as Hebbian learning) can adapt the weights in the network augmenting the resulting paths according to some task at hand. We demonstrate this by learning to navigate in an environment with obstacles as well as by learning to follow certain sequences of path nodes. Hence, the here-presented novel algorithm may open up a different regime of applications where path-augmentation (by learning) is directly coupled with path finding in a natural way.

下载PDF全文

下载文献需遵守相关版权规定

论文标题