SPSA的概述：最新开发和应用

论文标题

SPSA的概述：最新开发和应用

An overview of SPSA: recent development and applications

论文作者

Wang, Chen

论文摘要

在非确定性环境下解决高维优化问题的需求越来越大。最近，同时扰动随机近似（SPSA）算法最近引起了很大的关注，以解决无法达到分析公式的高维优化问题。 SPSA旨在通过在每次迭代的尺寸随机子集上施加扰动来估计梯度。 SPSA可以轻松实施，并且高效，因为它依赖于目标函数的测量，而不是基于目标函数梯度的测量。自发明以来，SPSA已在强化学习，生产优化等各个领域实施。本文简要讨论了SPSA及其应用的最新发展。

There is an increasing need in solving high-dimensional optimization problems under non-deterministic environment. The simultaneous perturbation stochastic approximation (SPSA) algorithm has recently attracted considerable attention for solving high-dimensional optimization problems where the analytical formula cannot be attained. SPSA is designed to estimate the gradient by applying perturbation on a random subset of dimensions at each iteration. SPSA can be easily implemented and is highly efficient in that that it relies on measurements of the objective function, not on measurements of the gradient of the objective function. Since its invention, SPSA has been implemented in various fields such as reinforcement learning, production optimization etc. The paper briefly discuss the recent development of SPSA and its applications.

下载PDF全文

下载文献需遵守相关版权规定

论文标题