Epoch-incremental reinforcement learning algorithms

In this article, a new class of the epoch-incremental reinforcement learning algorithm is proposed. In the incremental mode, the fundamental TD(0) or TD(λ) algorithm is performed and an environment model is created. In the epoch mode, on the basis of the environment model, the distances of past-active states to the terminal state are computed. These distances and the reinforcement terminal state signal are used to improve the agent policy.

Idioma:: Inglés

Calendario de la edición:: 4 veces al año
Temas de la revista:: Matemáticas, Matemáticas aplicadas

RSS Feed de revista

Epoch-incremental reinforcement learning algorithms

Roman Zajdel

Publicado en línea: 30 sept 2013

Páginas: 623 - 635

DOI: https://doi.org/10.2478/amcs-2013-0047

Palabras clavereinforcement learning, epoch-incremental algorithm, grid world

This content is open access.

Palabras clave
reinforcement learning, epoch-incremental algorithm, grid world