Reinforcement learning in discrete and continuous domains applied to ship trajectory generation

This paper presents the application of reinforcement learning algorithms to the task of autonomous determination of the ship trajectory during in-harbour and harbour-approach manoeuvres. The authors use the Markov decision process formalism as the background for presenting the algorithms. Two RL algorithms were tested in simulation: a discrete one (Q-learning) and a continuous one (Least-Squares Policy Iteration, LSPI). The results show that a ship trajectory can be found in both cases. However, the discrete Q-learning algorithm suffered from many limitations (mainly the curse of dimensionality) and is not practically applicable to the examined task. LSPI, on the other hand, gave promising results. To be fully operational, the proposed solution should be extended to take ship heading and velocity into account and be coupled with an advanced multi-variable controller.
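To illustrate the discrete approach mentioned in the abstract, the following is a minimal sketch of tabular Q-learning on a toy position grid. It is not the authors' implementation: the 5 x 5 grid, the four compass moves, the reward of -1 per step, the parameter values, and the names `step`, `q_learning` and `greedy_trajectory` are all assumptions chosen for illustration.

```python
import random

# Hypothetical action set: unit moves east, west, north, south on a grid.
ACTIONS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def step(state, action, size, goal):
    """Apply a move, clamp it to the grid, and return (next_state, reward)."""
    dx, dy = ACTIONS[action]
    nx = min(max(state[0] + dx, 0), size - 1)
    ny = min(max(state[1] + dy, 0), size - 1)
    nxt = (nx, ny)
    # Assumed reward: -1 per move, 0 on reaching the goal cell.
    return nxt, (0.0 if nxt == goal else -1.0)


def q_learning(size=5, start=(0, 0), goal=(4, 4), episodes=500,
               alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    n_actions = len(ACTIONS)
    # One table entry per (cell, action) pair.
    q = {((x, y), a): 0.0
         for x in range(size) for y in range(size) for a in range(n_actions)}
    for _ in range(episodes):
        state = start
        for _ in range(200):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(n_actions)
            else:
                a = max(range(n_actions), key=lambda k: q[(state, k)])
            nxt, reward = step(state, a, size, goal)
            # The goal is terminal, so no bootstrapped value is added there.
            if nxt == goal:
                target = reward
            else:
                target = reward + gamma * max(q[(nxt, k)] for k in range(n_actions))
            q[(state, a)] += alpha * (target - q[(state, a)])
            state = nxt
            if state == goal:
                break
    return q


def greedy_trajectory(q, size=5, start=(0, 0), goal=(4, 4), max_steps=50):
    """Follow the greedy policy from start and return the visited cells."""
    path = [start]
    state = start
    for _ in range(max_steps):
        if state == goal:
            break
        a = max(range(len(ACTIONS)), key=lambda k: q[(state, k)])
        state, _reward = step(state, a, size, goal)
        path.append(state)
    return path


q = q_learning()
print(len(q))                # number of table entries
print(greedy_trajectory(q))  # sequence of grid cells from start towards goal
```

In this toy setting the table holds 5 x 5 x 4 = 100 entries. If the state were extended with, say, 36 heading bins and 10 velocity bins (hypothetical figures), the table would grow to 36,000 entries. This gives a rough sense of the dimensionality problem the abstract attributes to the discrete method.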

ISSN: 1233-2585
Language: English

Publication frequency: 4 times per year
Journal subjects: Engineering, Introductions and Overviews, other, Geosciences, Atmospheric Science and Climatology, Life Sciences

Published online: 31 Oct 2012

Pages: 31-36

DOI: https://doi.org/10.2478/v10012-012-0020-8

Keywords: ship motion control, trajectory generation, autonomous navigation, reinforcement learning, least-squares policy iteration

This content is open access.
