Acceso abierto

Reinforcement learning in discrete and continuous domains applied to ship trajectory generation

This paper presents the application of the reinforcement learning algorithmsto the task of autonomous determination of the ship trajectory during thein-harbour and harbour approaching manoeuvres. Authors used Markovdecision processes formalism to build up the background of algorithmpresentation. Two versions of RL algorithms were tested in the simulations:discrete (Q-learning) and continuous form (Least-Squares Policy Iteration).The results show that in both cases ship trajectory can be found. Howeverdiscrete Q-learning algorithm suffered from many limitations (mainly curseof dimensionality) and practically is not applicable to the examined task. On the other hand, LSPI gavepromising results. To be fully operational, proposed solution should be extended by taking into accountship heading and velocity and coupling with advanced multi-variable controller.

ISSN:
1233-2585
Idioma:
Inglés
Calendario de la edición:
4 veces al año
Temas de la revista:
Engineering, Introductions and Overviews, other, Geosciences, Atmospheric Science and Climatology, Life Sciences