Accesso libero

Reinforcement learning in discrete and continuous domains applied to ship trajectory generation

INFORMAZIONI SU QUESTO ARTICOLO

This paper presents the application of the reinforcement learning algorithmsto the task of autonomous determination of the ship trajectory during thein-harbour and harbour approaching manoeuvres. Authors used Markovdecision processes formalism to build up the background of algorithmpresentation. Two versions of RL algorithms were tested in the simulations:discrete (Q-learning) and continuous form (Least-Squares Policy Iteration).The results show that in both cases ship trajectory can be found. Howeverdiscrete Q-learning algorithm suffered from many limitations (mainly curseof dimensionality) and practically is not applicable to the examined task. On the other hand, LSPI gavepromising results. To be fully operational, proposed solution should be extended by taking into accountship heading and velocity and coupling with advanced multi-variable controller.

ISSN:
1233-2585
Lingua:
Inglese
Frequenza di pubblicazione:
4 volte all'anno
Argomenti della rivista:
Engineering, Introductions and Overviews, other, Geosciences, Atmospheric Science and Climatology, Life Sciences