This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.