Seek of an Optimal Way by Q-Learning
Y. Dahmani and A. Benyettou
DOI : 10.3844/jcssp.2005.28.30
Journal of Computer Science
Volume 1, Issue 1
In this article, we presented the Q-Learning training method which is a derivative of the reinforcement learning called sometimes training by penalty-reward. We illustrate this by an application to the mobility of a mobile in an enclosure closed on the basis of a starting point towards an unspecified arrival point. The objective is to find an optimal way optimal without leaving the enclosure.
© 2005 Y. Dahmani and A. Benyettou. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.