TY - JOUR
T1 - Acquisition of Inducing Policy in Collaborative Robot Navigation Based on Multiagent Deep Reinforcement Learning
AU - Kamezaki, Mitsuhiro
AU - Ong, Ryan
AU - Sugano, Shigeki
N1 - Funding Information:
This work was supported in part by the Japan Science and Technology Agency (JST) PRESTO under Grant JPMJPR1754; in part by the Japan Society for the Promotion of Science (JSPS) KAKENHI under Grant 19H01130; and in part by the Waseda Research Institute Science and Engineering, Waseda University.
Publisher Copyright:
© 2013 IEEE.
PY - 2023
Y1 - 2023
N2 - To avoid inefficient movement or the freezing problem in crowded environments, we previously proposed a human-aware interactive navigation method that uses inducement, i.e., voice reminders or physical touch. However, the use of inducement largely depends on many factors, including human attributes, task contents, and environmental contexts. Thus, it is unrealistic to pre-design a set of parameters such as the coefficients in the cost function, personal space, and velocity in accordance with the situation. To understand and evaluate if inducement (voice reminder in this study) is effective and how and when it must be used, we propose to comprehend them through multiagent deep reinforcement learning in which the robot voluntarily acquires an inducing policy suitable for the situation. Specifically, we evaluate whether a voice reminder can improve the time to reach the goal by learning when the robot uses it. Results of simulation experiments with four different situations show that the robot could learn inducing policies suited for each situation, and the effectiveness of inducement is greatly improved in more congested and narrow situations.
AB - To avoid inefficient movement or the freezing problem in crowded environments, we previously proposed a human-aware interactive navigation method that uses inducement, i.e., voice reminders or physical touch. However, the use of inducement largely depends on many factors, including human attributes, task contents, and environmental contexts. Thus, it is unrealistic to pre-design a set of parameters such as the coefficients in the cost function, personal space, and velocity in accordance with the situation. To understand and evaluate if inducement (voice reminder in this study) is effective and how and when it must be used, we propose to comprehend them through multiagent deep reinforcement learning in which the robot voluntarily acquires an inducing policy suitable for the situation. Specifically, we evaluate whether a voice reminder can improve the time to reach the goal by learning when the robot uses it. Results of simulation experiments with four different situations show that the robot could learn inducing policies suited for each situation, and the effectiveness of inducement is greatly improved in more congested and narrow situations.
KW - Autonomous mobile robot
KW - collaborative robot navigation
KW - inducing policy acquisition
KW - multiagent deep reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85149844961&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149844961&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2023.3253513
DO - 10.1109/ACCESS.2023.3253513
M3 - Article
AN - SCOPUS:85149844961
SN - 2169-3536
VL - 11
SP - 23946
EP - 23955
JO - IEEE Access
JF - IEEE Access
ER -