TY - JOUR
T1 - Efficient multitask learning with an embodied predictive model for door opening and entry with whole-body control
AU - Ito, Hiroshi
AU - Yamamoto, Kenjiro
AU - Mori, Hiroki
AU - Ogata, Tetsuya
N1 - Publisher Copyright:
© 2022 The Authors, some rights reserved;
PY - 2022/4/1
Y1 - 2022/4/1
N2 - Robots need robust models to effectively perform tasks that humans do on a daily basis. These models often require substantial developmental costs to maintain because they need to be adjusted and adapted over time. Deep reinforcement learning is a powerful approach for acquiring complex real-world models because there is no need for a human to design the model manually. Furthermore, a robot can establish new motions and optimal trajectories that may not have been considered by a human. However, the cost of learning is an issue because it requires a huge amount of trial and error in the real world. Here, we report a method for realizing complicated tasks in the real world with low design and teaching costs based on the principle of prediction error minimization. We devised a module integration method by introducing a mechanism that switches modules based on the prediction error of multiple modules. The robot generates appropriate motions according to the door’s position, color, and pattern with a low teaching cost. We also show that by calculating the prediction error of each module in real time, it is possible to execute a sequence of tasks (opening door outward and passing through) by linking multiple modules and responding to sudden changes in the situation and operating procedures. The experimental results show that the method is effective at enabling a robot to operate autonomously in the real world in response to changes in the environment.
AB - Robots need robust models to effectively perform tasks that humans do on a daily basis. These models often require substantial developmental costs to maintain because they need to be adjusted and adapted over time. Deep reinforcement learning is a powerful approach for acquiring complex real-world models because there is no need for a human to design the model manually. Furthermore, a robot can establish new motions and optimal trajectories that may not have been considered by a human. However, the cost of learning is an issue because it requires a huge amount of trial and error in the real world. Here, we report a method for realizing complicated tasks in the real world with low design and teaching costs based on the principle of prediction error minimization. We devised a module integration method by introducing a mechanism that switches modules based on the prediction error of multiple modules. The robot generates appropriate motions according to the door’s position, color, and pattern with a low teaching cost. We also show that by calculating the prediction error of each module in real time, it is possible to execute a sequence of tasks (opening door outward and passing through) by linking multiple modules and responding to sudden changes in the situation and operating procedures. The experimental results show that the method is effective at enabling a robot to operate autonomously in the real world in response to changes in the environment.
UR - http://www.scopus.com/inward/record.url?scp=85127648125&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127648125&partnerID=8YFLogxK
U2 - 10.1126/scirobotics.aax8177
DO - 10.1126/scirobotics.aax8177
M3 - Article
C2 - 35385295
AN - SCOPUS:85127648125
SN - 2470-9476
VL - 7
JO - Science Robotics
JF - Science Robotics
IS - 65
M1 - eaax8177
ER -