TY - GEN
T1 - Curriculum-based Offline Network Training for Improvement of Peg-in-hole Task Performance for Holes in Concrete
AU - Yasutomi, Andre Yuji
AU - Mori, Hiroki
AU - Ogata, Tetsuya
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - A method for reducing the time required to train a deep neural network (DNN) via deep reinforcement learning (DRL) to enable a robot to conduct anchor-bolt insertion, a peg-in-hole task for holes in concrete, is proposed. The proposed method is also intended to reduce task execution time. The method consists of two steps. The first step involves creating a map of state observations and search results for holes opened in a concrete wall and using this map to train the DNN via DRL in an offline manner. The second step involves training the DNN with a curriculum that involves gradually increasing the step-size options the DNN can output to command the robot. Experimental evaluations of the method demonstrate that the offline training reduces DNN training time by about 87.5%, while enabling task execution with success rates and execution times that are similar to those obtained with a DNN trained online. Moreover, the evaluations show that curriculum training reduces task execution time, and enables execution of the peg-in-hole task for unknown holes with success rate of 97.5% and execution time of 7.77 s. This result represents a 12.8% higher success rate and a 4.71 s shorter execution time than those obtained with a DNN trained online. These results demonstrate the effectiveness of the proposed method and its applicability to the construction industry. Although the proposed method was applied to anchor-bolt insertion, it can be extended to any other peg-in-hole tasks conducted in discrete steps.
AB - A method for reducing the time required to train a deep neural network (DNN) via deep reinforcement learning (DRL) to enable a robot to conduct anchor-bolt insertion, a peg-in-hole task for holes in concrete, is proposed. The proposed method is also intended to reduce task execution time. The method consists of two steps. The first step involves creating a map of state observations and search results for holes opened in a concrete wall and using this map to train the DNN via DRL in an offline manner. The second step involves training the DNN with a curriculum that involves gradually increasing the step-size options the DNN can output to command the robot. Experimental evaluations of the method demonstrate that the offline training reduces DNN training time by about 87.5%, while enabling task execution with success rates and execution times that are similar to those obtained with a DNN trained online. Moreover, the evaluations show that curriculum training reduces task execution time, and enables execution of the peg-in-hole task for unknown holes with success rate of 97.5% and execution time of 7.77 s. This result represents a 12.8% higher success rate and a 4.71 s shorter execution time than those obtained with a DNN trained online. These results demonstrate the effectiveness of the proposed method and its applicability to the construction industry. Although the proposed method was applied to anchor-bolt insertion, it can be extended to any other peg-in-hole tasks conducted in discrete steps.
UR - http://www.scopus.com/inward/record.url?scp=85126260795&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126260795&partnerID=8YFLogxK
U2 - 10.1109/SII52469.2022.9708766
DO - 10.1109/SII52469.2022.9708766
M3 - Conference contribution
AN - SCOPUS:85126260795
T3 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
SP - 712
EP - 717
BT - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE/SICE International Symposium on System Integration, SII 2022
Y2 - 9 January 2022 through 12 January 2022
ER -