TY - GEN
T1 - Biped Robot Terrain Adaptability Based on Improved SAC Algorithm
AU - Zhang, Yilin
AU - Xie, Jianan
AU - Du, Xiaohan
AU - Sun, Huimin
AU - Wang, Shanshan
AU - Hashimoto, Kenji
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - This study introduces an improved Soft Actor-Critic (SAC) algorithm designed to improve the gait stability of biped robots in complex terrain conditions. The core innovations lie in the redesign of the network model and the creation of a tailored reward function. The network model revision enhances the learning process and responsiveness of the robot to varied terrain conditions, while the customized reward function ensures effective adaptation and stability maintenance. Experimental results show that biped robots using our advanced learning model exhibit substantial improvements in stability while navigating complex terrains, compared to those employing traditional methods. These improvements significantly increase the robustness and adaptability of the algorithm, enabling it to effectively meet diverse environmental challenges. This research marks a significant step forward in the development of advanced and reliable biped robot systems, emphasizing the power of deep reinforcement learning to transcend the limitations of conventional robotic control approaches, especially in complex environmental interactions.
AB - This study introduces an improved Soft Actor-Critic (SAC) algorithm designed to improve the gait stability of biped robots in complex terrain conditions. The core innovations lie in the redesign of the network model and the creation of a tailored reward function. The network model revision enhances the learning process and responsiveness of the robot to varied terrain conditions, while the customized reward function ensures effective adaptation and stability maintenance. Experimental results show that biped robots using our advanced learning model exhibit substantial improvements in stability while navigating complex terrains, compared to those employing traditional methods. These improvements significantly increase the robustness and adaptability of the algorithm, enabling it to effectively meet diverse environmental challenges. This research marks a significant step forward in the development of advanced and reliable biped robot systems, emphasizing the power of deep reinforcement learning to transcend the limitations of conventional robotic control approaches, especially in complex environmental interactions.
KW - Biped robot
KW - Gait control
KW - Reinforcement learning
KW - Soft actor-critic
UR - http://www.scopus.com/inward/record.url?scp=85195843928&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85195843928&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-60618-2_8
DO - 10.1007/978-3-031-60618-2_8
M3 - Conference contribution
AN - SCOPUS:85195843928
SN - 9783031606175
T3 - Mechanisms and Machine Science
SP - 93
EP - 104
BT - Proceedings of MSR-RoManSy 2024 - Combined IFToMM Symposium of RoManSy and USCToMM Symposium on Mechanical Systems and Robotics
A2 - Larochelle, Pierre
A2 - McCarthy, J. Michael
A2 - Lusk, Craig P.
PB - Springer Science and Business Media B.V.
T2 - Joint Mechanical Systems and Robotics and RoManSy Symposium, MSR-RoManSy 2024
Y2 - 22 May 2024 through 25 May 2024
ER -