Abstract
As technology evolves rapidly, bipedal robots are being deployed in an increasingly wide range of environments. Compared with their wheeled counterparts, these robots have more degrees of freedom and are more complex to control, which makes maintaining balance and stability under changing wind speeds particularly difficult. Overcoming this challenge is critical because it enables bipedal robots to sustain more stable gaits during outdoor tasks, increasing safety and operational efficiency in outdoor settings. To move beyond the limitations of existing methods, this research introduces an adaptive bio-inspired exploration framework for bipedal robots facing wind disturbances, based on the Deep Deterministic Policy Gradient (DDPG) approach. The framework allows the robots to perceive their bodily states through wind force inputs and adaptively modify their exploration coefficients. Additionally, to address the convergence challenges posed by sparse rewards, this study incorporates Hindsight Experience Replay (HER) and a reward-reshaping strategy to provide safer and more effective training guidance for the agents. Simulation results show that robots using the proposed method explore stabilizing behaviors more quickly in complex conditions and improve on traditional DDPG algorithms in both training speed and walking distance.
Original language | English |
---|---|
Article number | 346 |
Journal | Biomimetics |
Volume | 9 |
Issue number | 6 |
Publication status | Published - 2024 Jun |
Keywords
- adaptive exploration
- biomimetics exploration
- biped robot
- reinforcement learning
- wind disturbance
ASJC Scopus subject areas
- Biotechnology
- Bioengineering
- Biomaterials
- Biochemistry
- Biomedical Engineering
- Molecular Medicine