TY - GEN
T1 - Sample efficiency analysis of Neuroevolution algorithms on a quadruped robot
AU - Xu, Shengbo
AU - Moriguch, Hirotaka
AU - Honiden, Shinichi
PY - 2013
Y1 - 2013
N2 - In reinforcement learning tasks with continuous state-action, parameterized policy search has been known to be a powerful method. Applying NeuroEvolution (NE) to optimizing the policy represented by artificial neural network (ANN) is a particularly active research field. In most cases, NE algorithms cost a large amount of trial-and-error (episode) to optimize policies. However, due to time and cost constraints, researchers and practitioners cannot repeat a number of episodes on physical robots. Thus, choosing an efficient NE algorithm is a key to optimize policies with limited time and cost. In this work, our goal is to help users to choose an efficient NE algorithm. We compare and analyze sample efficiency of two successful state-of-the-art NE algorithms: CMA-NeuroES and NEAT in a gait generation task of a quadruped robot. Moreover, we run both algorithms with various initial topologies in order to analyze the performance difference between each topology. From experimental results, we show CMA-NeuroES outperforms NEAT regardless of initial topologies when the limited number of episodes can be executed. Additional experiments conclude that the optimization method for connection weights in NEAT results in its inferior performance to CMA-NeuroES, while a probability-weighted averaging characteristic and self-adaptive factors make CMA-NeuroES to be advantageous.
AB - In reinforcement learning tasks with continuous state-action, parameterized policy search has been known to be a powerful method. Applying NeuroEvolution (NE) to optimizing the policy represented by artificial neural network (ANN) is a particularly active research field. In most cases, NE algorithms cost a large amount of trial-and-error (episode) to optimize policies. However, due to time and cost constraints, researchers and practitioners cannot repeat a number of episodes on physical robots. Thus, choosing an efficient NE algorithm is a key to optimize policies with limited time and cost. In this work, our goal is to help users to choose an efficient NE algorithm. We compare and analyze sample efficiency of two successful state-of-the-art NE algorithms: CMA-NeuroES and NEAT in a gait generation task of a quadruped robot. Moreover, we run both algorithms with various initial topologies in order to analyze the performance difference between each topology. From experimental results, we show CMA-NeuroES outperforms NEAT regardless of initial topologies when the limited number of episodes can be executed. Additional experiments conclude that the optimization method for connection weights in NEAT results in its inferior performance to CMA-NeuroES, while a probability-weighted averaging characteristic and self-adaptive factors make CMA-NeuroES to be advantageous.
KW - CMA-NeuroES
KW - NEAT
KW - evolution
KW - neural network
KW - neuroevolution
UR - http://www.scopus.com/inward/record.url?scp=84881586655&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84881586655&partnerID=8YFLogxK
U2 - 10.1109/CEC.2013.6557826
DO - 10.1109/CEC.2013.6557826
M3 - Conference contribution
AN - SCOPUS:84881586655
SN - 9781479904549
T3 - 2013 IEEE Congress on Evolutionary Computation, CEC 2013
SP - 2170
EP - 2177
BT - 2013 IEEE Congress on Evolutionary Computation, CEC 2013
T2 - 2013 IEEE Congress on Evolutionary Computation, CEC 2013
Y2 - 20 June 2013 through 23 June 2013
ER -