TY - JOUR
T1 - A novel graph-based estimation of the distribution algorithm and its extension using reinforcement learning
AU - Li, Xianneng
AU - Mabu, Shingo
AU - Hirasawa, Kotaro
PY - 2014/2
Y1 - 2014/2
N2 - In recent years, numerous studies have drawn the success of estimation of distribution algorithms (EDAs) to avoid the frequent breakage of building blocks of the conventional stochastic genetic operators-based evolutionary algorithms (EAs). In this paper, a novel graph-based EDA called probabilistic model building genetic network programming (PMBGNP) is proposed. Using the distinguished graph (network) structure of a graph-based EA called genetic network programming (GNP), PMBGNP ensures higher expression ability than the conventional EDAs to solve some specific problems. Furthermore, an extended algorithm called reinforced PMBGNP is proposed to combine PMBGNP and reinforcement learning to enhance the performance in terms of fitness values, search speed, and reliability. The proposed algorithms are applied to solve the problems of controlling the agents' behavior. Two problems are selected to demonstrate the effectiveness of the proposed algorithms, including the benchmark one, i.e., the Tileworld system, and a real mobile robot control.
AB - In recent years, numerous studies have drawn the success of estimation of distribution algorithms (EDAs) to avoid the frequent breakage of building blocks of the conventional stochastic genetic operators-based evolutionary algorithms (EAs). In this paper, a novel graph-based EDA called probabilistic model building genetic network programming (PMBGNP) is proposed. Using the distinguished graph (network) structure of a graph-based EA called genetic network programming (GNP), PMBGNP ensures higher expression ability than the conventional EDAs to solve some specific problems. Furthermore, an extended algorithm called reinforced PMBGNP is proposed to combine PMBGNP and reinforcement learning to enhance the performance in terms of fitness values, search speed, and reliability. The proposed algorithms are applied to solve the problems of controlling the agents' behavior. Two problems are selected to demonstrate the effectiveness of the proposed algorithms, including the benchmark one, i.e., the Tileworld system, and a real mobile robot control.
KW - Agent control
KW - estimation of distribution algorithm (EDA)
KW - genetic network programming (GNP)
KW - graph structure
KW - reinforcement learning (RL)
UR - http://www.scopus.com/inward/record.url?scp=84893820282&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893820282&partnerID=8YFLogxK
U2 - 10.1109/TEVC.2013.2238240
DO - 10.1109/TEVC.2013.2238240
M3 - Article
AN - SCOPUS:84893820282
SN - 1089-778X
VL - 18
SP - 98
EP - 113
JO - IEEE Transactions on Evolutionary Computation
JF - IEEE Transactions on Evolutionary Computation
IS - 1
M1 - 6408015
ER -