TY - GEN
T1 - Autonomous learning of target decision strategies without communications for continuous coordinated cleaning tasks
AU - Yoneda, Keisuke
AU - Kato, Chihiro
AU - Sugawara, Toshiharu
PY - 2013
Y1 - 2013
N2 - We propose a method for the autonomous learning of target decision strategies for coordination in the continuous cleaning domain. With ongoing advances in computer and sensor technologies, we can expect robot applications for covering large areas that often require coordinated/cooperative activities by multiple robots. In this paper, we focus the cleaning tasks by multiple robots or by agents, software to control the robots. We assume that agents cannot directly exchange internal information such as plans and targets for coordination, but rather individually learn their target decision strategies by observing how much trash/dirt has been vacuumed up in the multi-agent system environments. We experimentally evaluated the proposed method by comparing its performance with those obtained by the regimes of agents with a single strategy. Results showed that the proposed method enables agents to select target decision strategies from their own perspectives, resulting in the appropriate combinations of multiple strategies.
AB - We propose a method for the autonomous learning of target decision strategies for coordination in the continuous cleaning domain. With ongoing advances in computer and sensor technologies, we can expect robot applications for covering large areas that often require coordinated/cooperative activities by multiple robots. In this paper, we focus the cleaning tasks by multiple robots or by agents, software to control the robots. We assume that agents cannot directly exchange internal information such as plans and targets for coordination, but rather individually learn their target decision strategies by observing how much trash/dirt has been vacuumed up in the multi-agent system environments. We experimentally evaluated the proposed method by comparing its performance with those obtained by the regimes of agents with a single strategy. Results showed that the proposed method enables agents to select target decision strategies from their own perspectives, resulting in the appropriate combinations of multiple strategies.
KW - Coordination
KW - Learning
KW - Multi-robot sweeping
KW - Robot patrolling
UR - http://www.scopus.com/inward/record.url?scp=84893330539&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893330539&partnerID=8YFLogxK
U2 - 10.1109/WI-IAT.2013.112
DO - 10.1109/WI-IAT.2013.112
M3 - Conference contribution
AN - SCOPUS:84893330539
SN - 9781479929023
T3 - Proceedings - 2013 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
SP - 216
EP - 223
BT - Proceedings - 2013 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
PB - IEEE Computer Society
T2 - 2013 12th IEEE/WIC/ACM International Conference on Intelligent Agent Technology, IAT 2013
Y2 - 17 November 2013 through 20 November 2013
ER -