TY - GEN
T1 - Distributed service area control for ride sharing by using multi-agent deep reinforcement learning
AU - Yoshida, Naoki
AU - Noda, Itsuki
AU - Sugawara, Toshiharu
N1 - Funding Information:
This work was partly supported by JSPS KAKENHI (17KT0044) and JST-Mirai Program Grant Number JPMJMI19B5, Japan.
Publisher Copyright:
© 2021 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved
PY - 2021
Y1 - 2021
N2 - We propose a decentralized system to determine where ride-sharing vehicle agents should wait for passengers using multi-agent deep reinforcement learning. Although numerous drivers have begun participating in ride-sharing services as the demand for these services has increased, much of their time is idle. The result is not only inefficiency but also wasted energy and increased traffic congestion in metropolitan area, while also causing a shortage of ride-sharing vehicles in the surrounding areas. We therefore developed the distributed service area adaptation method for ride sharing (dSAAMS) to decide the areas where each agent should wait for passengers through deep reinforcement learning based on the networks of individual agents and the demand prediction data provided by an external system. We evaluated the performance and characteristics of our proposed method in a simulated environment with varied demand occurrence patterns and by using actual data obtained in the Manhattan area. We compare the performance of our method to that of other conventional methods and the centralized version of the dSAAMS. Our experiments indicate that by using the dSAAMS, agents individually wait and move more effectively around their service territory, provide better quality service, and exhibit better performance in dynamically changing environments than when using the comparison methods.
AB - We propose a decentralized system to determine where ride-sharing vehicle agents should wait for passengers using multi-agent deep reinforcement learning. Although numerous drivers have begun participating in ride-sharing services as the demand for these services has increased, much of their time is idle. The result is not only inefficiency but also wasted energy and increased traffic congestion in metropolitan area, while also causing a shortage of ride-sharing vehicles in the surrounding areas. We therefore developed the distributed service area adaptation method for ride sharing (dSAAMS) to decide the areas where each agent should wait for passengers through deep reinforcement learning based on the networks of individual agents and the demand prediction data provided by an external system. We evaluated the performance and characteristics of our proposed method in a simulated environment with varied demand occurrence patterns and by using actual data obtained in the Manhattan area. We compare the performance of our method to that of other conventional methods and the centralized version of the dSAAMS. Our experiments indicate that by using the dSAAMS, agents individually wait and move more effectively around their service territory, provide better quality service, and exhibit better performance in dynamically changing environments than when using the comparison methods.
KW - Deep reinforcement learning
KW - Multi-agent learning
KW - Ride sharing
KW - Transportation and logistics
UR - http://www.scopus.com/inward/record.url?scp=85103837247&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85103837247&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85103837247
T3 - ICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence
SP - 101
EP - 112
BT - ICAART 2021 - Proceedings of the 13th International Conference on Agents and Artificial Intelligence
A2 - Rocha, Ana Paula
A2 - Steels, Luc
A2 - van den Herik, Jaap
PB - SciTePress
T2 - 13th International Conference on Agents and Artificial Intelligence, ICAART 2021
Y2 - 4 February 2021 through 6 February 2021
ER -