TY - GEN
T1 - DASH Live Video Streaming Control Using Actor-Critic Reinforcement Learning Method
AU - Wei, Bo
AU - Song, Hang
AU - Nguyen, Quang Ngoc
AU - Katto, Jiro
N1 - Funding Information:
Acknowledgement. This research is supported by JSPS KAKENHI Grant Number 20K14740 and Waseda University Grant for Special Research Projects (Project Number: 2021C-132, 2021E-013).
Publisher Copyright:
© 2022, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.
PY - 2022
Y1 - 2022
N2 - With the COVID19 pandemic, video streaming traffic is increasing rapidly. Especially, the live streaming traffic accounts for large amount due to the fact that many events have been switched to the online forms. Therefore, the demand to ensure a high-quality streaming experience is increasing urgently. Since the network condition is expected to fluctuate dynamically, the video streaming needs to be controlled adaptively according to the network condition to provide high quality of experience (QoE) for users. In this paper, a method was proposed to control the live video streaming using the actor-critic reinforcement learning (RL) technique. In this method, the historical video streaming logs such as throughput, buffer size, rebuffering time, latency are taken consideration as the states of RL, then the model is established to map the states to an action such as bitrate decision. In this study, the live streaming simulation is utilized to evaluate the method since the model needs training and the simulation can generate data much faster than real experiment. Experiments were conducted to evaluate the proposed method. Results demonstrate that the total QoE in Bus and Car scenarios show the best performance. The QoE of Tram case shows the lowest due to the low bandwidth.
AB - With the COVID19 pandemic, video streaming traffic is increasing rapidly. Especially, the live streaming traffic accounts for large amount due to the fact that many events have been switched to the online forms. Therefore, the demand to ensure a high-quality streaming experience is increasing urgently. Since the network condition is expected to fluctuate dynamically, the video streaming needs to be controlled adaptively according to the network condition to provide high quality of experience (QoE) for users. In this paper, a method was proposed to control the live video streaming using the actor-critic reinforcement learning (RL) technique. In this method, the historical video streaming logs such as throughput, buffer size, rebuffering time, latency are taken consideration as the states of RL, then the model is established to map the states to an action such as bitrate decision. In this study, the live streaming simulation is utilized to evaluate the method since the model needs training and the simulation can generate data much faster than real experiment. Experiments were conducted to evaluate the proposed method. Results demonstrate that the total QoE in Bus and Car scenarios show the best performance. The QoE of Tram case shows the lowest due to the low bandwidth.
KW - Actor-critic
KW - Dash
KW - Live streaming
KW - QoE
UR - http://www.scopus.com/inward/record.url?scp=85124038850&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85124038850&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-94763-7_2
DO - 10.1007/978-3-030-94763-7_2
M3 - Conference contribution
AN - SCOPUS:85124038850
SN - 9783030947620
T3 - Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST
SP - 17
EP - 24
BT - Mobile Networks and Management - 11th EAI International Conference, MONAMI 2021, Proceedings
A2 - Calafate, Carlos T.
A2 - Chen, Xianfu
A2 - Wu, Yuan
PB - Springer Science and Business Media Deutschland GmbH
T2 - 11th EAI International Conference on Mobile Networks and Management, MONAMI 2021
Y2 - 27 October 2021 through 29 October 2021
ER -