TY - JOUR
T1 - A low-power shared cache design with modified PID controller for efficient multicore embedded systems
AU - Zhao, Huatao
AU - Ye, Jiongyao
AU - Watanabe, Takahiro
N1 - Publisher Copyright:
© 2019 Information Processing Society of Japan.
PY - 2019
Y1 - 2019
N2 - Nowadays, on-chip cache scales are oversized in multicore embedded systems, and those caches even consume half of the total energy debit. However, we observe that a large portion of cache banks are wasted, meaning that those banks are rarely used but consume a great deal of energy during their entire lifetime. In this paper, we propose a controllable shared last level cache (SLLC) scheme to dynamically trace cache bank demands for each thread. Thus, energy on useless banks can be largely saved. Specifically, we (1) propose an effective cache bank allocating policy to explore bank demands corresponding to executed applications, (2) discuss the interrelations between application locality change and bank demands for further energy saving and (3) represent a modified PID controller to generate optimal banks for each thread. Experimental results show that our controllable SLLC design can save on average 39.7 percent shared cache access energy over conventional cache, while its performance is slightly improved and additional hardware overhead is less than 0.6 percent.
AB - Nowadays, on-chip cache scales are oversized in multicore embedded systems, and those caches even consume half of the total energy debit. However, we observe that a large portion of cache banks are wasted, meaning that those banks are rarely used but consume a great deal of energy during their entire lifetime. In this paper, we propose a controllable shared last level cache (SLLC) scheme to dynamically trace cache bank demands for each thread. Thus, energy on useless banks can be largely saved. Specifically, we (1) propose an effective cache bank allocating policy to explore bank demands corresponding to executed applications, (2) discuss the interrelations between application locality change and bank demands for further energy saving and (3) represent a modified PID controller to generate optimal banks for each thread. Experimental results show that our controllable SLLC design can save on average 39.7 percent shared cache access energy over conventional cache, while its performance is slightly improved and additional hardware overhead is less than 0.6 percent.
KW - Cache optimization
KW - Chip multicore
KW - Embedded system
KW - Low power
KW - PID control
UR - http://www.scopus.com/inward/record.url?scp=85062393241&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062393241&partnerID=8YFLogxK
U2 - 10.2197/ipsjjip.27.149
DO - 10.2197/ipsjjip.27.149
M3 - Article
AN - SCOPUS:85062393241
SN - 0387-5806
VL - 27
SP - 149
EP - 158
JO - Journal of Information Processing
JF - Journal of Information Processing
ER -