TY - GEN
T1 - A parallelizing compiler cooperative heterogeneous multicore processor architecture
AU - Wada, Yasutaka
AU - Hayashi, Akihiro
AU - Masuura, Takeshi
AU - Shirako, Jun
AU - Nakano, Hirofumi
AU - Shikano, Hiroaki
AU - Kimura, Keiji
AU - Kasahara, Hironori
N1 - Copyright:
Copyright 2021 Elsevier B.V., All rights reserved.
PY - 2011
Y1 - 2011
N2 - Heterogeneous multicore architectures, integrating several kinds of accelerator cores in addition to general purpose processor cores, have been attracting much attention to realize high performance with low power consumption. To attain effective high performance, high application software productivity, and low power consumption on heterogeneous multicores, cooperation between an architecture and a parallelizing compiler is important. This paper proposes a compiler cooperative heterogeneous multicore architecture and parallelizing compilation scheme for it. Performance of the proposed scheme is evaluated on the heterogeneous multicore integrating Hitachi and Renesas' SH4A processor cores and Hitachi's FE-GA accelerator cores, using an MP3 encoder. The heterogeneous multicore gives us 14.34 times speedup with two SH4As and two FE-GAs, and 26.05 times speedup with four SH4As and four FE-GAs against sequential execution with a single SH4A. The cooperation between the heterogeneous multicore architecture and the parallelizing compiler enables to achieve high performance in a short development period.
AB - Heterogeneous multicore architectures, integrating several kinds of accelerator cores in addition to general purpose processor cores, have been attracting much attention to realize high performance with low power consumption. To attain effective high performance, high application software productivity, and low power consumption on heterogeneous multicores, cooperation between an architecture and a parallelizing compiler is important. This paper proposes a compiler cooperative heterogeneous multicore architecture and parallelizing compilation scheme for it. Performance of the proposed scheme is evaluated on the heterogeneous multicore integrating Hitachi and Renesas' SH4A processor cores and Hitachi's FE-GA accelerator cores, using an MP3 encoder. The heterogeneous multicore gives us 14.34 times speedup with two SH4As and two FE-GAs, and 26.05 times speedup with four SH4As and four FE-GAs against sequential execution with a single SH4A. The cooperation between the heterogeneous multicore architecture and the parallelizing compiler enables to achieve high performance in a short development period.
UR - http://www.scopus.com/inward/record.url?scp=84856611294&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84856611294&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-24568-8_11
DO - 10.1007/978-3-642-24568-8_11
M3 - Conference contribution
AN - SCOPUS:84856611294
SN - 9783642245671
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 215
EP - 233
BT - Transactions on High-Performance Embedded Architectures and Compilers IV
PB - Springer Verlag
ER -