TY - GEN
T1 - Hierarchical parallelism control for multigrain parallel processing
AU - Obata, Motoki
AU - Shirako, Jun
AU - Kaminaga, Hiroki
AU - Ishizaka, Kazuhisa
AU - Kasahara, Hironori
PY - 2005/12/1
Y1 - 2005/12/1
N2 - To improve effective performance and usability of shared memory multiprocessor systems, a multi-grain compilation scheme, which hierarchically exploits coarse grain parallelism among loops, subroutines and basic blocks, conventional loop parallelism and near fine grain parallelism among statements inside a basic block, is important. In order to efficiently use hierarchical parallelism of each nest level, or layer, in multigrain parallel processing, it is required to determine how many processors or groups of processors should be assigned to each layer, according to the parallelism of the layer. This paper proposes an automatic hierarchical parallelism control scheme to assign suitable number of processors to each layer so that the parallelism of each hierarchy can be used efficiently. Performance of the proposed scheme is evaluated on IBM RS6000 SMP server with 8 processors using 8 programs of SPEC95FP.
AB - To improve effective performance and usability of shared memory multiprocessor systems, a multi-grain compilation scheme, which hierarchically exploits coarse grain parallelism among loops, subroutines and basic blocks, conventional loop parallelism and near fine grain parallelism among statements inside a basic block, is important. In order to efficiently use hierarchical parallelism of each nest level, or layer, in multigrain parallel processing, it is required to determine how many processors or groups of processors should be assigned to each layer, according to the parallelism of the layer. This paper proposes an automatic hierarchical parallelism control scheme to assign suitable number of processors to each layer so that the parallelism of each hierarchy can be used efficiently. Performance of the proposed scheme is evaluated on IBM RS6000 SMP server with 8 processors using 8 programs of SPEC95FP.
UR - http://www.scopus.com/inward/record.url?scp=33745129333&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745129333&partnerID=8YFLogxK
U2 - 10.1007/11596110_3
DO - 10.1007/11596110_3
M3 - Conference contribution
AN - SCOPUS:33745129333
SN - 3540307818
SN - 9783540307815
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 31
EP - 44
BT - Languages and Compilers for Parallel Computing - 15th Workshop, LCPC 2002, Revised Papers
T2 - 15th Workshop on Languages and Compilers for Parallel Computing, LCPC 2002
Y2 - 25 July 2002 through 27 July 2002
ER -