TY - JOUR
T1 - A bitwidth-aware high-level synthesis algorithm using operation chainings for tiled-DR architectures
AU - Terada, Kotaro
AU - Yanagisawa, Masao
AU - Togawa, Nozomu
N1 - Publisher Copyright:
Copyright © 2017 The Institute of Electronics, Information and Communication Engineers.
PY - 2017/12
Y1 - 2017/12
N2 - As application hardware designs and implementations in a short term are required, high-level synthesis is more and more essential EDA technique nowadays. In deep-submicron era, interconnection delays are not negligible even in high-level synthesis thus distributed-register and - controller architectures (DR architectures) have been proposed in order to cope with this problem. It is also profitable to take data-bitwidth into account in high-level synthesis. In this paper, we propose a bitwidth-aware high-level synthesis algorithm using operation chainings targeting Tiled-DR architectures. Our proposed algorithm optimizes bitwidths of functional units and utilizes the vacant tiles by adding some extra functional units to realize effective operation chainings to generate high performance circuits without increasing the total area. Experimental results show that our proposed algorithm reduces the overall latency by up to 47% comparedtothe conventional approach without area overheads by eliminating unnecessary bitwidths and adding efficient extra FUs for Tiled-DR architectures.
AB - As application hardware designs and implementations in a short term are required, high-level synthesis is more and more essential EDA technique nowadays. In deep-submicron era, interconnection delays are not negligible even in high-level synthesis thus distributed-register and - controller architectures (DR architectures) have been proposed in order to cope with this problem. It is also profitable to take data-bitwidth into account in high-level synthesis. In this paper, we propose a bitwidth-aware high-level synthesis algorithm using operation chainings targeting Tiled-DR architectures. Our proposed algorithm optimizes bitwidths of functional units and utilizes the vacant tiles by adding some extra functional units to realize effective operation chainings to generate high performance circuits without increasing the total area. Experimental results show that our proposed algorithm reduces the overall latency by up to 47% comparedtothe conventional approach without area overheads by eliminating unnecessary bitwidths and adding efficient extra FUs for Tiled-DR architectures.
KW - Distributed-register architecture
KW - Floorplan
KW - High-level synthesis
KW - Interconnection delay
KW - Multiple-bitwidth
KW - Operation chaining
UR - http://www.scopus.com/inward/record.url?scp=85038216925&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85038216925&partnerID=8YFLogxK
U2 - 10.1587/transfun.E100.A.2911
DO - 10.1587/transfun.E100.A.2911
M3 - Article
AN - SCOPUS:85038216925
SN - 0916-8508
VL - E100A
SP - 2911
EP - 2924
JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
IS - 12
ER -