TY - GEN
T1 - Design of Power and Area Efficient Lower-Part-OR Approximate Multiplier
AU - Guo, Yi
AU - Sun, Heming
AU - Kimura, Shinji
PY - 2019/2/22
Y1 - 2019/2/22
N2 - Approximate computing has been paid attention as a promising technique to decrease power and area for error-tolerant applications by simplifying the internal operations with sacrificing their accuracy. In this paper, a new power and area efficient approximate multiplier is proposed using OR based compressor with no carry propagation for lower bit positions and a carry propagation compressor with inexact half adders and full adders for upper bit positions. The proposal is effective to reduce the critical path delay with almost the same precision with previous methods. Firstly, an inexact half adder and an inexact full adder are proposed and a construction method of 4×4 multiplier is shown. Then, a construction method of 8×8 multiplier is proposed using OR based compressor, approximate 4×4 multipliers and an accurate 4×4 multiplier. The proposed construction method can also be applied to 16×16 multiplier. The accuracy loss of proposed multipliers is evaluated using MATLAB simulation and that of the proposed 8×8 multiplier is low as 0.20%, the effect of which is shown to be negligible by applying to discrete cosine transform (DCT), inverse DCT and convolutional neural networks for image classification. The proposed 8×8 multiplier reduces power and area by 50.78% and 53.19%, respectively, compared with the accurate Wallace tree multiplier when evaluated using SMIC 40nm process.
AB - Approximate computing has been paid attention as a promising technique to decrease power and area for error-tolerant applications by simplifying the internal operations with sacrificing their accuracy. In this paper, a new power and area efficient approximate multiplier is proposed using OR based compressor with no carry propagation for lower bit positions and a carry propagation compressor with inexact half adders and full adders for upper bit positions. The proposal is effective to reduce the critical path delay with almost the same precision with previous methods. Firstly, an inexact half adder and an inexact full adder are proposed and a construction method of 4×4 multiplier is shown. Then, a construction method of 8×8 multiplier is proposed using OR based compressor, approximate 4×4 multipliers and an accurate 4×4 multiplier. The proposed construction method can also be applied to 16×16 multiplier. The accuracy loss of proposed multipliers is evaluated using MATLAB simulation and that of the proposed 8×8 multiplier is low as 0.20%, the effect of which is shown to be negligible by applying to discrete cosine transform (DCT), inverse DCT and convolutional neural networks for image classification. The proposed 8×8 multiplier reduces power and area by 50.78% and 53.19%, respectively, compared with the accurate Wallace tree multiplier when evaluated using SMIC 40nm process.
KW - approximate multiplier
KW - high performance
KW - OR gate
KW - top-down construction
UR - http://www.scopus.com/inward/record.url?scp=85063227472&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85063227472&partnerID=8YFLogxK
U2 - 10.1109/TENCON.2018.8650108
DO - 10.1109/TENCON.2018.8650108
M3 - Conference contribution
AN - SCOPUS:85063227472
T3 - IEEE Region 10 Annual International Conference, Proceedings/TENCON
SP - 2110
EP - 2115
BT - Proceedings of TENCON 2018 - 2018 IEEE Region 10 Conference
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2018 IEEE Region 10 Conference, TENCON 2018
Y2 - 28 October 2018 through 31 October 2018
ER -