TY - GEN
T1 - Reconfigurable SAD tree architecture based on adaptive sub-sampling in HDTV application
AU - Huang, Yiqing
AU - Liu, Qin
AU - Goto, Satoshi
AU - Ikenaga, Takeshi
PY - 2009
Y1 - 2009
N2 - In H.264/AVC based integer motion estimation engine, fixed architectures based on full pixel or direct sub-sampling pattern are widely used for HDTV application. However, these architectures suffer from either high complexity or quality loss problems. In this paper, an adaptive sub-sampling based reconfigurable architecture is given out. Firstly, by executing pixel difference analysis, the adaptive sub-sampling scheme which uses three hardware friendly patterns is applied on homogeneous macroblock (MB). Secondly, the related architecture introduces one more pipeline stage to build up configurable partial SAD values so that system performance is enhanced. Thirdly, a two-level pixel data organization scheme is proposed to solve data reuse and hardware utilization problems caused by adaptive algorithm. Moreover, one cross based SAD generation structure is introduced to achieve adaptive output results with less hardware cost. Experimental results show that, the proposed architecture can averagely save 61.71% clock cycles and accomplish twice or four times processing capability for homogeneous MBs. The maximum clock frequency is 208MHz under the TSMC 0.18μm technology in worst case conditions(1.62V, 125°C).
AB - In H.264/AVC based integer motion estimation engine, fixed architectures based on full pixel or direct sub-sampling pattern are widely used for HDTV application. However, these architectures suffer from either high complexity or quality loss problems. In this paper, an adaptive sub-sampling based reconfigurable architecture is given out. Firstly, by executing pixel difference analysis, the adaptive sub-sampling scheme which uses three hardware friendly patterns is applied on homogeneous macroblock (MB). Secondly, the related architecture introduces one more pipeline stage to build up configurable partial SAD values so that system performance is enhanced. Thirdly, a two-level pixel data organization scheme is proposed to solve data reuse and hardware utilization problems caused by adaptive algorithm. Moreover, one cross based SAD generation structure is introduced to achieve adaptive output results with less hardware cost. Experimental results show that, the proposed architecture can averagely save 61.71% clock cycles and accomplish twice or four times processing capability for homogeneous MBs. The maximum clock frequency is 208MHz under the TSMC 0.18μm technology in worst case conditions(1.62V, 125°C).
KW - H.264
KW - Reconfigurable Architecture
KW - VLSI
UR - http://www.scopus.com/inward/record.url?scp=70350607862&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70350607862&partnerID=8YFLogxK
U2 - 10.1145/1531542.1531648
DO - 10.1145/1531542.1531648
M3 - Conference contribution
AN - SCOPUS:70350607862
SN - 9781605585222
T3 - Proceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI
SP - 463
EP - 468
BT - GLSVLSI 2009 - Proceedings of the 2009 Great Lakes Symposium on VLSI
T2 - 19th ACM Great Lakes Symposium on VLSI, GLSVLSI '09
Y2 - 10 May 2009 through 12 May 2009
ER -