TY - JOUR
T1 - A 530 Mpixels/s 4096×2160@60fps H.264/AVC high profile video decoder chip
AU - Zhou, Dajiang
AU - Zhou, Jinjia
AU - He, Xun
AU - Zhu, Jiayi
AU - Kong, Ji
AU - Liu, Peilin
AU - Goto, Satoshi
PY - 2011/4
Y1 - 2011/4
N2 - The increased resolution of Quad Full High Definition (QFHD) offers significantly enhanced visual experience. However, the corresponding huge data throughput of up to 530 Mpixels/s greatly challenges the design of real-time video decoder VLSI with the extensive requirement on both DRAM bandwidth and computational power. In this work, a lossless frame recompression technique and a partial MB reordering scheme are proposed to save the DRAM access of a QFHD video decoder chip. Besides, pipelining and parallelization techniques such as NAL/slice-parallel entropy decoding are implemented to efficiently enhance its computational power. The chip supporting H.264/AVC high profile is fabricated in 90 nm CMOS and verified. It delivers a maximum throughput of 4096×2160@60fps, which is at least 4.3 times higher than the state-of-the-art. DRAM bandwidth requirement is reduced by typically 51%, which fits the design into a 64-bit LPDDR SDRAM interface and results in 58% DRAM power saving. Meanwhile, the core energy is saved by 54% by pipelining and parallelization.
AB - The increased resolution of Quad Full High Definition (QFHD) offers significantly enhanced visual experience. However, the corresponding huge data throughput of up to 530 Mpixels/s greatly challenges the design of real-time video decoder VLSI with the extensive requirement on both DRAM bandwidth and computational power. In this work, a lossless frame recompression technique and a partial MB reordering scheme are proposed to save the DRAM access of a QFHD video decoder chip. Besides, pipelining and parallelization techniques such as NAL/slice-parallel entropy decoding are implemented to efficiently enhance its computational power. The chip supporting H.264/AVC high profile is fabricated in 90 nm CMOS and verified. It delivers a maximum throughput of 4096×2160@60fps, which is at least 4.3 times higher than the state-of-the-art. DRAM bandwidth requirement is reduced by typically 51%, which fits the design into a 64-bit LPDDR SDRAM interface and results in 58% DRAM power saving. Meanwhile, the core energy is saved by 54% by pipelining and parallelization.
KW - DRAM bandwidth
KW - embedded compression
KW - frame recompression
KW - H.264/AVC
KW - QFHD
KW - ultra high definition
KW - video decoder
UR - http://www.scopus.com/inward/record.url?scp=79953168363&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79953168363&partnerID=8YFLogxK
U2 - 10.1109/JSSC.2011.2109550
DO - 10.1109/JSSC.2011.2109550
M3 - Article
AN - SCOPUS:79953168363
SN - 0018-9200
VL - 46
SP - 777
EP - 788
JO - IEEE Journal of Solid-State Circuits
JF - IEEE Journal of Solid-State Circuits
IS - 4
M1 - 5727920
ER -