TY - JOUR
T1 - A bandwidth optimized, 64 cycles/MB joint parameter decoder architecture for ultra high definition H.264/AVC applications
AU - Zhou, Jinjia
AU - Zhou, Dajiang
AU - He, Xun
AU - Goto, Satoshi
PY - 2010
Y1 - 2010
N2 - In this paper, VLSI architecture of a joint parameter decoder is proposed to realize the calculation of motion vector (MV), intra prediction mode (IPM) and boundary strength (BS) for ultra high definition H.264/AVC applications. For this architecture, a 64-cycle-per-MB pipeline with simplified control modes is designed to increase system throughput and reduce hardware cost. Moreover, in order to save memory bandwidth, the data which includes the motion information for the co-located picture and the last decoded line, is pre-processed before being stored to DRAM. A partition based storage format is applied to condense the MB level data, while variable length coding based compression method is utilized to reduce the data size in each partition. Experimental results show our design is capable of real-time 3840×2160@60 fps decoding at less than 133 MHz, with 37.2 k logic gates. Meanwhile, by applying the proposed scheme, 85-98% bandwidth saving is achieved, compared with storing the original information for every 4 × 4 block to DRAM.
AB - In this paper, VLSI architecture of a joint parameter decoder is proposed to realize the calculation of motion vector (MV), intra prediction mode (IPM) and boundary strength (BS) for ultra high definition H.264/AVC applications. For this architecture, a 64-cycle-per-MB pipeline with simplified control modes is designed to increase system throughput and reduce hardware cost. Moreover, in order to save memory bandwidth, the data which includes the motion information for the co-located picture and the last decoded line, is pre-processed before being stored to DRAM. A partition based storage format is applied to condense the MB level data, while variable length coding based compression method is utilized to reduce the data size in each partition. Experimental results show our design is capable of real-time 3840×2160@60 fps decoding at less than 133 MHz, with 37.2 k logic gates. Meanwhile, by applying the proposed scheme, 85-98% bandwidth saving is achieved, compared with storing the original information for every 4 × 4 block to DRAM.
KW - DRAM bandwidth
KW - H.264/av c
KW - Motion vector derivation
KW - Ultra high resolution
KW - Video decoder
UR - http://www.scopus.com/inward/record.url?scp=77955386547&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77955386547&partnerID=8YFLogxK
U2 - 10.1587/transfun.E93.A.1425
DO - 10.1587/transfun.E93.A.1425
M3 - Article
AN - SCOPUS:77955386547
SN - 0916-8508
VL - E93-A
SP - 1425
EP - 1433
JO - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
JF - IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
IS - 8
ER -