TY - JOUR
T1 - A Fast QTMT Partition Decision Strategy for VVC Intra Prediction
AU - Fan, Yibo
AU - Chen, Jun'An
AU - Sun, Heming
AU - Katto, Jiro
AU - Jing, Ming'E
N1 - Funding Information:
This work was supported in part by Fudan University Key Laboratory Senior Visiting Scholarship, in part by the National Natural Science Foundation of China under Grant 61674041, in part by Alibaba Innovative Research (AIR) Program, in part by IBM Faculty Award, in part by the Innovation Program of Shanghai Municipal Education Commission, in part by the Pioneering Project of Academy for Engineering and Technology, in part by the Fudan-CIOMP (Changchun Institute of Optics, Fine Mechanics and Physics) joint fund, and in part by JST, PRESTO under Grant JPMJPR19M5, Japan.
Publisher Copyright:
© 2013 IEEE.
PY - 2020
Y1 - 2020
N2 - Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.
AB - Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.
KW - Asymmetric block size
KW - fast partition decision
KW - intra prediction
KW - quadtree with multi-type tree
KW - versatile video coding
UR - http://www.scopus.com/inward/record.url?scp=85086988978&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086988978&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2020.3000565
DO - 10.1109/ACCESS.2020.3000565
M3 - Article
AN - SCOPUS:85086988978
SN - 2169-3536
VL - 8
SP - 107900
EP - 107911
JO - IEEE Access
JF - IEEE Access
M1 - 9110597
ER -