TY - JOUR
T1 - Enhanced Intra Prediction for Video Coding by Using Multiple Neural Networks
AU - Sun, Heming
AU - Cheng, Zhengxue
AU - Takeuchi, Masaru
AU - Katto, Jiro
N1 - Funding Information:
Manuscript received May 8, 2019; revised November 12, 2019; accepted December 12, 2019. Date of publication January 1, 2020; date of current version October 23, 2020. This work was supported in part by JST, PRESTO under Grant JPMJPR19M5, Japan, in part by a research fund from Fujitsu, in part by JSPS KAKENHI under Grant 15H01684, in part by Waseda University Grant for Special Research Projects 2019Q-049. (Corresponding author: Zhengxue Cheng.) H. Sun is with the Waseda Research Institute for Science and Engineering, Tokyo 169-8555, Japan, and also with the JST, PRESTO, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan (e-mail: hemingsun@aoni.waseda.jp).
Publisher Copyright:
© 1999-2012 IEEE.
PY - 2020/11
Y1 - 2020/11
N2 - This paper enhances the intra prediction by using multiple neural network modes (NM). Each NM serves as an end-To-end mapping from the neighboring reference blocks to the current coding block. For the provided NMs, we present two schemes (appending and substitution) to integrate the NMs with the traditional modes (TM) defined in high efficiency video coding (HEVC). For the appending scheme, each NM is corresponding to a certain range of TMs. The categorization of TMs is based on the expected prediction errors. After determining the relevant TMs for each NM, we present a probability-Aware mode signaling scheme. The NMs with higher probabilities to be the best mode are signaled with fewer bits. For the substitution scheme, we propose to replace the highest and lowest probable TMs. New most probable mode (MPM) generation method is also employed when substituting the lowest probable TMs. Experimental results demonstrate that using multiple NMs will improve the coding efficiency apparently compared with the single NM. Specifically, proposed appending scheme with seven NMs can save 2.6%, 3.8%, and 3.1% BD-rate for Y, U, and V components compared with using single NM in the state-of-The-Art works.
AB - This paper enhances the intra prediction by using multiple neural network modes (NM). Each NM serves as an end-To-end mapping from the neighboring reference blocks to the current coding block. For the provided NMs, we present two schemes (appending and substitution) to integrate the NMs with the traditional modes (TM) defined in high efficiency video coding (HEVC). For the appending scheme, each NM is corresponding to a certain range of TMs. The categorization of TMs is based on the expected prediction errors. After determining the relevant TMs for each NM, we present a probability-Aware mode signaling scheme. The NMs with higher probabilities to be the best mode are signaled with fewer bits. For the substitution scheme, we propose to replace the highest and lowest probable TMs. New most probable mode (MPM) generation method is also employed when substituting the lowest probable TMs. Experimental results demonstrate that using multiple NMs will improve the coding efficiency apparently compared with the single NM. Specifically, proposed appending scheme with seven NMs can save 2.6%, 3.8%, and 3.1% BD-rate for Y, U, and V components compared with using single NM in the state-of-The-Art works.
KW - High efficiency video coding (HEVC)
KW - intra prediction
KW - neural network
KW - probability
UR - http://www.scopus.com/inward/record.url?scp=85077379644&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85077379644&partnerID=8YFLogxK
U2 - 10.1109/TMM.2019.2963620
DO - 10.1109/TMM.2019.2963620
M3 - Article
AN - SCOPUS:85077379644
SN - 1520-9210
VL - 22
SP - 2764
EP - 2779
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 11
M1 - 8947942
ER -