TY - JOUR
T1 - Energy Compaction-Based Image Compression Using Convolutional AutoEncoder
AU - Cheng, Zhengxue
AU - Sun, Heming
AU - Takeuchi, Masaru
AU - Katto, Jiro
N1 - Funding Information:
Manuscript received October 22, 2018; revised May 31, 2019; accepted August 13, 2019. Date of publication August 29, 2019; date of current version March 24, 2020. This work was supported in part by the Japan Society for the Promotion of Science (JSPS) Research Fellowship DC2 under Grant 201914620, in part by JST, PRESTO under Grant JPMJPR19M5, Japan, in part by JSPS KAKENHI under Grant 15H01684, and in part by Waseda University Grant for Special Research Projects 2019Q-049. The associate editor coordinating the review of this manuscript and approving it for publication was Prof. Susanto Rahardja. (Corresponding author: Heming Sun.) Z. Cheng, M. Takeuchi, and J. Katto are with the Department of Computer Science and Communication Engineering, Graduate School of Fundamental Science and Engineering, Waseda University, Tokyo 169-8555, Japan (e-mail: zxcheng@asagi.waseda.jp; masaru-t@aoni.waseda.jp; katto@waseda.jp).
Publisher Copyright:
© 2019 IEEE.
PY - 2020/4
Y1 - 2020/4
N2 - Image compression has been an important research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and its use in image compression has gradually been increasing. In this paper, we present an energy compaction-based image compression architecture using a convolutional autoencoder (CAE) to achieve high coding efficiency. Our main contributions include three aspects: 1) we propose a CAE architecture for image compression by decomposing it into several down(up)sampling operations; 2) for our CAE architecture, we offer a mathematical analysis on the energy compaction property and we are the first work to propose a normalized coding gain metric in neural networks, which can act as a measurement of compression capability; 3) based on the coding gain metric, we propose an energy compaction-based bit allocation method, which adds a regularizer to the loss function during the training stage to help the CAE maximize the coding gain and achieve high compression efficiency. The experimental results demonstrate our proposed method outperforms BPG (HEVC-intra), in terms of the MS-SSIM quality metric. Additionally, we achieve better performance in comparison with existing bit allocation methods, and provide higher coding efficiency compared with state-of-the-art learning compression methods at high bit rates.
AB - Image compression has been an important research topic for many decades. Recently, deep learning has achieved great success in many computer vision tasks, and its use in image compression has gradually been increasing. In this paper, we present an energy compaction-based image compression architecture using a convolutional autoencoder (CAE) to achieve high coding efficiency. Our main contributions include three aspects: 1) we propose a CAE architecture for image compression by decomposing it into several down(up)sampling operations; 2) for our CAE architecture, we offer a mathematical analysis on the energy compaction property and we are the first work to propose a normalized coding gain metric in neural networks, which can act as a measurement of compression capability; 3) based on the coding gain metric, we propose an energy compaction-based bit allocation method, which adds a regularizer to the loss function during the training stage to help the CAE maximize the coding gain and achieve high compression efficiency. The experimental results demonstrate our proposed method outperforms BPG (HEVC-intra), in terms of the MS-SSIM quality metric. Additionally, we achieve better performance in comparison with existing bit allocation methods, and provide higher coding efficiency compared with state-of-the-art learning compression methods at high bit rates.
KW - Image compression
KW - convolutional autoencoder
KW - energy compaction
KW - optimum bit allocation
UR - http://www.scopus.com/inward/record.url?scp=85082885024&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082885024&partnerID=8YFLogxK
U2 - 10.1109/TMM.2019.2938345
DO - 10.1109/TMM.2019.2938345
M3 - Article
AN - SCOPUS:85082885024
SN - 1520-9210
VL - 22
SP - 860
EP - 873
JO - IEEE Transactions on Multimedia
JF - IEEE Transactions on Multimedia
IS - 4
M1 - 8820051
ER -