Semi-supervised attention based merging network with hybrid dilated convolution module for few-shot HDR video reconstruction

Fengshan Zhao*, Qin Liu, Takeshi Ikenaga

*この研究の対応する著者

研究成果: Article査読

抄録

Deep learning based methods for high dynamic range (HDR) video reconstruction require collecting large-scale HDR video dataset with ground truth which is time-consuming. Recent training strategies under the few-shot learning paradigm, which aim to build an effective model upon only a few labeled samples, have shown success in image classification and image segmentation. In this paper, a semi-supervised learning based framework for few-shot HDR video reconstruction is proposed. An attention based merging network with the hybrid dilated convolution module is used to recover missing contents and remove artifacts. The hybrid dilated convolution module extracts additional features from ill-exposed regions and the attention module corrects them to suppress harmful information. In the semi-supervised framework, designed training losses for the supervised branch and the unsupervised branch are utilized to constrain the network during training under the few-shot scenario. Experimental results show that the proposed method trained with only 5 labeled samples and 45 unlabeled samples achieves a PSNR score of 41.664dB on synthetic evaluation dataset, compared with 35.201dB which is the best score among supervised methods trained in the same few-shot condition.

本文言語English
ページ(範囲)37409-37430
ページ数22
ジャーナルMultimedia Tools and Applications
83
13
DOI
出版ステータスPublished - 2024 4月

ASJC Scopus subject areas

  • ソフトウェア
  • メディア記述
  • ハードウェアとアーキテクチャ
  • コンピュータ ネットワークおよび通信

フィンガープリント

「Semi-supervised attention based merging network with hybrid dilated convolution module for few-shot HDR video reconstruction」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル