Region-of-interest based H.264 encoder for videophone with a hardware macroblock level face detector

Tianruo Zhang*, Chen Liu, Minghui Wang, Satoshi Goto

*この研究の対応する著者

研究成果: Conference contribution

9 被引用数 (Scopus)

抄録

Region-of-interest (ROI) can be applied in H.264 video encoder to enhance subjective quality and reduce computation complexity. For the aiming application of low cost hardware real-time encoder in videophone with faces as ROI, this paper proposes a face detection algorithm to detect each macroblock (MB) as one part of a face or not. This face detection algorithm has a unique estimation-and-verification process and can be combined with a H.264 encoder by MB level pipeline architecture. 97.91% MBs in faces can be detected. VLSI architecture of proposed face detection algorithm is designed and an area of 4.3k gates is achieved. Power consumption is only 1.45mW at 100MHz. A ROI based H.264 encoder with dynamic parameters is proposed to enhance subjective quality and reduce the rate-distortion-optimization (RDO) complexity. The PSNR in ROI increases for 4.8dB under similar bit rate. Encoding time is reduced to 54.4% in videophone-like sequences.

本文言語English
ホスト出版物のタイトル2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
DOI
出版ステータスPublished - 2009
イベント2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09 - Rio De Janeiro
継続期間: 2009 10月 52009 10月 7

Other

Other2009 IEEE International Workshop on Multimedia Signal Processing, MMSP '09
CityRio De Janeiro
Period09/10/509/10/7

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ ネットワークおよび通信
  • コンピュータ ビジョンおよびパターン認識
  • 信号処理

フィンガープリント

「Region-of-interest based H.264 encoder for videophone with a hardware macroblock level face detector」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル