A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification

Ryo Mochizuki*, Tetsunori Kobayashi

*この研究の対応する著者

研究成果: Article査読

抄録

A low-band spectrum envelope reconstruction method was tested to see if it could improve the sound quality of F0 modified speech with the PSOLA (Pitch Synchronous Overlap Add) method. In the conventional PSOLA method, the extracted spectrum envelope using a Hanning window with two-pitch-period length had no reliable information in the band of frequencies lower than the original F0. This problem causes sound degradation of the F0 modified speech when the F0 is shifted downward. In the proposed method, the low-band spectrum envelope was properly modified according to the F0 modification rate. The amplitude of the F0 harmonic components in the low-band were reproduced based on the spectral tilt of the spectrum envelope. Subjective listening tests suggest the proposed method yields improved sound quality than the conventional TD-PSOLA method when the downward modification rate exceeds 0.4 octave.

本文言語English
ページ(範囲)2426-2429
ページ数4
ジャーナルIEICE Transactions on Information and Systems
E87-D
10
出版ステータスPublished - 2004 10月

ASJC Scopus subject areas

  • ソフトウェア
  • ハードウェアとアーキテクチャ
  • コンピュータ ビジョンおよびパターン認識
  • 電子工学および電気工学
  • 人工知能

フィンガープリント

「A low-band spectrum envelope reconstruction method for PSOLA-based F 0 modification」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル