TY - GEN
T1 - Acoustic Application of Phase Reconstruction Algorithms in Optics
AU - Kobayashi, Tomoki
AU - Tanaka, Tomoro
AU - Yatabe, Kohei
AU - Oikawa, Yasuhiro
N1 - Publisher Copyright: © 2022 IEEE
PY - 2022
Y1 - 2022
AB - Phase reconstruction from amplitude spectrograms has recently attracted attention in acoustics because of its potential applications in speech synthesis and enhancement. The best-known algorithm in acoustics is based on alternating projection and is called the Griffin-Lim algorithm (GLA). In optics, the same algorithm is known as the Gerchberg-Saxton algorithm, and many variants of it have been proposed independently of those in acoustics. In this paper, we propose to apply phase reconstruction algorithms developed in the optics community to acoustic applications and evaluate them using acoustical metrics. Specifically, we apply the averaged alternating reflections (AAR), relaxed AAR (RAAR), and hybrid input-output (HIO) algorithms to acoustic signals. Our experimental results suggest that RAAR is promising for acoustic applications because it clearly outperformed GLA.
KW - alternating projection
KW - Griffin-Lim algorithm
KW - phase retrieval
KW - reflection operator
KW - short-time Fourier transform
UR - http://www.scopus.com/inward/record.url?scp=85130481756&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130481756&partnerID=8YFLogxK
U2 - 10.1109/ICASSP43922.2022.9747423
DO - 10.1109/ICASSP43922.2022.9747423
M3 - Conference contribution
AN - SCOPUS:85130481756
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 6212
EP - 6216
BT - 2022 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022
Y2 - 23 May 2022 through 27 May 2022
ER -