TY - GEN
T1 - ANOMALOUS SOUND DETECTION BASED ON ATTENTION MECHANISM
AU - Mori, Hayato
AU - Tamura, Satoshi
AU - Hayamizu, Satoru
N1 - Publisher Copyright:
© 2021 European Signal Processing Conference. All rights reserved.
PY - 2021
Y1 - 2021
N2 - For the automation of maintenance of mechanical facilities and devices, anomalous sound detection from machines has been explored. For these years, methods by machine learning and deep learning have been proposed for anomaly detection in various fields. Some deep-learning-based works calculate an anomaly score based on reconstruction errors obtained from an autoencoder model. However, the performance may not be sufficient, depending on characteristics of machines. In this study, we propose a method for detecting anomalous sounds using an autoencoder model with an attention-based mechanism. Given multiple frames of the log-scale mel spectrogram with a missing frame, our model computes the reconstruction error between an predicted frame and the removed frame as an abnormal score. We conducted experiments to compare our scheme to conventional ones, with visualizing attention weights. Our method achieved better performance, and it is found the missing frame can be well predicted using surrounds frames emphasized by the attention model. It is also found our approach can perform well independent on kind of machines and the number of input frames.
AB - For the automation of maintenance of mechanical facilities and devices, anomalous sound detection from machines has been explored. For these years, methods by machine learning and deep learning have been proposed for anomaly detection in various fields. Some deep-learning-based works calculate an anomaly score based on reconstruction errors obtained from an autoencoder model. However, the performance may not be sufficient, depending on characteristics of machines. In this study, we propose a method for detecting anomalous sounds using an autoencoder model with an attention-based mechanism. Given multiple frames of the log-scale mel spectrogram with a missing frame, our model computes the reconstruction error between an predicted frame and the removed frame as an abnormal score. We conducted experiments to compare our scheme to conventional ones, with visualizing attention weights. Our method achieved better performance, and it is found the missing frame can be well predicted using surrounds frames emphasized by the attention model. It is also found our approach can perform well independent on kind of machines and the number of input frames.
UR - http://www.scopus.com/inward/record.url?scp=85123213851&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85123213851&partnerID=8YFLogxK
U2 - 10.23919/EUSIPCO54536.2021.9616201
DO - 10.23919/EUSIPCO54536.2021.9616201
M3 - Conference contribution
AN - SCOPUS:85123213851
T3 - European Signal Processing Conference
SP - 581
EP - 585
BT - 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 29th European Signal Processing Conference, EUSIPCO 2021
Y2 - 23 August 2021 through 27 August 2021
ER -