TY - GEN
T1 - Acoustic event detection with classifier chains
AU - Komatsu, Tatsuya
AU - Watanabe, Shinji
AU - Miyazaki, Koichi
AU - Hayashi, Tomoki
N1 - Publisher Copyright:
Copyright © 2021 ISCA.
PY - 2021
Y1 - 2021
N2 - This paper proposes acoustic event detection (AED) with classifier chains, a new classifier based on the probabilistic chain rule. The proposed AED with classifier chains consists of a gated recurrent unit and performs iterative binary detection of each event one by one. In each iteration, the event's activity is estimated and used to condition the next output based on the probabilistic chain rule to form classifier chains. Therefore, the proposed method can handle the interdependence among events upon classification, while the conventional AED methods with multiple binary classifiers with a linear layer and sigmoid function have placed an assumption of conditional independence. In the experiments with a real-recording dataset, the proposed method demonstrates its superior AED performance to a relative 14.80% improvement compared to a convolutional recurrent neural network baseline system with the multiple binary classifiers.
AB - This paper proposes acoustic event detection (AED) with classifier chains, a new classifier based on the probabilistic chain rule. The proposed AED with classifier chains consists of a gated recurrent unit and performs iterative binary detection of each event one by one. In each iteration, the event's activity is estimated and used to condition the next output based on the probabilistic chain rule to form classifier chains. Therefore, the proposed method can handle the interdependence among events upon classification, while the conventional AED methods with multiple binary classifiers with a linear layer and sigmoid function have placed an assumption of conditional independence. In the experiments with a real-recording dataset, the proposed method demonstrates its superior AED performance to a relative 14.80% improvement compared to a convolutional recurrent neural network baseline system with the multiple binary classifiers.
KW - Acoustic event detection
KW - Chain rule
KW - Classifier chains
KW - Multi-label classification
UR - http://www.scopus.com/inward/record.url?scp=85119479866&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119479866&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2021-2218
DO - 10.21437/Interspeech.2021-2218
M3 - Conference contribution
AN - SCOPUS:85119479866
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 46
EP - 50
BT - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
PB - International Speech Communication Association
T2 - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
Y2 - 30 August 2021 through 3 September 2021
ER -