TY - GEN
T1 - Confidence estimation and keyword extraction from speech recognition result based on Web information
AU - Kensuke, Hara
AU - Hideki, Sekiya
AU - Tetsuya, Kawase
AU - Satoshi, Tamura
AU - Satoru, Hayamizu
PY - 2013
Y1 - 2013
N2 - This paper proposes to use Web information for confidence measure and to extract keywords for speech recognition results. Spoken document processing has been attracting attention particularly for information retrieval and video (audiovisual) content systems. For example, measuring a confidence score which indicates how likely a document or a segmented document includes recognition errors has been studied. It is well known keyword extraction from recognition results is also an important issue. For these purposes, in this paper, pointwise mutual information (PMI) between two words is employed. PMI has been used to calculate a confidence measure of speech recognition, as a coherence measure by co-occurrence of words. We propose to further improve the method by using a Web query expansion technique with term triplets which consist of nouns in the same document. We also apply PMI to keyword estimation by summing a co-occurrence score (sumPMI) between a targeting keyword candidate and each term. The proposed methods were tested with 10 lectures in Corpus of Spontaneous Japanese (CSJ) and 2 simulated movie dialogues. In the experiments it is shown that the estimated confidence score has high relationship with recognition accuracy, indicating the effectiveness of our method. And sumPMI scores for keywords have higher values in the subjective tests.
AB - This paper proposes to use Web information for confidence measure and to extract keywords for speech recognition results. Spoken document processing has been attracting attention particularly for information retrieval and video (audiovisual) content systems. For example, measuring a confidence score which indicates how likely a document or a segmented document includes recognition errors has been studied. It is well known keyword extraction from recognition results is also an important issue. For these purposes, in this paper, pointwise mutual information (PMI) between two words is employed. PMI has been used to calculate a confidence measure of speech recognition, as a coherence measure by co-occurrence of words. We propose to further improve the method by using a Web query expansion technique with term triplets which consist of nouns in the same document. We also apply PMI to keyword estimation by summing a co-occurrence score (sumPMI) between a targeting keyword candidate and each term. The proposed methods were tested with 10 lectures in Corpus of Spontaneous Japanese (CSJ) and 2 simulated movie dialogues. In the experiments it is shown that the estimated confidence score has high relationship with recognition accuracy, indicating the effectiveness of our method. And sumPMI scores for keywords have higher values in the subjective tests.
UR - http://www.scopus.com/inward/record.url?scp=84893247656&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893247656&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2013.6694114
DO - 10.1109/APSIPA.2013.6694114
M3 - Conference contribution
AN - SCOPUS:84893247656
SN - 9789869000604
T3 - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
BT - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
T2 - 2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2013
Y2 - 29 October 2013 through 1 November 2013
ER -