TY - GEN
T1 - Speech recognition of a named entity
AU - Tomita, Tatsuhiko
AU - Okimoto, Yoshiyuki
AU - Yamamoto, Hirofumi
AU - Sagisaka, Yoshinori
PY - 2005
Y1 - 2005
N2 - A hierarchical language model is newly applied to identify a named entity consisting of multiple word sequences for continuous speech recognition. By redesigning an out-of-vocabulary model of a single word using phonotactic constraints for a named entity, a hierarchical model is composed harmoniously with conventional word and word-class N-grams. Continuous speech recognition experiments aiming at movie-title identification showed the effectiveness of this modeling in the task of inquiries on these titles. These results ensure that the proposed hierarchical language modeling architecture is applicable to multiple word successions for speech recognition to cope with unregistered expressions and enables the mix use of different statistics harmoniously.
AB - A hierarchical language model is newly applied to identify a named entity consisting of multiple word sequences for continuous speech recognition. By redesigning an out-of-vocabulary model of a single word using phonotactic constraints for a named entity, a hierarchical model is composed harmoniously with conventional word and word-class N-grams. Continuous speech recognition experiments aiming at movie-title identification showed the effectiveness of this modeling in the task of inquiries on these titles. These results ensure that the proposed hierarchical language modeling architecture is applicable to multiple word successions for speech recognition to cope with unregistered expressions and enables the mix use of different statistics harmoniously.
UR - http://www.scopus.com/inward/record.url?scp=33646768658&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33646768658&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2005.1415299
DO - 10.1109/ICASSP.2005.1415299
M3 - Conference contribution
AN - SCOPUS:33646768658
SN - 0780388747
SN - 9780780388741
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - I1057-I1060
BT - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05 - Proceedings - Image and Multidimensional Signal Processing Multimedia Signal Processing
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '05
Y2 - 18 March 2005 through 23 March 2005
ER -