TY - GEN
T1 - Data collection for mobile audio-visual speech recognition in various environments
AU - Tamura, Satoshi
AU - Seko, Takumi
AU - Hayamizu, Satoru
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/2/27
Y1 - 2014/2/27
N2 - This paper introduces our recent activities for audio-visual speech recognition on mobile devices and data collection in various environments. Audio-visual automatic speech recognition is effective in noisy or real conditions to enhance the robustness of speech recognizer and to improve the recognition accuracy. We have developed an audio-visual speech recognition interface for mobile devices. In order to evaluate the recognizer and investigate issues related to audio-visual processing on mobile computers, we collected speech data and lip images of 16 subjects in eight conditions, where there were various audio noises and visual difficulties. Audio-only speech recognition and visual-only lipreading were then conducted. Through these experiments, we found some issues and future works not only for construction of audio-visual database but also for robust audio-visual speech recognition.
AB - This paper introduces our recent activities for audio-visual speech recognition on mobile devices and data collection in various environments. Audio-visual automatic speech recognition is effective in noisy or real conditions to enhance the robustness of speech recognizer and to improve the recognition accuracy. We have developed an audio-visual speech recognition interface for mobile devices. In order to evaluate the recognizer and investigate issues related to audio-visual processing on mobile computers, we collected speech data and lip images of 16 subjects in eight conditions, where there were various audio noises and visual difficulties. Audio-only speech recognition and visual-only lipreading were then conducted. Through these experiments, we found some issues and future works not only for construction of audio-visual database but also for robust audio-visual speech recognition.
UR - http://www.scopus.com/inward/record.url?scp=84949923993&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949923993&partnerID=8YFLogxK
U2 - 10.1109/ICSDA.2014.7051434
DO - 10.1109/ICSDA.2014.7051434
M3 - Conference contribution
AN - SCOPUS:84949923993
T3 - Oriental COCOSDA 2014 - 17th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment / CASLRE (Conference on Asian Spoken Language Research and Evaluation)
BT - Oriental COCOSDA 2014 - 17th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment / CASLRE (Conference on Asian Spoken Language Research and Evaluation)
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment, Oriental COCOSDA 2014
Y2 - 10 September 2014 through 12 September 2014
ER -