抄録
This paper presents an interactive humanoid robot that can moderate a multi-player fastest-voice-first-type quiz game by leveraging state-of-the-art robot audition techniques such as sound source localization and separation and speech recognition. In this game, a player who says 'Yes' first gets a right to answer a question, and players are allowed to barge in a questionary utterance of the quizmaster. The robot needs to identify which player says 'Yes' first, even if multiple players respond at almost exactly the same time, and must judge the correctness of the answer given by the player. To enable natural human-robot interaction, we believe that the robot should use its own microphones (i.e., ears) embedded in the head, rather than having pin microphones attached to individual players. In this paper we use a robot audition system called HARK for separating the mixture of audio signals recorded by the ears into multiple source signals (i.e., almost the simultaneous utterances of 'Yes' and the questionary utterance) and estimating the direction of each source. To judge the correctness of an answer, we use a speech recognizer called Julius. Experimental results showed that our robot can correctly identify which player spoke first when the players' utterances differed by 60 msec.
本文言語 | English |
---|---|
ホスト出版物のタイトル | IEEE-RAS International Conference on Humanoid Robots |
出版社 | IEEE Computer Society |
ページ | 967-972 |
ページ数 | 6 |
巻 | 2015-February |
ISBN(印刷版) | 9781479971749 |
DOI | |
出版ステータス | Published - 2015 2月 12 |
イベント | 2014 14th IEEE-RAS International Conference on Humanoid Robots, Humanoids 2014 - Madrid, Spain 継続期間: 2014 11月 18 → 2014 11月 20 |
Other
Other | 2014 14th IEEE-RAS International Conference on Humanoid Robots, Humanoids 2014 |
---|---|
国/地域 | Spain |
City | Madrid |
Period | 14/11/18 → 14/11/20 |
ASJC Scopus subject areas
- 人工知能
- コンピュータ ビジョンおよびパターン認識
- ハードウェアとアーキテクチャ
- 人間とコンピュータの相互作用
- 電子工学および電気工学