Man-machine interaction using a vision system with dual viewing angles

Ying Jieh Huang*, Hiroshi Dohi, Mitsurti Ishizuka


研究成果: Article査読

1 被引用数 (Scopus)


This paper describes a vision system with dual viewing angles, i.e., wide and narrow viewing angles, and a scheme of user-friendly speech dialogue environment based on the vision system. The wide viewing angle provides a wide viewing field for wide range motion tracking, and the narrow viewing angle is capable of following a target in wide viewing field to take the image of the target with sufficient resolution. For a fast and robust motion tracking, modified motion energy (MME) and existence energy (££) arc defined to detect the motion of the target and extract the motion region at the same time. Instead of using a physical device such as a foot switch commonly used in speech dialogue systems, the begin/end of an utterance is detected from the movement of user's mouth in our system. Without recognizing the movement of lips directly, the shape variation of the region between lips is tracked for more stable recognition of the span of a dialogue. The tracking speed is about 10 frames/sec when no recognition is performed and about 5 frames/sec when both tracking and recognition are performed without using any special hardware.

ジャーナルIEICE Transactions on Information and Systems
出版ステータスPublished - 1997

ASJC Scopus subject areas

  • 情報システム
  • コンピュータ グラフィックスおよびコンピュータ支援設計
  • ソフトウェア


「Man-machine interaction using a vision system with dual viewing angles」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。