Study on automated audio descriptions overlapping live television commentary

Manon Ichiki*, Toshihiro Shimizu, Atsushi Imai, Tohru Takagi, Mamoru Iwabuchi, Kiyoshi Kurihara, Taro Miyazaki, Tadashi Kumano, Hiroyuki Kaneko, Shoei Sato, Nobumasa Seiyama, Yuko Yamanouchi, Hideki Sumiyoshi

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding - Conference contribution

3 Citations (Scopus)


We are conducting research on “automated audio description (AAD)”, which automatically generates audio descriptions from real-time competition data so that visually impaired people can enjoy live sports programs. However, AAD overlaps with the voice of the live television commentary, making both difficult to hear. In this paper, we first show that the game situation is conveyed effectively when visually impaired persons listen to the AAD alone. We then report the results of experiments addressing the overlap issue: (1) the optimum volume level differs between live commentary and AAD; (2) the ease of listening depends on the characteristics of the text-to-speech synthesizer used for AAD; and (3) playing back AAD through a speaker placed in a different position from the TV speaker makes both voices easier to listen to. These results offer clues that, depending on how AAD is presented, it can be made easy to listen to even when it overlaps the live television commentary.

Original language: English
Title of host publication: Computers Helping People with Special Needs - 16th International Conference, ICCHP 2018, Proceedings
Editors: Klaus Miesenberger, Georgios Kouroupetroglou
Number of pages: 5
ISBN (Print): 9783319942766
Publication status: Published - 2018 Jan 1
Externally published: Yes
Event: 16th International Conference on Computers Helping People with Special Needs, ICCHP 2018 - Linz, Austria
Duration: 2018 Jul 11 to 2018 Jul 13

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 10896 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349


Conference: 16th International Conference on Computers Helping People with Special Needs, ICCHP 2018


  • Audio description
  • Text-to-speech synthesizer
  • Visually impaired

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

