TY - GEN
T1 - Faster responses are better responses
T2 - 9th International Workshop on Spoken Dialogue System Technology, IWSDS 2018
AU - Tsai, Vivian
AU - Baumann, Timo
AU - Pecune, Florian
AU - Cassell, Justine
N1 - Publisher Copyright: © Springer Nature Singapore Pte Ltd 2019.
PY - 2019
Y1 - 2019
AB - Speech-based interactive systems, such as virtual personal assistants, inevitably use complex architectures, with a multitude of modules working in series (or less often in parallel) to perform a task (e.g., giving personalized movie recommendations via dialog). Add modules for evoking and sustaining sociability with the user and the accumulation of processing latencies through the modules results in considerable turn-taking delays. We introduce incremental speech processing into the generation pipeline of the system to overcome this challenge with only minimal changes to the system architecture, through partial underspecification that is resolved as necessary. A user study with a sociable movie recommendation agent objectively diminishes turn-taking delays; furthermore, users not only rate the incremental system as more responsive, but also rate its recommendation performance as higher.
UR - http://www.scopus.com/inward/record.url?scp=85076135489&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076135489&partnerID=8YFLogxK
U2 - 10.1007/978-981-13-9443-0_10
DO - 10.1007/978-981-13-9443-0_10
M3 - Conference contribution
AN - SCOPUS:85076135489
SN - 9789811394423
T3 - Lecture Notes in Electrical Engineering
SP - 111
EP - 118
BT - 9th International Workshop on Spoken Dialogue System Technology, IWSDS 2018
A2 - D’Haro, Luis Fernando
A2 - Banchs, Rafael E.
A2 - Li, Haizhou
PB - Springer
Y2 - 18 April 2018 through 20 April 2018
ER -