Recognising conversational speech: What an incremental ASR should do for a dialogue system and how to get there

Timo Baumann, Casey Kennington, Julian Hough, David Schlangen

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

22 Scopus citations

Abstract

Automatic speech recognition (ASR) is not only becoming increasingly accurate, but also increasingly adapted for producing timely, incremental output. However, overall accuracy and timeliness alone are insufficient for interactive dialogue systems, which require stability in the output and responsiveness to the utterance as it unfolds. Furthermore, for a dialogue system to achieve deep understanding of user utterances, phenomena such as disfluencies should be preserved or marked up for use by downstream components, such as language understanding, rather than filtered out. Similarly, word timing can be informative for analyzing deictic expressions in a situated environment and should be available for analysis. Here we investigate the overall accuracy and incremental performance of three widely used systems and discuss their suitability from the aforementioned perspectives. From the differing performance along these measures we provide a picture of the requirements for incremental ASR in dialogue systems and describe freely available tools for using and evaluating incremental ASR.

Original language: English
Title of host publication: Dialogues with Social Robots - Enablements, Analyses, and Evaluation
Editors: Kristiina Jokinen, Graham Wilcock
Publisher: Springer Verlag
Pages: 421-432
Number of pages: 12
ISBN (Print): 9789811025846
State: Published - 2017
Event: 7th International Workshop on Spoken Dialogue Systems, IWSDS 2016 - Saariselka, Finland
Duration: 13 Jan 2016 to 16 Jan 2016

Publication series

Name: Lecture Notes in Electrical Engineering
Volume: 427 LNEE
ISSN (Print): 1876-1100
ISSN (Electronic): 1876-1119

Conference

Conference: 7th International Workshop on Spoken Dialogue Systems, IWSDS 2016
Country/Territory: Finland
City: Saariselka
Period: 13/01/16 to 16/01/16

Keywords

  • Conversational speech
  • Evaluation
  • Incremental ASR
  • System requirements
