Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Prosodic adaptation in human-computer interaction
KTH Speech, Music and Hearing. (Speech group)
KTH Speech, Music and Hearing. (Speech group)
KTH Speech, Music and Hearing. (Speech group)ORCID iD: 0000-0002-0034-0924
2003 (English)In: Proceedings ICPhS 2003, Barcelona, Spain: ISCA , 2003, 2453-2456 p.Conference paper, Published paper (Refereed)
Abstract [en]

State-of-the-art speech recognizers are trained on predominantly normal speech and have difficulties handling either exceedingly slow and hyperarticulated or fast and sloppy speech. Explicitly instructing users on how to speak, however, can make the human–computer interaction stilted and unnatural. If it is possible to affect users’ speaking rate while maintaining the naturalness of the dialogue, this could prove useful in the development of future human–computer interfaces. Users could thus be subtly influenced to adapt their speech to better match the current capabilities of the system, so that errors can be reduced and the overall quality of the human–computer interaction is improved. At the same time, speakers are allowed to express themselves freely and naturally. In this article, we investigate whether people adapt their speech as they interact with an animated character in a simulated spoken dialogue system. A user experiment involving 16 subjects was performed to examine whether people who speak with a simulated dialogue system adapt their speaking rate to that of the system. The experiment confirmed that the users adapted to the speaking rate of the system, and no subjects afterwards seemed to be aware they had been affected in this way. Another finding was that speakers varied their speaking rate substantially in the course of the dialogue. In particular, problematic sequences where subjects had to repeat or rephrase the same utterance several times elicited slower speech.

Place, publisher, year, edition, pages
Barcelona, Spain: ISCA , 2003. 2453-2456 p.
National Category
General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:su:diva-64399OAI: oai:DiVA.org:su-64399DiVA: diva2:458528
Conference
ICPhS 2003
Available from: 2011-11-23 Created: 2011-11-17 Last updated: 2014-04-11Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Heldner, Mattias
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 36 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf