Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Computing the fundamental frequency variation spectrum in conversational spoken dialogue systems
KTH Speech, Music and Hearing. (Speech group)
KTH Speech, Music and Hearing. (Speech group)ORCID iD: 0000-0002-0034-0924
KTH Speech, Music and Hearing. (Speech group)ORCID iD: 0000-0001-9327-9482
2008 (English)In: Proceedings of the 155th Meeting of the Acoustical Society of America, 5th EAA Forum Acusticum, and 9th SFA Congrés Français d'Acoustique (Acoustics2008), Paris, France: ASA , 2008, 3305-3310 p.Conference paper, Published paper (Refereed)
Abstract [en]

Continuous modeling of intonation in natural speech has long been hampered by a focus on modeling pitch, of which several normative aspects are particularly problematic. The latter include, among others, the fact that pitch is undefined in unvoiced segments, that its absolute magnitude is speaker-specific, and that its robust estimation and modeling, at a particular point in time, rely on a patchwork of long-time stability heuristics. In the present work, we continue our analysis of the fundamental frequency variation (FFV) spectrum, a recently proposed instantaneous, continuous, vector-valued representation of pitch variation, which is obtained by comparing the harmonic structure of the frequency magnitude spectra of the left and right half of an analysis frame. We analyze the sensitivity of a task-specific error rate in a conversational spoken dialogue system to the specific definition of the left and right halves of a frame, resulting in operational recommendations regarding the framing policy and window shape.

Place, publisher, year, edition, pages
Paris, France: ASA , 2008. 3305-3310 p.
National Category
Computer Science General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:su:diva-64371OAI: oai:DiVA.org:su-64371DiVA: diva2:457303
Conference
155th Meeting of the Acoustical Society of America, 5th EAA Forum Acusticum, and 9th SFA Congrés Français d'Acoustique (Acoustics2008)
Note

June 29th- July 4th

Available from: 2011-11-17 Created: 2011-11-17 Last updated: 2014-11-06

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Heldner, MattiasEdlund, Jens
Computer ScienceGeneral Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 28 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf