Classification of affect in speech using normalized time-frequency cepstra
Stockholm University, Faculty of Social Sciences, Department of Psychology.
2010 (English). In: Speech Prosody 2010, 2010, 100071-1-4 p.
Conference paper, Published paper (Refereed)
Abstract [en]

Subtle temporal and spectral differences between categorical realizations of paralinguistic phenomena (e.g., affective vocal expressions) are hard to capture and describe. In this paper we present a signal representation based on Time Varying Constant-Q Cepstral Coefficients (TVCQCC) derived for this purpose. A method that uses the special properties of the constant-Q transform for mean F0 estimation and normalization is described. The coefficients are invariant to segment length, and, as a special case, a representation for prosody is considered. Speaker-independent classification results using ν-SVM are reported for the Berlin EMO-DB and for two closed sets of basic (anger, disgust, fear, happiness, sadness, neutral) and social/interpersonal (affection, pride, shame) emotions recorded by forty professional actors from two English dialect areas. The accuracy for the Berlin EMO-DB is 71.2%; for the first set, comprising basic emotions, it is 44.6%, and for the second set, comprising basic and social emotions, it is 31.7%. F0 normalization was found to boost performance, and a combined feature set performed best.
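
The abstract does not spell out how the constant-Q transform supports F0 normalization, so the following is only an illustrative sketch and not the authors' implementation. It relies on a general property of constant-Q analysis: centre frequencies are geometrically spaced, so scaling F0 by a factor translates the spectrum by a fixed number of bins. All function names, the reference frequency, and the parameter values are hypothetical.

```python
import math

def cq_center_frequencies(f_min, bins_per_octave, n_bins):
    """Geometrically spaced centre frequencies of a constant-Q filter bank."""
    return [f_min * 2.0 ** (k / bins_per_octave) for k in range(n_bins)]

def f0_shift_in_bins(f0_hz, f_ref_hz, bins_per_octave):
    """Number of constant-Q bins between a mean F0 and a reference frequency."""
    return round(bins_per_octave * math.log2(f0_hz / f_ref_hz))

def normalize_spectrum(log_spectrum, shift, fill=0.0):
    """Translate a constant-Q spectrum down by `shift` bins (up if negative).

    Bins that have no source value after the translation are set to `fill`.
    """
    n = len(log_spectrum)
    out = [fill] * n
    for k in range(n):
        src = k + shift
        if 0 <= src < n:
            out[k] = log_spectrum[src]
    return out

# Toy example (assumed values): 12 bins per octave starting at 55 Hz.
freqs = cq_center_frequencies(55.0, 12, 48)
spectrum = [0.0] * 48
spectrum[24] = 1.0                             # single peak at 220 Hz
shift = f0_shift_in_bins(220.0, 110.0, 12)     # one octave above the reference
normalized = normalize_spectrum(spectrum, shift)
peak_bin = normalized.index(max(normalized))   # bin of the 110 Hz reference
```

After the translation, the peak sits at the bin of the reference frequency regardless of the speaker's original mean F0, which is the sense in which such a representation can be made F0-normalized.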

Place, publisher, year, edition, pages
2010. 100071-1-4 p.
National Category
Psychology
Research subject
Psychology
Identifiers
URN: urn:nbn:se:su:diva-47381
ISBN: 978-0-557-51931-6 (print)
OAI: oai:DiVA.org:su-47381
DiVA: diva2:373844
Conference
Speech Prosody 2010
Funder
Swedish Research Council, 2006-1360
Note
This work was partly funded by the Swedish Research Council under contract 2006-1360.
Available from: 2010-12-01 Created: 2010-12-01 Last updated: 2010-12-08 Bibliographically approved

Search in DiVA

By author/editor
Laukka, Petri
By organisation
Department of Psychology
Psychology

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 34 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf