Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Improving Prediction of Speech Activity Using Multi-Participant Respiratory State
Stockholm University, Faculty of Humanities, Department of Linguistics, Phonetics.ORCID iD: 0000-0003-3824-2980
Stockholm University, Faculty of Humanities, Department of Linguistics, Phonetics.ORCID iD: 0000-0002-0034-0924
Stockholm University, Faculty of Humanities, Department of Linguistics, Phonetics.ORCID iD: 0000-0003-4432-2602
2017 (English)In: Proceedings of Interspeech 2017 / [ed] Francisco Lacerda, David House, Mattias Heldner, Joakim Gustafson, Sofia Strömbergsson, Marcin Włodarczak, Stockholm: The International Speech Communication Association (ISCA), 2017, p. 1666-1670Conference paper, Published paper (Refereed)
Abstract [en]

One consequence of situated face-to-face conversation is the co- observability of participants’ respiratory movements and sounds. We explore whether this information can be exploited in pre- dicting incipient speech activity. Using a methodology called stochastic turn-taking modeling, we compare the performance of a model trained on speech activity alone to one additionally trained on static and dynamic lung volume features. The method- ology permits automatic discovery of temporal dependencies across participants and feature types. Our experiments show that respiratory information substantially lowers cross-entropy rates, and that this generalizes to unseen data. 

Place, publisher, year, edition, pages
Stockholm: The International Speech Communication Association (ISCA), 2017. p. 1666-1670
Series
Interspeech, E-ISSN 1990-9772
Keywords [en]
respiratory kinematics, interaction chronograms, stochastic turn-taking models
National Category
General Language Studies and Linguistics
Research subject
Phonetics
Identifiers
URN: urn:nbn:se:su:diva-145398DOI: 10.21437/Interspeech.2017-1176ISI: 000457505000344ISBN: 9781510848764 (print)OAI: oai:DiVA.org:su-145398DiVA, id: diva2:1128903
Conference
Interspeech 2017, Stockholm, Sweden, 20-24 August, 2017
Projects
Andning i samtal (Breathing in conversation)
Funder
Swedish Research Council, 2014-1072Available from: 2017-07-31 Created: 2017-07-31 Last updated: 2022-02-02Bibliographically approved

Open Access in DiVA

fulltext(309 kB)415 downloads
File information
File name FULLTEXT01.pdfFile size 309 kBChecksum SHA-512
3d977ae8c3af65b600a2ccd4c0add23594dcb9e276c05d210813558861affb1381ccaccbd32ca4e31841bceb23ff434b4dd057e78d35f244a41b051446075394
Type fulltextMimetype application/pdf

Other links

Publisher's full text

Authority records

Włodarczak, MarcinHeldner, MattiasAare, Kätlin

Search in DiVA

By author/editor
Włodarczak, MarcinHeldner, MattiasAare, Kätlin
By organisation
Phonetics
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 415 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 737 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf