Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Evaluating normalization accounts against the dense vowel space of Central Swedish
Stockholms universitet, Humanistiska fakulteten, Institutionen för svenska och flerspråkighet, Svenska/Nordiska språk.ORCID-id: 0000-0001-5226-8568
Antal upphovsmän: 22023 (Engelska)Ingår i: Frontiers in Psychology, E-ISSN 1664-1078, Vol. 14, artikel-id 1165742Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Talkers vary in the phonetic realization of their vowels. One influential hypothesis holds that listeners overcome this inter-talker variability through pre-linguistic auditory mechanisms that normalize the acoustic or phonetic cues that form the input to speech recognition. Dozens of competing normalization accounts exist-including both accounts specific to vowel perception and general purpose accounts that can be applied to any type of cue. We add to the cross-linguistic literature on this matter by comparing normalization accounts against a new phonetically annotated vowel database of Swedish, a language with a particularly dense vowel inventory of 21 vowels differing in quality and quantity. We evaluate normalization accounts on how they differ in predicted consequences for perception. The results indicate that the best performing accounts either center or standardize formants by talker. The study also suggests that general purpose accounts perform as well as vowel-specific accounts, and that vowel normalization operates in both temporal and spectral domains.

Ort, förlag, år, upplaga, sidor
2023. Vol. 14, artikel-id 1165742
Nyckelord [en]
vowel normalization, ideal observers, speech production, speech perception, category separability
Nationell ämneskategori
Psykologi Jämförande språkvetenskap och allmän lingvistik
Identifikatorer
URN: urn:nbn:se:su:diva-221128DOI: 10.3389/fpsyg.2023.1165742ISI: 001022324900001PubMedID: 37416548Scopus ID: 2-s2.0-85164513909OAI: oai:DiVA.org:su-221128DiVA, id: diva2:1797754
Tillgänglig från: 2023-09-15 Skapad: 2023-09-15 Senast uppdaterad: 2024-10-14Bibliografiskt granskad
Ingår i avhandling
1. Comparing theories of pre-linguistic normalization for vowel perception
Öppna denna publikation i ny flik eller fönster >>Comparing theories of pre-linguistic normalization for vowel perception
2024 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

The present thesis compares competing theories of pre-linguistic normalization for the perception of Swedish and English vowels. Specifically, the overall aim is to investigate whether normalization might be key to understanding the mechanisms supporting robust cross-talker perception, and to gain more insights into the specific computations involved. The thesis is based on three articles that employ acoustic analysis, behavioral experiments and computational modeling to address the question of vowel normalization.

Article I uses a novel phonetically annotated database of Swedish vowel recordings, the SwehVd, to provide an updated acoustic description of the Central Swedish vowel system and to evaluate certain claims of cue-to-category mappings introduced by previous work. Replicating previous studies, the results of Article I suggest that F1, F2 and vowel duration are the most important cues to vowel identity in Central Swedish. In addition, the results highlight the importance of formant dynamics for reliable category distinctions. The acoustic characteristics of Article I further constitute the input to the computational modeling presented in Article II.

Article II evaluates 15 competing normalization accounts in terms of how well they predict the intended vowel category of Central Swedish, as represented by the talkers in SwehVd. Specifically, a computational model of vowel perception, a Bayesian ideal observer, is used to assess the predicted consequences of normalization. The results indicate that normalization accounts that assume the learning and storing of talker-specific acoustics (i.e., extrinsic accounts) achieve the best fit against vowel production data. The evaluation against the SwehVd database further contributes to the insight that languages with dense vowel spaces do not necessarily require more complex normalization mechanisms.

Article III evaluates 20 different normalization accounts in how well they predict listeners' categorization behavior in two vowel categorization experiments on US English vowels. Paralleling the results from Article II, the results indicate that more complex extrinsic normalization is needed for robust cross-talker perception. However, it is a computationally minimalist extrinsic account – uniform scaling – that provides the best fit when evaluated against listeners' responses. This would seem to suggest that more complex computations (as in, e.g., Lobanov normalization) are not required for human speech perception.

The thesis aimed for a broad-scale evaluation of competing theories of pre-linguistic normalization, assessing the predictions of different accounts using different types of experiment stimuli, different vowel spaces, and different sets of acoustic cues. This broad-scale evaluation was made possible through the implementation of a holistic and stringent computational framework, for an unbiased comparison of accounts. The main contributions of this thesis include the open-access publication of the framework and the vowel database, to facilitate replication and future studies.

Ort, förlag, år, upplaga, sidor
Stockholm: Department of Swedish Language and Multilingualism, Stockholm University, 2024. s. 82
Serie
Stockholm studies in Scandinavian philology, ISSN 0562-1097 ; 73
Nyckelord
vowels, speech perception, formants, normalization, ideal observers, spectral acoustics
Nationell ämneskategori
Jämförande språkvetenskap och allmän lingvistik
Forskningsämne
nordiska språk
Identifikatorer
urn:nbn:se:su:diva-234282 (URN)978-91-8014-973-0 (ISBN)978-91-8014-974-7 (ISBN)
Disputation
2024-11-29, hörsal 3, hus B, Universitetsvägen 10, Stockholm, 13:00 (Engelska)
Opponent
Handledare
Tillgänglig från: 2024-11-06 Skapad: 2024-10-14 Senast uppdaterad: 2024-10-28Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextPubMedScopus

Person

Persson, Anna

Sök vidare i DiVA

Av författaren/redaktören
Persson, AnnaJaeger, T. Florian
Av organisationen
Svenska/Nordiska språk
I samma tidskrift
Frontiers in Psychology
PsykologiJämförande språkvetenskap och allmän lingvistik

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 134 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf