CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Finding case through personal names in parallel texts
Stockholm University, Faculty of Humanities, Department of Linguistics.
2019 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

The aim of this study is to evaluate whether the ‘richness’ of the marking on personal names is an adequate indirect measure of a language’s case usage. The method uses parallel texts to identify, and group by lemma, names in over a thousand languages. These groupings are compared with data for case usage from a typological database for those languages for which it is available. This material is then used to test a method for assessing whether a language uses case or not. Results indicate that the maximum number of word types a proprial lemma is attested with in a text is a useful tool for inferring case usage. However, it only yielded clear results for a subset of the languages tested. It was not particularly useful for inferring the absence of case usage. Estimation of number of case categories was also performed. An entropy measure based on word types that a personal name lemma is attested with and the occurrences of these word types was used. It was found to be a fair indicator of number of case categories for languages, if somewhat inaccurate. Markings on languages which had no case were investigated. They were found to be of several types: pragmatic markers, non-case grammatical markers and case-like markers. Two languages with few markings on personal names and with case were investigated. They were found to not use any case marking on their personal names, but still use such markers on common nouns. This contrasts with a tentative generalization that this study is based on: ‘No languages have case marking exclusively in the domain of [personal names] or [common nouns].’ (Handschuh, 2017).

Abstract [sv]

Denna studies syfte är att utvärdera om ’formrikedomen’ hos personnamnslexem är ett fungerande indirekt sätt att undersöka språks kasussystem. Parallella texter användes för att namnen hitta personnamn och gruppera dem efter lexem i över ett tusen språk. För den delmängd av språken där data om deras kasussystem fanns tillgänglig så jämfördes denna med grupperingarna. Resultaten indikerar att det maximala antalet ordformstyper som ett namnlemma observerades i är ett användbart verktyg för att hitta språk som använder kasus, men bara för en delmängd av testade språk. Det var däremot sämre på att hitta språk som inte använder kasus. En entropiuppskattning som var baserat på antalet ordformstyper ett personnamnslemma hittades med och antalet förekomster av dessa ordformstyper användes. Det var en okej indikator för antalet kasuskategorier, dock med något bristande träffsäkerhet. Personnamnsmarkeringar på språk utan kasus undersöktes. De funna typerna av markeringar var pragmatiska, kasuslika, och grammatiska icke-kasus. Två språk med kasus, men med få personnamns, undersöktes. De använder inte kasusmarkering på personnamn, men på sina substantiv, vilket bröt mot en hypotetisk generalisering som denna studie baserades på: Att inga språk har kasusmarkeringar endast på personnamn eller endast på substantiv.

Place, publisher, year, edition, pages
2019. , p. 29
Keywords [en]
Case, Personal names, Parallel texts, Information entropy, Bible corpus, Indirect measurement
Keywords [sv]
Kasus, Personnamn, Parallella texter, Informationsentropi, Bibelkorpus, Indirekta mått
National Category
General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:su:diva-174831OAI: oai:DiVA.org:su-174831DiVA, id: diva2:1360457
Supervisors
Examiners
Available from: 2019-11-03 Created: 2019-10-13 Last updated: 2019-11-03Bibliographically approved

Open Access in DiVA

fulltext(1003 kB)6 downloads
File information
File name FULLTEXT01.pdfFile size 1003 kBChecksum SHA-512
a2920ecd4f77af938f1c39877294b316da9876ec15b42b8181a14fe686b22f21001a3f936302d1b7e5e6362447a347d94a2e66c9b1813f142dfbe135a50be2d7
Type fulltextMimetype application/pdf

By organisation
Department of Linguistics
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 6 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 37 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf