Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Automatic Induction of Word Classes in Swedish Sign Language
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics.ORCID iD: 0000-0003-1149-2517
2013 (English)Independent thesis Advanced level (degree of Master (One Year)), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

Identifying word classes is an important part of describing a language. Research about sign languages often lack distinctions crucial for identifying word classes, e.g. the difference between sign and gesture. Additionally, sign languages typically lack written form, something that often constrains quantitative research on sign language to the use of glosses translated to the spoken language in the area. In this thesis, such glosses have been extracted from The Swedish Sign Language Corpus. The glosses were mapped to utterances based on Swedish translations in the corpus, and these utterances served as input data to a word space model, producing a co-occurence matrix. This matrix was clustered with the K-means algorithm. The extracted utterances were also clustered with the Brown algorithm. By using V-measure, the clusters were compared to a gold standard annotated manually with word classes. The Brown algorithm performs significantly better in inducing word classes than a random baseline. This work shows that utilizing unsupervised learning is a feasible approach for doing research on word classes in Swedish Sign Language. However, future studies of this kind should employ a deeper linguistic analysis of the language as a part of choosing the algorithms.

Place, publisher, year, edition, pages
2013.
Keyword [en]
Word Class Induction, Swedish Sign Language, Clustering
National Category
General Language Studies and Linguistics
Identifiers
URN: urn:nbn:se:su:diva-90824OAI: oai:DiVA.org:su-90824DiVA: diva2:627322
Supervisors
Examiners
Available from: 2013-06-13 Created: 2013-06-11 Last updated: 2014-05-26Bibliographically approved

Open Access in DiVA

Automatic Induction of Word Classes in Swedish Sign Language(653 kB)392 downloads
File information
File name FULLTEXT01.pdfFile size 653 kBChecksum SHA-512
f8295f73df54ea0f81a58c407c30192edcaec74747906d6a9833419f8c3e2950d84e8b0fcdec9a2ece13499a8066f0004ae7fd01470796e98fbf9e330967bb3f
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Sjons, Johan
By organisation
Computational Linguistics
General Language Studies and Linguistics

Search outside of DiVA

GoogleGoogle Scholar
Total: 392 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 303 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf