Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A Random Indexing Approach to Unsupervised Selectional Preference Induction
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. (Datorlingvistik)
Stockholm University, Faculty of Humanities, Department of Linguistics, Computational Linguistics. (Datorlingvistik)ORCID iD: 0000-0002-4269-5619
2011 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

A selectional preference is the relation between a head-word and plausible arguments of that head-word. Estimation of the association feature between these words is important to natural language processing applications such as Word Sense Disambiguation. This study presents a novel approach to selectional preference induction within a Random Indexing word space. This is a spatial representation of meaning where distributional patterns enable estimation of the similarity between words. Using only frequency statistics about words to estimate how strongly one word selects another, the aim of this study is to develop a flexible method that is not language dependent and does not require any annotated resourceswhich is in contrast to methods from previous research. In order to optimize the performance of the selectional preference model, experiments including parameter tuning and variation of corpus size were conducted. The selectional preference model was evaluated in a pseudo-word evaluation which lets the selectional preference model decide which of two arguments have a stronger correlation to a given verb. Results show that varying parameters and corpus size does not affect the performance of the selectional preference model in a notable way. The conclusion of the study is that the language modelused does not provide the adequate tools to model selectional preferences. This might be due to a noisy representation of head-words and their arguments.

Place, publisher, year, edition, pages
2011. , 28 p.
Keyword [en]
Selectional preference induction, selectional preferences, word space model, Random Indexing, syntagma, distributional hypothesis
National Category
Language Technology (Computational Linguistics) Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:su:diva-59493OAI: oai:DiVA.org:su-59493DiVA: diva2:428507
Presentation
2011-06-01, C307, Institutionen för lingvistik, Universitetsvägen 10 C, Stockholm, 10:00 (Swedish)
Uppsok
Humanities, Theology
Supervisors
Examiners
Available from: 2011-07-04 Created: 2011-06-30 Last updated: 2014-06-02Bibliographically approved

Open Access in DiVA

fulltext(692 kB)776 downloads
File information
File name FULLTEXT02.pdfFile size 692 kBChecksum SHA-512
2362c780fd2733a4a028f1e3d8fb077c418b677eb012ca3de65e9dde02054ad6971d1eec51977f0587942598d4985b3b9b744ef18fbcac21d1c46996cce67094
Type fulltextMimetype application/pdf

Search in DiVA

By author/editor
Tengstrand, Lisa
By organisation
Computational Linguistics
Language Technology (Computational Linguistics)Language Technology (Computational Linguistics)

Search outside of DiVA

GoogleGoogle Scholar
Total: 776 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 499 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf