A Random Indexing Approach to Unsupervised Selectional Preference Induction
Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
A selectional preference is the relation between a head-word and plausible arguments of that head-word. Estimation of the association feature between these words is important to natural language processing applications such as Word Sense Disambiguation. This study presents a novel approach to selectional preference induction within a Random Indexing word space. This is a spatial representation of meaning where distributional patterns enable estimation of the similarity between words. Using only frequency statistics about words to estimate how strongly one word selects another, the aim of this study is to develop a flexible method that is not language dependent and does not require any annotated resourceswhich is in contrast to methods from previous research. In order to optimize the performance of the selectional preference model, experiments including parameter tuning and variation of corpus size were conducted. The selectional preference model was evaluated in a pseudo-word evaluation which lets the selectional preference model decide which of two arguments have a stronger correlation to a given verb. Results show that varying parameters and corpus size does not affect the performance of the selectional preference model in a notable way. The conclusion of the study is that the language modelused does not provide the adequate tools to model selectional preferences. This might be due to a noisy representation of head-words and their arguments.
Place, publisher, year, edition, pages
2011. , 28 p.
Selectional preference induction, selectional preferences, word space model, Random Indexing, syntagma, distributional hypothesis
Language Technology (Computational Linguistics) Language Technology (Computational Linguistics)
IdentifiersURN: urn:nbn:se:su:diva-59493OAI: oai:DiVA.org:su-59493DiVA: diva2:428507
2011-06-01, C307, Institutionen för lingvistik, Universitetsvägen 10 C, Stockholm, 10:00 (Swedish)
Sahlgren, Magnus, doktorÖstling, Robert, doktorand
Wirén, Mats, doktor