Change search
ReferencesLink to record
Permanent link

Direct link
Amharic-English information retrieval with pseudo relevance feedback
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2008 (English)In: Advances in multilingual and multimodal information retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007 : revised selected papers / [ed] Carol Peters ... [et al.], Berlin ; New York: Springer, 2008, 119-126 p.Conference paper (Other academic)
Abstract [en]

We describe cross language retrieval experiments using Amharic queries and English language document collection from our participation in the bilingual ad hoc track at the CLEF 2007. Two monolingual and eight bilingual runs were submitted. The bilingual experiments designed varied in terms of usage of long and short queries, presence of pseudo relevance feedback (PRF), and three approaches (maximal expansion, first-translation-given, manual) for word sense disambiguation. We used an Amharic-English machine readable dictionary (MRD) and an online Amharic-English dictionary in order to do the lookup translation of query terms. In utilizing both resources, matching query term bigrams were always given precedence over unigrams. Out of dictionary Amharic query terms were taken to be possible named entities in the language, and further filtering was attained through restricted fuzzy matching based on edit distance. The fuzzy matching was performed for each of these terms against automatically extracted English proper names. The Lemur toolkit for language modeling and information retrieval was used for indexing and retrieval. Although the experiments are too limited to draw conclusions from, the obtained results indicate that longer queries tend to perform similar to short ones, PRF improves performance considerably, and that queries tend to fare better when we use the first translation given in the MRD rather than using maximal expansion of terms by taking all the translations given in the MRD.

Place, publisher, year, edition, pages
Berlin ; New York: Springer, 2008. 119-126 p.
, Lecture notes in computer science, 5152
National Category
Information Systems, Social aspects
URN: urn:nbn:se:su:diva-33857ISBN: 9783540857594OAI: diva2:283752
8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007
Available from: 2009-12-30 Created: 2009-12-30 Last updated: 2013-10-10Bibliographically approved

Open Access in DiVA

No full text

Other links
By organisation
Department of Computer and Systems Sciences
Information Systems, Social aspects

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Total: 64 hits
ReferencesLink to record
Permanent link

Direct link