Dictionary-based Amharic-French information retrieval
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
SICS.
2006 (English). In: Accessing multilingual information repositories: 6th workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, revised selected papers / [ed] Carol Peters, Fredric C. Gey, Julio Gonzalo, Henning Müller, Gareth J. F. Jones, Michael Kluck, Bernardo Magnini, Maarten de Rijke, Berlin: Springer Berlin/Heidelberg, 2006, 83-92 p.
Conference paper, Published paper (Other academic)
Abstract [en]

We present four approaches to the Amharic-French bilingual track at CLEF 2005. All experiments use a dictionary-based approach to translate the Amharic queries into French bags-of-words, but while one approach uses word sense discrimination on the translated side of the queries, the other includes all senses of a translated word in the query for searching. We used two search engines, the SICS experimental engine and Lucene, hence four runs with the two approaches. Non-content-bearing words were removed both before and after the dictionary lookup. TF/IDF values supplemented by a heuristic function were used to remove the stop words from the Amharic queries, and two French stopword lists were used to remove them from the French translations. In our experiments, we found that the SICS search engine performs better than Lucene and that using the word-sense-discriminated keywords produces a slightly better result than the full set of non-discriminated keywords.
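
As a rough illustration of the "all senses" variant described in the abstract, the following minimal Python sketch translates a query token by token through a bilingual dictionary and filters stop words both before and after the lookup. It is not the authors' implementation: the function name `translate_query`, the toy dictionary, and both stop word sets are hypothetical, and the source-side tokens are placeholders rather than real Amharic words. Word sense discrimination and the TF/IDF-based heuristic for finding Amharic stop words are not modelled here.

```python
from typing import Dict, List, Set

# Hypothetical toy data. The source-side keys are placeholders,
# not real Amharic words or real dictionary entries.
TOY_DICTIONARY: Dict[str, List[str]] = {
    "src_bank": ["banque", "rive"],        # two senses, both kept
    "src_station": ["gare de train"],      # multiword translation
}
SOURCE_STOPWORDS: Set[str] = {"src_the"}
TARGET_STOPWORDS: Set[str] = {"le", "la", "de"}


def translate_query(
    tokens: List[str],
    dictionary: Dict[str, List[str]],
    source_stop: Set[str],
    target_stop: Set[str],
) -> List[str]:
    """Return a target-language bag of words that keeps every listed sense."""
    bag: List[str] = []
    for token in tokens:
        # Stop word removal before the dictionary lookup.
        if token in source_stop:
            continue
        # Tokens missing from the dictionary contribute nothing in this sketch.
        for translation in dictionary.get(token, []):
            for word in translation.split():
                # Stop word removal after the dictionary lookup.
                if word not in target_stop:
                    bag.append(word)
    return bag


query = ["src_the", "src_bank", "src_station"]
bag_of_words = translate_query(
    query, TOY_DICTIONARY, SOURCE_STOPWORDS, TARGET_STOPWORDS
)
```

A sense-discriminating variant would replace the inner loop over all translations with a step that selects a subset of them; how that selection is made is not specified in this record.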

Place, publisher, year, edition, pages
Berlin: Springer Berlin/Heidelberg, 2006. 83-92 p.
Series
Lecture notes in computer science, ISSN 0302-9743 ; 4022
National Category
Language Technology (Computational Linguistics)
Identifiers
URN: urn:nbn:se:su:diva-37937
ISBN: 978-3-540-45697-1 (print)
ISBN: 978-3-540-45700-8 (print)
OAI: oai:DiVA.org:su-37937
DiVA: diva2:305621
Conference
6th workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005
Available from: 2010-03-24 Created: 2010-03-24 Last updated: 2013-09-26 Bibliographically approved

Other links

http://download.springer.com/static/pdf/978/chp%253A10.1007%252F11878773_9.pdf?auth66=1380369933_afe62ce97aef353261b8c7c804515486&ext=.pdf

By author/editor
Asker, Lars; Sahlgren, Magnus