Using parallel corpora to create a Greek-English dictionary with Uplug
2007 (English)In: NODALIDA 2007 PROCEEDINGS / [ed] Joakim Nivre, Heiki-Jaan Kaalep, Kadri Muischnek and Mare Koit, 2007, 212-215 p.Conference paper (Refereed)
This paper presents the construction of a Greek-English bilingual dictionary from parallel corpora that were created manually by collected documents retrieved from the Internet. The parallel corpora processing was performed by the Uplug word alignment system without the use of language specific information. A sample was extracted from the population of suggested translations and was included in questionnaires that were sent out to Greek-English speakers who evaluated the sample based on the quality of the translation pairs. For the suggested translation pairs of the sample belonging to the stratum with the higher frequency of occurrence, 67.11% correct translations were achieved. With an overall 50.63% of correct translations of the sample, the results were promising considering the minimal optimisation of the corpus and the differences between the two lan- guages.
Place, publisher, year, edition, pages
2007. 212-215 p.
IdentifiersURN: urn:nbn:se:su:diva-37895ISBN: 978-9985-4-0513-0OAI: oai:DiVA.org:su-37895DiVA: diva2:305337
Nodalida 2007, The 16th Nordic Conference of Computational Linguistics, 25-26 May 2007 in Tartu, Estonia