Experiences with Developing Language Processing Tools and Corpora for Amharic
2010 (English)In: IST-Africa 2010 Conference Proceedings, Paul Cunningham and Miriam Cunningham , 2010Conference paper (Other academic)
A major bottleneck for promoting use of computers and the Internet is that many languages lack access to basic tools that would make it possible for people to access ICT in their own language. The paper describes the development a set of such resources for the processing of Amharic, the working language of the Ethiopian government. The primary goal was to investigate techniques and methods that can be used to efficiently create computational linguistic resources for new languages based on existing tools and resources. The resources created consist of linguistically annotated text collections and tools for word-level analysis of Amharic.
Place, publisher, year, edition, pages
Paul Cunningham and Miriam Cunningham , 2010.
Part-of-Speech Tagging, Text Categorization, Corpora, Amharic
Research subject Computer and Systems Sciences
IdentifiersURN: urn:nbn:se:su:diva-51901ISBN: 978-1-905824-15-1OAI: oai:DiVA.org:su-51901DiVA: diva2:386368