Medical vocabulary mining using distributional semantics on Japanese patient blogs
2014 (English). In: Proceedings of the 6th International Symposium on Semantic Mining in Biomedicine, 2014, 57-62 p. Conference paper (Refereed)
Random indexing has previously been successfully used for medical vocabulary expansion for Germanic languages. In this study, we used this approach to extract medical terms from a Japanese patient blog corpus. The corpus was segmented into semantic units by a semantic role labeller, and different pre-processing and parameter settings were then evaluated. The evaluation showed that similar settings are suitable for Japanese as for previously explored Germanic languages, and that distributional semantics is equally useful for semi-automatic expansion of Japanese medical vocabularies as for medical vocabularies in Germanic languages.
Place, publisher, year, edition, pages
2014. 57-62 p.
Keywords: Random indexing, Japanese, medical terminologies
Research subject Computer and Systems Sciences
Identifiers: URN: urn:nbn:se:su:diva-108688; DOI: 10.5167/uzh-98982; OAI: oai:DiVA.org:su-108688; DiVA: diva2:760066
6th International Symposium on Semantic Mining in Biomedicine (SMBM), Aveiro, Portugal, October 6‐7, 2014