Creating and Evaluating a Consensus for Negated and Speculative Words in a Swedish Clinical Corpus
2010 (English). In: Proceedings of the Workshop on Negation and Speculation in Natural Language Processing (NeSp-NLP 2010) / [ed] Roser Morante, Caroline Sporleder, Antwerp: University of Antwerp, 2010, 5-13 p. Conference paper (Refereed)
In this paper we describe the creation of a consensus corpus obtained by combining three individual annotations of the same Swedish clinical corpus. The consensus was created automatically by applying a few basic rules. The corpus contains negation words, speculative words, uncertain expressions and certain expressions. We evaluated the consensus by using it for negation and speculation cue detection. For training and detection we used Stanford NER, which is based on the machine learning algorithm Conditional Random Fields. For comparison, we also trained Stanford NER on the clinical part of the BioScope Corpus. For our Swedish clinical consensus corpus we obtained a precision of 87.9 percent and a recall of 91.7 percent for negation cues; for English, with the BioScope Corpus, we obtained a precision of 97.6 percent and a recall of 96.7 percent for negation cues.
Place, publisher, year, edition, pages
Antwerp: University of Antwerp, 2010. 5-13 p.
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-51878
ISBN: 9789057282669
OAI: oai:DiVA.org:su-51878
DiVA: diva2:386345
Negation and Speculation in Natural Language Processing, NeSp-NLP 2010 Workshop, Uppsala, Sweden