Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards A Better Understanding of Uncertainties and Speculations in Swedish Clinical Text – Analysis of an Initial Annotation Trial
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2010 (English)In: Proceedings of the Workshop on Negation and Speculation in Natural Language Processing, University of Antwerpen , 2010, 14-22 p.Conference paper, Published paper (Other academic)
Abstract [en]

In view of the increasing need to facilitate processing the content of scientific papers, we present an annotation scheme for annotating full papers with zones of conceptualisation, reflecting the information structure and knowledge types which constitute a scientific investigation. The latter are the Core Scientific Concepts (CoreSCs) and include Hypothesis, Motivation, Goal, Object, Background, Method, Experiment, Model, Observation, Result and Conclusion. The CoreSC scheme has been used to annotate a corpus of 265 full papers in physical chemistry and biochemistry and we are currently automating the recognition of CoreSCs in papers. We discuss how the CoreSC scheme relates to other views of scientific papers and indeed how the former could be used to help identify negation and speculation in scientific texts.

Place, publisher, year, edition, pages
University of Antwerpen , 2010. 14-22 p.
National Category
Information Science
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-52029ISBN: 9789057282669 (print)OAI: oai:DiVA.org:su-52029DiVA: diva2:386511
Conference
Workshop on Negation and Speculation in Natural Language Processing
Available from: 2011-01-12 Created: 2011-01-12 Last updated: 2012-03-27Bibliographically approved
In thesis
1. Shades of Certainty: Annotation and Classification of Swedish Medical Records
Open this publication in new window or tab >>Shades of Certainty: Annotation and Classification of Swedish Medical Records
2012 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Access to information is fundamental in health care. This thesis presents research on Swedish medical records with the overall goal of building intelligent information access tools that can aid health personnel, researchers and other professions in their daily work, and, ultimately, improve health care in general.

The issue of ethics and identifiable information is addressed by creating an annotated gold standard corpus and porting an existing de-identification system to Swedish from English. The aim is to move towards making textual resources available to researchers without risking exposure of patients’ confidential information. Results for the rule-based system are not encouraging, but results for the gold standard are fairly high.

Affirmed, uncertain and negated information needs to be distinguished when building accurate information extraction tools. Annotation models are created, with the aim of building automated systems. One model distinguishes certain and uncertain sentences, and is applied on medical records from several clinical departments. In a second model, two polarities and three levels of certainty are applied on diagnostic statements from an emergency department. Overall results are promising. Differences are seen depending on clinical practice, annotation task and level of domain expertise among the annotators.

Using annotated resources for automatic classification is studied. Encouraging overall results using local context information are obtained. The fine-grained certainty levels are used for building classifiers for real-world e-health scenarios.

This thesis contributes two annotation models of certainty and one of identifiable information, applied on Swedish medical records. A deeper understanding of the language use linked to conveying certainty levels is gained. Three annotated resources that can be used for further research have been created, and implications for automated systems are presented.

Place, publisher, year, edition, pages
Stockholm: Department of Computer and Systems Sciences, Stockholm University, 2012. 78 p.
Series
Report Series / Department of Computer & Systems Sciences, ISSN 1101-8526 ; 12-002
Keyword
Clinical documentation, Certainty level classification, Annotation, E-health, Corpus creation, De-identification, Speculative language, Medical Records, Swedish, Natural Language Processing, Language Technology
National Category
Information Systems, Social aspects
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-74828 (URN)978-91-7447-444-2 (ISBN)
Public defence
2012-04-27, Sal C, Forum 100, Isafjordsgatan 39, Kista, 13:00 (English)
Opponent
Supervisors
Available from: 2012-04-05 Created: 2012-03-27 Last updated: 2012-03-28Bibliographically approved

Open Access in DiVA

No full text

Other links

fulltextProceedings

Search in DiVA

By author/editor
Velupillai, Sumithra
By organisation
Department of Computer and Systems Sciences
Information Science

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 29 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf