Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Detecting Protected Health Information in Heterogeneous Clinical Notes
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.ORCID iD: 0000-0002-5780-0063
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2017 (English)In: MEDINFO 2017: Precision Healthcare through Informatics / [ed] Adi V. Gundlapalli, Marie-Christine Jaulent, Dongsheng Zhao, IOS Press, 2017, p. 393-397Conference paper, Published paper (Refereed)
Abstract [en]

To enable secondary use of healthcare data in a privacy-preserving manner, there is a need for methods capable of automatically identifying protected health information (PHI) in clinical text. To that end, learning predictive models from labeled examples has emerged as a promising alternative to rule-based systems. However, little is known about differences with respect to PHI prevalence in different types of clinical notes and how potential domain differences may affect the performance of predictive models trained on one particular type of note and applied to another. In this study, we analyze the performance of a predictive model trained on an existing PHI corpus of Swedish clinical notes and applied to a variety of clinical notes: written (i) in different clinical specialties, (ii) under different headings, and (iii) by persons in different professions. The results indicate that domain adaption is needed for effective detection of PHI in heterogeneous clinical notes.

Place, publisher, year, edition, pages
IOS Press, 2017. p. 393-397
Series
Studies in Health Technology and Informatics, ISSN 0926-9630, E-ISSN 1879-8365 ; 245
Keywords [en]
Data Anonymization, Electronic Health Records, Natural Language Processing
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-150179DOI: 10.3233/978-1-61499-830-3-393ISI: 000449471200082ISBN: 978-1-61499-829-7 (print)ISBN: 978-1-61499-830-3 (electronic)OAI: oai:DiVA.org:su-150179DiVA, id: diva2:1165765
Conference
16th World Congress of Medical and Health Informatics (MedInfo2017), Hangzhou, China, August 21-25, 2017
Available from: 2017-12-13 Created: 2017-12-13 Last updated: 2022-02-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Henriksson, AronKvist, MariaDalianis, Hercules

Search in DiVA

By author/editor
Henriksson, AronKvist, MariaDalianis, Hercules
By organisation
Department of Computer and Systems Sciences
Information Systems

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 81 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf