Multi-label Diagnosis Classification of Swedish Discharge Summaries – ICD-10 Code Assignment Using KB-BERT
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences. Norwegian Centre for E-health Research, Norway.
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences. Norwegian Centre for E-health Research, Norway. ORCID iD: 0000-0003-0165-9926
2021 (English)
In: International Conference Recent Advances in Natural Language Processing 2021: Deep Learning for Natural Language Processing Methods and Applications: Proceedings / [ed] Galia Angelova; Maria Kunilovskaya; Ruslan Mitkov; Ivelina Nikolova-Koleva, Shoumen: INCOMA Ltd., 2021, p. 1158-1166
Conference paper, Published paper (Refereed)
Abstract [en]

The International Classification of Diseases (ICD) is a system for systematically recording patients’ diagnoses. Clinicians or professional coders assign ICD codes to patients’ medical records to facilitate funding, research, and administration. In most health facilities, clinical coding is a manual, time-demanding task that is prone to errors. A tool that automatically assigns ICD codes to free-text clinical notes could save time and reduce erroneous coding. While many previous studies have focused on ICD coding, research on Swedish patient records is scarce. This study explored different approaches to pairing Swedish clinical notes with ICD codes. KB-BERT, a BERT model pre-trained on Swedish text, was compared to the traditional supervised learning models Support Vector Machines, Decision Trees, and K-nearest Neighbours used as the baseline. When considering ICD codes grouped into ten blocks, the KB-BERT was superior to the baseline models, obtaining an F1-micro of 0.80 and an F1-macro of 0.58. When considering the 263 full ICD codes, the KB-BERT was outperformed by all baseline models at an F1-micro and F1-macro of zero. Wilcoxon signed-rank tests showed that the performance differences between the KB-BERT and the baseline models were statistically significant.
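The abstract reports both an F1-micro and an F1-macro, and the two can differ widely when some labels are rare. The sketch below shows how the two averages are formed for a multi-label task. It is an illustration on made-up data, not the authors' evaluation code: the function names, the label letters, and the toy gold and predicted sets are all assumptions introduced here.

```python
from typing import Dict, List, Set, Tuple


def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def micro_macro_f1(
    gold: List[Set[str]], pred: List[Set[str]], labels: List[str]
) -> Tuple[float, float]:
    """Return (F1-micro, F1-macro) for multi-label predictions."""
    # Per-label [true positives, false positives, false negatives].
    counts: Dict[str, List[int]] = {label: [0, 0, 0] for label in labels}
    for gold_set, pred_set in zip(gold, pred):
        for label in labels:
            if label in pred_set and label in gold_set:
                counts[label][0] += 1
            elif label in pred_set:
                counts[label][1] += 1
            elif label in gold_set:
                counts[label][2] += 1

    # Micro: pool the counts over all labels, then compute one F1.
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    micro = f1_from_counts(tp, fp, fn)

    # Macro: compute F1 per label, then take the unweighted mean.
    macro = sum(f1_from_counts(*c) for c in counts.values()) / len(labels)
    return micro, macro


# Hypothetical toy data: three code groups, one of them frequent.
labels = ["I", "J", "K"]
gold = [{"I"}, {"I"}, {"I", "J"}, {"K"}]
pred = [{"I"}, {"I"}, {"I"}, {"I"}]  # a model that only ever predicts "I"

micro, macro = micro_macro_f1(gold, pred, labels)
print(f"F1-micro = {micro:.3f}, F1-macro = {macro:.3f}")
```

In this toy case the model never predicts the two rare labels, so their per-label F1 is zero. That pulls the macro average down much further than the micro average, which is dominated by the frequent label.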

Place, publisher, year, edition, pages
Shoumen: INCOMA Ltd., 2021. p. 1158-1166
Series
International Conference Recent Advances in Natural Language Processing, ISSN 1313-8502, E-ISSN 2603-2813
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-200500
DOI: 10.26615/978-954-452-072-4_130
ISBN: 978-954-452-072-4 (print)
OAI: oai:DiVA.org:su-200500
DiVA, id: diva2:1625240
Conference
International Conference Recent Advances in Natural Language Processing (RANLP'21), online, September 1-3, 2021
Available from: 2022-01-06 Created: 2022-01-06 Last updated: 2022-01-28
Bibliographically approved

Authority records

Remmer, Sonja; Lamproudis, Anastasios; Dalianis, Hercules
