Professional language in Swedish clinical text: Linguistic characterization and comparative studies
2014 (English)In: Nordic Journal of Linguistics, ISSN 0332-5865, E-ISSN 1502-4717, Vol. 37, no 2, 297-323 p.Article in journal (Refereed) Published
This study investigates the linguistic characteristics of Swedish clinical text in radiology reports and doctor's daily notes from electronic health records (EHRs) in comparison to general Swedish and biomedical journal text. We quantify linguistic features through a comparative register analysis to determine how the free text of EHRs differ from general and biomedical Swedish text in terms of lexical complexity, word and sentence composition, and common sentence structures. The linguistic features are extracted using state-of-the-art computational tools: a tokenizer, a part-of-speech tagger, and scripts for statistical analysis. Results show that technical terms and abbreviations are more frequent in clinical text, and lexical variance is low. Moreover, clinical text frequently omit subjects, verbs, and function words resulting in shorter sentences. Clinical text not only differs from general Swedish, but also internally, across its sub-domains, e.g. sentences lacking verbs are significantly more frequent in radiology reports. These results provide a foundation for future development of automatic methods for EHR simplification or clarification.
Place, publisher, year, edition, pages
2014. Vol. 37, no 2, 297-323 p.
clinical text, comparative register analysis, doctor's daily notes, electronic health records, medical terminology, part-of-speech tagging, radiology reports
Languages and Literature Information Systems
IdentifiersURN: urn:nbn:se:su:diva-110161DOI: 10.1017/S0332586514000213ISI: 000344015200007OAI: oai:DiVA.org:su-110161DiVA: diva2:771482