Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap.
Stockholms universitet, Samhällsvetenskapliga fakulteten, Institutionen för data- och systemvetenskap. Karolinska Institutet, Sweden.ORCID-id: 0000-0002-5780-0063
Vise andre og tillknytning
2015 (engelsk)Inngår i: AMIA Annual Symposium Proceedings, American Medical Informatics Association , 2015, s. 1296-1305Konferansepaper, Publicerat paper (Fagfellevurdert)
Resurstyp
Text
Abstract [en]

Detection of early symptoms in cervical cancer is crucial for early treatment and survival. To find symptoms of cervical cancer in clinical text, Named Entity Recognition is needed. In this paper the Clinical Entity Finder, a machine-learning tool trained on annotated clinical text from a Swedish internal medicine emergency unit, is evaluated on cervical cancer records. The Clinical Entity Finder identifies entities of the types body part, finding and disorder and is extended with negation detection using the rule-based tool NegEx, to distinguish between negated and non-negated entities. To measure the performance of the tools on this new domain, two physicians annotated a set of clinical notes from the health records of cervical cancer patients. The inter-annotator agreement for finding, disorder and body part obtained an average F-score of 0.677 and the Clinical Entity Finder extended with NegEx had an average F-score of 0.667.

sted, utgiver, år, opplag, sider
American Medical Informatics Association , 2015. s. 1296-1305
Serie
AMIA Annual Symposium Proceedings, ISSN 1559-4076, E-ISSN 1942-597X
HSV kategori
Forskningsprogram
data- och systemvetenskap
Identifikatorer
URN: urn:nbn:se:su:diva-123947PubMedID: 26958270OAI: oai:DiVA.org:su-123947DiVA, id: diva2:878591
Konferanse
AMIA 2015 Annual Symposium, San Francisco, CA, November 14 - 18, 2015
Tilgjengelig fra: 2015-12-09 Laget: 2015-12-09 Sist oppdatert: 2022-02-23bibliografisk kontrollert
Inngår i avhandling
1. Mining Clinical Text in Cancer Care
Åpne denne publikasjonen i ny fane eller vindu >>Mining Clinical Text in Cancer Care
2020 (engelsk)Doktoravhandling, med artikler (Annet vitenskapelig)
Abstract [en]

Health care and clinical practice generate large amounts of text detailing symptoms, test results, diagnoses, treatments, and outcomes for patients. This clinical text, documented in health records, is a potential source of knowledge and an underused resource for improved health care. The focus of this work has been text mining of clinical text in the domain of cancer care, with the aim to develop and evaluate methods for extracting relevant information from such texts. Two different types of clinical documentation have been included: clinical notes from electronic health records in Swedish and Norwegian pathology reports.

Free text, and clinical text in particular, is considered as a kind of unstructured information, which is difficult to process automatically. Therefore, information extraction can be applied to create a more structured representation of a text, making its content more accessible for machine learning and statistics. To this end, this thesis describes the development of an efficient and accurate tool for information extraction for pathology reports.

Another application for clinical text mining is risk prediction and diagnosis prediction. The goal for such prediction is to create a machine learning model capable of identifying patients at risk of a specific disease or some other adverse outcome. The motivation for cancer diagnosis prediction is that an early diagnosis can be beneficial for the outcome of treatment. Here, a disease prediction model was developed and evaluated for prediction of cervical cancer. To create this model, health records of patients diagnosed with cervical cancer were processed in two steps. First, clinical events were extracted from free text clinical notes through the use of named entity recognition. The extracted events were next combined with other event types, such as diagnosis codes and drug codes from the same health records. Finally, machine learning models were trained for predicting cervical cancer, and evaluation showed that events extracted from the free text records were the most informative event type for the diagnosis prediction.

sted, utgiver, år, opplag, sider
Stockholm: Department of Computer and Systems Sciences, Stockholm University, 2020. s. 64
Serie
Report Series / Department of Computer & Systems Sciences, ISSN 1101-8526 ; 20-001
Emneord
text mining, natural language processing, electronic health records, clinical text mining, information extraction
HSV kategori
Forskningsprogram
data- och systemvetenskap
Identifikatorer
urn:nbn:se:su:diva-176282 (URN)978-91-7797-911-1 (ISBN)978-91-7797-912-8 (ISBN)
Disputas
2020-01-27, L30, NOD-huset, Borgarfjordsgatan 12, Kista, 13:00 (engelsk)
Opponent
Veileder
Merknad

At the time of the doctoral defense, the following papers were unpublished and had a status as follows: Paper 4: Accepted. Paper 5: Submitted.

Tilgjengelig fra: 2019-12-19 Laget: 2019-11-28 Sist oppdatert: 2022-02-26bibliografisk kontrollert

Open Access i DiVA

Fulltekst mangler i DiVA

PubMed

Person

Weegar, RebeckaKvist, MariaDalianis, Hercules

Søk i DiVA

Av forfatter/redaktør
Weegar, RebeckaKvist, MariaDalianis, Hercules
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar

pubmed
urn-nbn

Altmetric

pubmed
urn-nbn
Totalt: 214 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf