Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
A clustering framework for patient phenotyping with application to adverse drug events
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.ORCID iD: 0000-0002-4632-4815
Stockholm University, Faculty of Social Sciences, Department of Computer and Systems Sciences.
2020 (English)In: 2020 IEEE 33rd International Symposium on Computer-Based Medical Systems (CBMS), IEEE, 2020, p. 177-182Conference paper, Published paper (Refereed)
Abstract [en]

We present a clustering framework for identifying patient groups with Adverse Drug Reactions from Electronic Health Records (EHRs). The increased adoption of EHRs has brought changes in the way drug safety surveillance is carried out and plays an important role in effective drug regulation. Unsupervised machine learning methods using EHRs as their input can identify patients that share common meaningful information, without the need for expert input. In this work, we propose a generalized framework that exploits the strengths of different clustering algorithms and via clustering aggregation identifies consensus patient cluster profiles. Moreover, the inherent hierarchical structure of diagnoses and medication codes is exploited. We assess the statistical significance of the produced clusterings by applying a randomization technique that keeps the data distribution margins fixed, as we are interested in evaluating information that is not conveyed by the marginal distributions. The experimental findings suggest that the framework produces medically meaningful patient groups with regard to adverse drug events by investigating two use-cases, i.e., aplastic anaemia and drug-induced skin eruption.

Place, publisher, year, edition, pages
IEEE, 2020. p. 177-182
Series
IEEE International Symposium on Computer-Based Medical Systems, ISSN 2372-918X, E-ISSN 2372-9198
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
URN: urn:nbn:se:su:diva-186974DOI: 10.1109/CBMS49503.2020.00041ISBN: 978-1-7281-9429-5 (electronic)ISBN: 978-1-7281-9430-1 (print)OAI: oai:DiVA.org:su-186974DiVA, id: diva2:1505334
Conference
Computer-Based Medical Systems, Rochester, USA, 28-30 July, 2020
Available from: 2020-11-30 Created: 2020-11-30 Last updated: 2024-06-18Bibliographically approved
In thesis
1. Data-Driven AI for Patient and Public Health: On the Use of Multisource and Multimodal Data in Machine Learning to Improve Healthcare
Open this publication in new window or tab >>Data-Driven AI for Patient and Public Health: On the Use of Multisource and Multimodal Data in Machine Learning to Improve Healthcare
2024 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

The integration of artificial intelligence in healthcare has created a new era of advancements, reshaping patient care and revolutionizing public health interventions. Through artificial intelligence, healthcare providers and public health authorities can optimize interventions, leading to more precise and efficient responses that enhance patient outcomes and address public health challenges effectively. The past decade has witnessed a rapid digital transformation across industries, and healthcare is no exception. This evolution is evident in the widespread adoption of electronic health records and healthcare information systems and the integration of diverse technologies, including handheld, wearable, and smart devices.

A central challenge in this digital shift lies in representing data from multiple sources and modalities for downstream machine learning tasks. This complexity stems from the varied longitudinal or contextual events in patients' historical records, encompassing lab tests, vital signs, diagnoses, and drug administration. Additionally, the challenge extends to predictive modeling and constructing robust models that accurately classify future health events, taking into consideration heterogeneous health-related data. Electronic phenotyping, crucial for identifying fine-grained disease/patient clusters, is also a central problem when utilizing multisource and multimodal information effectively to create meaningful patient profiles. In the context of public health interventions, exemplified by crises like the COVID-19 pandemic, decision-making requires a delicate balance between optimizing intervention effectiveness and considering economic and societal well-being.

This Ph.D. thesis seeks to unravel the potential of multisource and multimodal health observational data in generating patient phenotypes and predictions for both individual health and public health surveillance. It addresses the following central question: How can multisource and multimodal observational health data be effectively harnessed, using machine learning, to enhance patient and public health? Comprising five studies, the thesis confronts challenges posed by diverse data sources and modalities, exploring strategies for creating comprehensive patient profiles, developing robust classification models, and employing clustering methods tailored to observational health data. The research seeks to provide valuable insights into integrating AI in healthcare, with a specific emphasis on the complexities of multisource and multimodal data integration. It underscores the importance of exploring heterogeneous health observational data to deepen our understanding of patient health and optimize machine learning applications. Emphasizing the intricate nature of health data, the thesis discusses careful data handling and innovative methodologies to maximize its potential impact on improving patient outcomes and informing public health strategies. The effective management of heterogeneous observational health data requires thoughtful consideration due to their varied sources and inherent complexities.

Place, publisher, year, edition, pages
Stockholm: Department of Computer and Systems Sciences, Stockholm University, 2024. p. 86
Series
Report Series / Department of Computer & Systems Sciences, ISSN 1101-8526 ; 24-009
Keywords
Machine Learning; Artificial Intelligence; Healthcare; Multimodal Data; Complex Data
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-231336 (URN)978-91-8014-847-4 (ISBN)978-91-8014-848-1 (ISBN)
Public defence
2024-09-06, L30, NOD-huset, Borgarfjordsgatan 12, Kista, 09:00 (English)
Opponent
Supervisors
Available from: 2024-08-14 Created: 2024-06-18 Last updated: 2024-08-19Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Bampa, MariaPapapetrou, PanagiotisHollmén, Jaakko

Search in DiVA

By author/editor
Bampa, MariaPapapetrou, PanagiotisHollmén, Jaakko
By organisation
Department of Computer and Systems Sciences
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 178 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf