Change search
Link to record
Permanent link

Direct link
Alternative names
Publications (10 of 40) Show all publications
Wang, Z., Samsten, I., Miliou, I. & Papapetrou, P. (2024). COMET: Constrained Counterfactual Explanations for Patient Glucose Multivariate Forecasting. In: Annual IEEE Symposium on Computer-Based Medical Systems: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024. Paper presented at 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, Guadalajara, Mexico. (pp. 502-507). IEEE (Institute of Electrical and Electronics Engineers)
Open this publication in new window or tab >>COMET: Constrained Counterfactual Explanations for Patient Glucose Multivariate Forecasting
2024 (English)In: Annual IEEE Symposium on Computer-Based Medical Systems: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, IEEE (Institute of Electrical and Electronics Engineers) , 2024, p. 502-507Conference paper, Published paper (Refereed)
Abstract [en]

Applying deep learning models for healthcare-related forecasting applications has been widely adopted, such as leveraging glucose monitoring data of diabetes patients to predict hyperglycaemic or hypoglycaemic events. However, most deep learning models are considered black-boxes; hence, the model predictions are not interpretable and may not offer actionable insights into medical practitioners’ decisions. Previous work has shown that counterfactual explanations can be applied in forecasting tasks by suggesting counterfactual changes in time series inputs to achieve the desired forecasting outcome. This study proposes a generalized multivariate forecasting setup of counterfactual generation by introducing a novel approach, COMET, which imposes three domain-specific constraint mechanisms to provide counterfactual explanations for glucose forecasting. Moreover, we conduct the experimental evaluation using two diabetes patient datasets to demonstrate the effectiveness of our proposed approach in generating realistic counterfactual changes in comparison with a baseline approach. Our qualitative analysis evaluates examples to validate that the counterfactual samples are clinically relevant and can effectively lead the patients to achieve a normal range of predicted glucose levels by suggesting changes to the treatment variables.

Place, publisher, year, edition, pages
IEEE (Institute of Electrical and Electronics Engineers), 2024
Series
IEEE International Symposium on Computer-Based Medical Systems, ISSN 2372-918X, E-ISSN 2372-9198
Keywords
Comet, Deep learning, Patents, Time series analysis, Predictive models, Glucose, Diabetes, time series forecasting, blood glucose prediction, counterfactual explanations, deep learning
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-233744 (URN)10.1109/CBMS61543.2024.00089 (DOI)001284700700038 ()2-s2.0-85200437241 (Scopus ID)
Conference
2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, Guadalajara, Mexico.
Available from: 2024-09-24 Created: 2024-09-24 Last updated: 2024-10-16Bibliographically approved
Wang, Z., Miliou, I., Samsten, I. & Papapetrou, P. (2024). Counterfactual Explanations for Time Series Forecasting. In: 2023 IEEE International Conference on Data Mining (ICDM): . Paper presented at IEEE International Conference on Data Mining (ICDM), 1-4 December 2023, Shanghai, China. (pp. 1391-1396). IEEE conference proceedings
Open this publication in new window or tab >>Counterfactual Explanations for Time Series Forecasting
2024 (English)In: 2023 IEEE International Conference on Data Mining (ICDM), IEEE conference proceedings , 2024, p. 1391-1396Conference paper, Published paper (Refereed)
Abstract [en]

Among recent developments in time series forecasting methods, deep forecasting models have gained popularity as they can utilize hidden feature patterns in time series to improve forecasting performance. Nevertheless, the majority of current deep forecasting models are opaque, hence making it challenging to interpret the results. While counterfactual explanations have been extensively employed as a post-hoc approach for explaining classification models, their application to forecasting models still remains underexplored. In this paper, we formulate the novel problem of counterfactual generation for time series forecasting, and propose an algorithm, called ForecastCF, that solves the problem by applying gradient-based perturbations to the original time series. The perturbations are further guided by imposing constraints to the forecasted values. We experimentally evaluate ForecastCF using four state-of-the-art deep model architectures and compare to two baselines. ForecastCF outperforms the baselines in terms of counterfactual validity and data manifold closeness, while generating meaningful and relevant counterfactuals for various forecasting tasks.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Series
IEEE International Conference on Data Mining. Proceedings, ISSN 1550-4786, E-ISSN 2374-8486
Keywords
Time series forecasting, Counterfactual explanations, Model interpretability, Deep learning
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-226602 (URN)10.1109/ICDM58522.2023.00180 (DOI)001165180100171 ()2-s2.0-85185401353 (Scopus ID)979-8-3503-0788-7 (ISBN)
Conference
IEEE International Conference on Data Mining (ICDM), 1-4 December 2023, Shanghai, China.
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-11-14Bibliographically approved
Wang, Z., Samsten, I., Miliou, I., Mochaourab, R. & Papapetrou, P. (2024). Glacier: guided locally constrained counterfactual explanations for time series classification. Machine Learning, 113, 4639-4669
Open this publication in new window or tab >>Glacier: guided locally constrained counterfactual explanations for time series classification
Show others...
2024 (English)In: Machine Learning, ISSN 0885-6125, E-ISSN 1573-0565, Vol. 113, p. 4639-4669Article in journal (Refereed) Published
Abstract [en]

In machine learning applications, there is a need to obtain predictive models of high performance and, most importantly, to allow end-users and practitioners to understand and act on their predictions. One way to obtain such understanding is via counterfactuals, that provide sample-based explanations in the form of recommendations on which features need to be modified from a test example so that the classification outcome of a given classifier changes from an undesired outcome to a desired one. This paper focuses on the domain of time series classification, more specifically, on defining counterfactual explanations for univariate time series. We propose Glacier, a model-agnostic method for generating locally-constrained counterfactual explanations for time series classification using gradient search either on the original space or on a latent space that is learned through an auto-encoder. An additional flexibility of our method is the inclusion of constraints on the counterfactual generation process that favour applying changes to particular time series points or segments while discouraging changing others. The main purpose of these constraints is to ensure more reliable counterfactuals, while increasing the efficiency of the counterfactual generation process. Two particular types of constraints are considered, i.e., example-specific constraints and global constraints. We conduct extensive experiments on 40 datasets from the UCR archive, comparing different instantiations of Glacier against three competitors. Our findings suggest that Glacier outperforms the three competitors in terms of two common metrics for counterfactuals, i.e., proximity and compactness. Moreover, Glacier obtains comparable counterfactual validity compared to the best of the three competitors. Finally, when comparing the unconstrained variant of Glacier to the constraint-based variants, we conclude that the inclusion of example-specific and global constraints yields a good performance while demonstrating the trade-off between the different metrics.

Keywords
Time series classification, Interpretability, Counterfactual explanation, s Deep learning
National Category
Other Computer and Information Science
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-227717 (URN)10.1007/s10994-023-06502-x (DOI)001181943800001 ()2-s2.0-85187677577 (Scopus ID)
Available from: 2024-03-26 Created: 2024-03-26 Last updated: 2024-10-16Bibliographically approved
Svanberg, J., Öhman, P., Samsten, I., Neidermeyer, P., Rana, T. & Berg, N. (2024). Predictive Machine Learning in Assessing Materiality: The Global Reporting Initiative Standard and Beyond. In: Thomas Walker; Stefan Wendt; Sherif Goubran; Tyler Schwartz (Ed.), Artificial Intelligence for Sustainability: Innovations in Business and Financial Services (pp. 105-131). Palgrave Macmillan
Open this publication in new window or tab >>Predictive Machine Learning in Assessing Materiality: The Global Reporting Initiative Standard and Beyond
Show others...
2024 (English)In: Artificial Intelligence for Sustainability: Innovations in Business and Financial Services / [ed] Thomas Walker; Stefan Wendt; Sherif Goubran; Tyler Schwartz, Palgrave Macmillan , 2024, p. 105-131Chapter in book (Refereed)
Abstract [en]

Sustainability reporting standards state that material information should be disclosed, but materiality is not easily nor consistently defined across companies and sectors. Research finds that materiality assessments by reporting companies and sustainability auditors are uncertain, discretionary, and subjective. This chapter investigates a machine learning approach to sustainability reporting materiality assessments that has predictive validity. The investigated assessment methodology provides materiality assessments of disclosed as well as non-disclosed sustainability items consistent with the impact materiality GRI (Global Reporting Initiative) reporting standard. Our machine learning model estimates the likelihood that a company fully complies with environmental responsibilities. We then explore how a state-of-the-art model interpretation method, the SHAP (SHapley Additive exPlanations) developed by Lundberg and Lee (A unified approach to interpreting model predictions. Advances in Neural Information Processing Systems, 2017-December, pp 4766–4775, 2017), can be used to estimate impact materiality.

Place, publisher, year, edition, pages
Palgrave Macmillan, 2024
Keywords
Sustainability reporting, Materiality assessment, Machine learning, Predictive validity
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-228237 (URN)10.1007/978-3-031-49979-1_6 (DOI)978-3-031-49978-4 (ISBN)978-3-031-49979-1 (ISBN)
Available from: 2024-04-10 Created: 2024-04-10 Last updated: 2024-04-12Bibliographically approved
Öhman, P., Svanberg, J. & Samsten, I. (2023). Assessment 10 of double materiality: The development of predictively valid materiality assessments with artificial intelligence. In: Jan Marton; Fredrik Nilsson; Peter Öhman (Ed.), Auditing Transformation: Regulation, Digitalisation and Sustainability (pp. 205-226). Routledge
Open this publication in new window or tab >>Assessment 10 of double materiality: The development of predictively valid materiality assessments with artificial intelligence
2023 (English)In: Auditing Transformation: Regulation, Digitalisation and Sustainability / [ed] Jan Marton; Fredrik Nilsson; Peter Öhman, Routledge , 2023, p. 205-226Chapter in book (Refereed)
Abstract [en]

Sustainability reporting standards, e.g. the Global Reporting Initiative, require a broader definition of materiality than is traditionally used in financial reporting. Double materiality expands the material information concept to include information about companies' environmental and social impact relevant to society at large. A problem for reporting companies as well as auditors (even though accounting firms invest resources in establishing themselves as reliable service providers) is that the assessment of double materiality is uncertain. The chapter utilises machine learning methods to suggest a method to determine double materiality in sustainability reporting by examining what type of information can predict environmental issues resulting from companies' operations. It represents a proposal to use a structured and quantitative approach for sustainability auditors to determine double materiality, thereby potentially facilitating sustainability reporting and assurance in accordance with future regulation.

Place, publisher, year, edition, pages
Routledge, 2023
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-228236 (URN)10.4324/9781003411390-13 (DOI)2-s2.0-85170151952 (Scopus ID)9781003411390 (ISBN)
Available from: 2024-04-10 Created: 2024-04-10 Last updated: 2024-04-12Bibliographically approved
Svanberg, J., Ardeshiri, T., Samsten, I., Öhman, P., Neidermeyer, P. E., Rana, T., . . . Danielson, M. (2023). Must social performance ratings be idiosyncratic? An exploration of social performance ratings with predictive validity. Sustainability Accounting, Management and Policy Journal, 14(7), 313-348
Open this publication in new window or tab >>Must social performance ratings be idiosyncratic? An exploration of social performance ratings with predictive validity
Show others...
2023 (English)In: Sustainability Accounting, Management and Policy Journal, ISSN 2040-8021, E-ISSN 2040-803X, Vol. 14, no 7, p. 313-348Article in journal (Refereed) Published
Abstract [sv]

Syftet med denna studie är att utveckla en metod för att bedöma social prestation. Traditionellt använder leverantörer av miljö, social och styrning (ESG) subjektivt viktade aritmetiska medelvärden för att kombinera en uppsättning sociala prestationsindikatorer (SP) till en enda värdering. För att övervinna detta problem undersöker denna studie förutsättningarna för en ny metodik för att klassificera SP-komponenten i ESG genom att tillämpa maskininlärning (ML) och artificiell intelligens (AI) förankrade i sociala kontroverser.

Den här studien föreslår användningen av en datadriven klassificeringsmetodik som härleder den relativa betydelsen av SP-egenskaper från deras bidrag till förutsägelsen av sociala kontroverser. Författarna använder den föreslagna metoden för att lösa viktningsproblemet med övergripande ESG-betyg och ytterligare undersöka om förutsägelse är möjlig.

Författarna finner att ML-modeller kan förutsäga kontroverser med hög prediktiv prestanda och validitet. Resultaten tyder på att viktningsproblemet med ESG-betygen kan lösas med ett datadrivet tillvägagångssätt. Den avgörande förutsättningen för den föreslagna ratingmetodiken är dock att sociala kontroverser förutsägs av en bred uppsättning SP-indikatorer. Resultaten tyder också på att prediktivt giltiga betyg kan utvecklas med denna ML-baserade AI-metod.

Praktiska konsekvenser

Denna studie erbjuder praktiska lösningar på ESG-ratingproblem som har konsekvenser för investerare, ESG-bedömare och socialt ansvarsfulla investeringar.

Den föreslagna ML-baserade AI-metoden kan bidra till att uppnå bättre ESG-betyg, vilket i sin tur kommer att bidra till att förbättra SP, vilket får konsekvenser för organisationer och samhällen genom hållbar utveckling.

Så vitt författarna vet är denna forskning en av de första studierna som erbjuder en unik metod för att ta itu med ESG-betygsproblemet och förbättra hållbarheten genom att fokusera på SP-indikatorer.

Abstract [en]

The purpose of this study is to develop a method to assess social performance. Traditionally, environment, social and governance (ESG) rating providers use subjectively weighted arithmetic averages to combine a set of social performance (SP) indicators into one single rating. To overcome this problem, this study investigates the preconditions for a new methodology for rating the SP component of the ESG by applying machine learning (ML) and artificial intelligence (AI) anchored to social controversies.

This study proposes the use of a data-driven rating methodology that derives the relative importance of SP features from their contribution to the prediction of social controversies. The authors use the proposed methodology to solve the weighting problem with overall ESG ratings and further investigate whether prediction is possible.

The authors find that ML models are able to predict controversies with high predictive performance and validity. The findings indicate that the weighting problem with the ESG ratings can be addressed with a data-driven approach. The decisive prerequisite, however, for the proposed rating methodology is that social controversies are predicted by a broad set of SP indicators. The results also suggest that predictively valid ratings can be developed with this ML-based AI method.

Practical implications

This study offers practical solutions to ESG rating problems that have implications for investors, ESG raters and socially responsible investments.

The proposed ML-based AI method can help to achieve better ESG ratings, which will in turn, help to improve SP, which has implications for organizations and societies through sustainable development.

To the best of the authors’ knowledge, this research is one of the first studies that offers a unique method to address the ESG rating problem and improve sustainability by focusing on SP indicators.

Keywords
AI, machine learning, ESG, social performance indicators, AI, maskininlärning, ESG, sociala prestationsindikatorer
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-223535 (URN)10.1108/SAMPJ-03-2022-0127 (DOI)001086807300001 ()2-s2.0-85175012618 (Scopus ID)
Available from: 2023-11-01 Created: 2023-11-01 Last updated: 2023-11-14Bibliographically approved
Svanberg, J., Ardeshiri, T., Samsten, I., Öhman, P. & Neidermeyer, P. E. (2023). Prediction of Controversies and Estimation of ESG Performance: An Experimental Investigation Using Machine Learning. In: Tarek Rana; Jan Svanberg; Peter Öhman; Alan Lowe (Ed.), Handbook of Big Data and Analytics in Accounting and Auditing: (pp. 65-87). Springer Publishing Company
Open this publication in new window or tab >>Prediction of Controversies and Estimation of ESG Performance: An Experimental Investigation Using Machine Learning
Show others...
2023 (English)In: Handbook of Big Data and Analytics in Accounting and Auditing / [ed] Tarek Rana; Jan Svanberg; Peter Öhman; Alan Lowe, Springer Publishing Company , 2023, p. 65-87Chapter in book (Refereed)
Abstract [en]

We develop a new methodology for computing environmental, social, and governance (ESG) ratings using a mode of artificial intelligence (AI) called machine learning (ML) to make ESG more transparent. The ML algorithms anchor our rating methodology in controversies related to non-compliance with corporate social responsibility (CSR). This methodology is consistent with the information needs of institutional investors and is the first ESG methodology with predictive validity. Our best model predicts what companies are likely to experience controversies. It has a precision of 70–84 per cent and high predictive performance on several measures. It also provides evidence of what indicators contribute the most to the predicted likelihood of experiencing an ESG controversy. Furthermore, while the common approach of rating companies is to aggregate indicators using the arithmetic average, which is a simple explanatory model designed to describe an average company, the proposed rating methodology uses state-of-the-art AI technology to aggregate ESG indicators into holistic ratings for the predictive modelling of individual company performance.

Predictive modelling using ML enables our models to aggregate the information contained in ESG indicators with far less information loss than with the predominant aggregation method.

Place, publisher, year, edition, pages
Springer Publishing Company, 2023
Keywords
Artificial Intelligence, Controversies, Corporate Social Performance, ESG, Machine Learning, Socially Responsible Investment
National Category
Other Computer and Information Science
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-222627 (URN)10.1007/978-981-19-4460-4_4 (DOI)2-s2.0-85160734598 (Scopus ID)978-981-19-4459-8 (ISBN)978-981-19-4460-4 (ISBN)
Available from: 2023-10-13 Created: 2023-10-13 Last updated: 2023-10-16Bibliographically approved
Wang, Z., Samsten, I., Kougia, V. & Papapetrou, P. (2023). Style-transfer counterfactual explanations: An application to mortality prevention of ICU patients. Artificial Intelligence in Medicine, 135, Article ID 102457.
Open this publication in new window or tab >>Style-transfer counterfactual explanations: An application to mortality prevention of ICU patients
2023 (English)In: Artificial Intelligence in Medicine, ISSN 0933-3657, E-ISSN 1873-2860, Vol. 135, article id 102457Article in journal (Refereed) Published
Abstract [en]

In recent years, machine learning methods have been rapidly adopted in the medical domain. However, current state-of-the-art medical mining methods usually produce opaque, black-box models. To address the lack of model transparency, substantial attention has been given to developing interpretable machine learning models. In the medical domain, counterfactuals can provide example-based explanations for predictions, and show practitioners the modifications required to change a prediction from an undesired to a desired state. In this paper, we propose a counterfactual solution MedSeqCF for preventing the mortality of three cohorts of ICU patients, by representing their electronic health records as medical event sequences, and generating counterfactuals by adopting and employing a text style-transfer technique. We propose three model augmentations for MedSeqCF to integrate additional medical knowledge for generating more trustworthy counterfactuals. Experimental results on the MIMIC-III dataset strongly suggest that augmented style-transfer methods can be effectively adapted for the problem of counterfactual explanations in healthcare applications and can further improve the model performance in terms of validity, BLEU-4, local outlier factor, and edit distance. In addition, our qualitative analysis of the results by consultation with medical experts suggests that our style-transfer solutions can generate clinically relevant and actionable counterfactual explanations.

National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-212771 (URN)10.1016/j.artmed.2022.102457 (DOI)000897143800009 ()36628793 (PubMedID)2-s2.0-85143973748 (Scopus ID)
Available from: 2022-12-12 Created: 2022-12-12 Last updated: 2024-10-16Bibliographically approved
Pilipiec, P., Samsten, I. & Bota, A. (2023). Surveillance of communicable diseases using social media: A systematic review. PLOS ONE, 18(2), Article ID e0282101.
Open this publication in new window or tab >>Surveillance of communicable diseases using social media: A systematic review
2023 (English)In: PLOS ONE, E-ISSN 1932-6203, Vol. 18, no 2, article id e0282101Article in journal (Refereed) Published
Abstract [en]

Background

Communicable diseases pose a severe threat to public health and economic growth. The traditional methods that are used for public health surveillance, however, involve many drawbacks, such as being labor intensive to operate and resulting in a lag between data collection and reporting. To effectively address the limitations of these traditional methods and to mitigate the adverse effects of these diseases, a proactive and real-time public health surveillance system is needed. Previous studies have indicated the usefulness of performing text mining on social media.

Objective

To conduct a systematic review of the literature that used textual content published to social media for the purpose of the surveillance and prediction of communicable diseases.

Methodology

Broad search queries were formulated and performed in four databases. Both journal articles and conference materials were included. The quality of the studies, operationalized as reliability and validity, was assessed. This qualitative systematic review was guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines.

Results

Twenty-three publications were included in this systematic review. All studies reported positive results for using textual social media content to surveille communicable diseases. Most studies used Twitter as a source for these data. Influenza was studied most frequently, while other communicable diseases received far less attention. Journal articles had a higher quality (reliability and validity) than conference papers. However, studies often failed to provide important information about procedures and implementation.

Conclusion

Text mining of health-related content published on social media can serve as a novel and powerful tool for the automated, real-time, and remote monitoring of public health and for the surveillance and prediction of communicable diseases in particular. This tool can address limitations related to traditional surveillance methods, and it has the potential to supplement traditional methods for public health surveillance.

National Category
Public Health, Global Health and Social Medicine Computer and Information Sciences
Identifiers
urn:nbn:se:su:diva-218070 (URN)10.1371/journal.pone.0282101 (DOI)000972006100150 ()36827297 (PubMedID)2-s2.0-85148900460 (Scopus ID)
Available from: 2023-07-25 Created: 2023-07-25 Last updated: 2025-02-20Bibliographically approved
Svanberg, J., Ardeshiri, T., Samsten, I., Öhman, P., Neidermeyer, P. E., Rana, T., . . . Danielson, M. (2022). Corporate governance performance ratings with machine learning. International Journal of Intelligent Systems in Accounting, Finance & Management, 29(1), 50-68
Open this publication in new window or tab >>Corporate governance performance ratings with machine learning
Show others...
2022 (English)In: International Journal of Intelligent Systems in Accounting, Finance & Management, ISSN 1055-615X, E-ISSN 1099-1174, Vol. 29, no 1, p. 50-68Article in journal (Refereed) Published
Abstract [en]

We use machine learning with a cross-sectional research design to predict governance controversies and to develop a measure of the governance component of the environmental, social, governance (ESG) metrics. Based on comprehensive governance data from 2,517 companies over a period of 10 years and investigating nine machine-learning algorithms, we find that governance controversies can be predicted with high predictive performance. Our proposed governance rating methodology has two unique advantages compared with traditional ESG ratings: it rates companies' compliance with governance responsibilities and it has predictive validity. Our study demonstrates a solution to what is likely the greatest challenge for the finance industry today: how to assess a company's sustainability with validity and accuracy. Prior to this study, the ESG rating industry and the literature have not provided evidence that widely adopted governance ratings are valid. This study describes the only methodology for developing governance performance ratings based on companies' compliance with governance responsibilities and for which there is evidence of predictive validity.

Keywords
artificial intelligence, ESG, governance controversies, machine learning, performance of ESG ratings, prediction, socially responsible investment
National Category
Economics and Business
Identifiers
urn:nbn:se:su:diva-203457 (URN)10.1002/isaf.1505 (DOI)000770351100001 ()2-s2.0-85126475278 (Scopus ID)
Available from: 2022-04-07 Created: 2022-04-07 Last updated: 2022-08-17Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-3056-6801

Search in DiVA

Show all publications