Change search
Link to record
Permanent link

Direct link
Papapetrou, PanagiotisORCID iD iconorcid.org/0000-0002-4632-4815
Publications (10 of 123) Show all publications
Rugolon, F., Randl, K. R., Bampa, M. & Papapetrou, P. (2025). A Workflow for Creating Multimodal Machine Learning Models for Metastasis Predictions in Melanoma Patients. In: Rosa Meo; Fabrizio Silvestri (Ed.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV. Paper presented at International Workshops of ECML PKDD 2023, 18-22 September 2023, Turin, Italy. (pp. 87-102). Springer Nature
Open this publication in new window or tab >>A Workflow for Creating Multimodal Machine Learning Models for Metastasis Predictions in Melanoma Patients
2025 (English)In: Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV / [ed] Rosa Meo; Fabrizio Silvestri, Springer Nature , 2025, p. 87-102Conference paper, Published paper (Refereed)
Abstract [en]

Melanoma is the most common form of skin cancer, responsible for thousands of deaths annually. Novel therapies have been developed, but metastases are still a common problem, increasing the mortality rate and decreasing the quality of life of those who experience them. As traditional machine learning models for metastasis prediction have been limited to the use of a single modality, in this study we aim to explore and compare different unimodal and multimodal machine learning models to predict the onset of metastasis in melanoma patients to help clinicians focus their attention on patients at a higher risk of developing metastasis, increasing the likelihood of an earlier diagnosis.

We use a patient cohort derived from an Electronic Health Record, and we consider various modalities of data, including static, time series, and clinical text. We formulate the problem and propose a multimodal ML workflow for predicting the onset of metastasis in melanoma patients. We evaluate the performance of the workflow based on various classification metrics and statistical significance. The experimental findings suggest that multimodal models outperform the unimodal ones, demonstrating the potential of multimodal ML to predict the onset of metastasis.

Place, publisher, year, edition, pages
Springer Nature, 2025
Series
Communications in Computer and Information Science (CCIS), ISSN 1865-0929, E-ISSN 1865-0937 ; 2136
Keywords
Multimodal predictions, Machine learning, Melanoma, Metastasis, EHR
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-237873 (URN)10.1007/978-3-031-74640-6_7 (DOI)2-s2.0-85215966676 (Scopus ID)978-3-031-74640-6 (ISBN)978-3-031-74639-0 (ISBN)
Conference
International Workshops of ECML PKDD 2023, 18-22 September 2023, Turin, Italy.
Available from: 2025-01-14 Created: 2025-01-14 Last updated: 2025-02-25Bibliographically approved
Kreuzer, T., Papapetrou, P. & Zdravkovic, J. (2025). AI Explainability Methods in Digital Twins: A Model and a Use Case. In: José Borbinha, Tiago Prince Sales, Miguel Mira Da Silva, Henderik A. Proper, Marianne Schnellmann (Ed.), Enterprise Design, Operations, and Computing: 28th International Conference, EDOC 2024, Vienna, Austria, September 10–13, 2024, Revised Selected Papers. Paper presented at 28th International Conference on Enterprise Design, Operations, and Computing (EDOC 2024), Vienna, Austria, September 10-13, 2024 (pp. 3-20). Cham: Springer
Open this publication in new window or tab >>AI Explainability Methods in Digital Twins: A Model and a Use Case
2025 (English)In: Enterprise Design, Operations, and Computing: 28th International Conference, EDOC 2024, Vienna, Austria, September 10–13, 2024, Revised Selected Papers / [ed] José Borbinha, Tiago Prince Sales, Miguel Mira Da Silva, Henderik A. Proper, Marianne Schnellmann, Cham: Springer, 2025, p. 3-20Conference paper, Published paper (Refereed)
Abstract [en]

Digital twin systems can benefit from the integration of artificial intelligence (AI) algorithms for providing for example some predictive capabilities or supporting internal decision-making. As AI algorithms are often opaque, it becomes necessary to explain their decisions to a human operator working with the digital twin. In this study, we investigate the integration of explainable AI techniques with digital twins, which we termed XAI-DT system. We define the concept of XAI-DT system and provide a use case in smart buildings, where explainable AI is used to forecast CO2 concentration. Further, we present a core architectural model for our digital twin, outlining its interaction with the smart building and its internal processing. We evaluate five AI algorithms and compare their explainability for the operator and the entire digital twin model based on standard explainability properties from the literature.

Place, publisher, year, edition, pages
Cham: Springer, 2025
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 15409
National Category
Artificial Intelligence
Identifiers
urn:nbn:se:su:diva-242280 (URN)10.1007/978-3-031-78338-8_1 (DOI)2-s2.0-85219189278 (Scopus ID)978-3-031-78337-1 (ISBN)978-3-031-78338-8 (ISBN)
Conference
28th International Conference on Enterprise Design, Operations, and Computing (EDOC 2024), Vienna, Austria, September 10-13, 2024
Available from: 2025-04-22 Created: 2025-04-22 Last updated: 2025-04-22Bibliographically approved
Ho, V. L., Ho, N., Pedersen, T. B. & Papapetrou, P. (2025). Efficient Generalized Temporal Pattern Mining in Time Series Using Mutual Information. IEEE Transactions on Knowledge and Data Engineering, 37(4), 1753-1771
Open this publication in new window or tab >>Efficient Generalized Temporal Pattern Mining in Time Series Using Mutual Information
2025 (English)In: IEEE Transactions on Knowledge and Data Engineering, ISSN 1041-4347, E-ISSN 1558-2191, Vol. 37, no 4, p. 1753-1771Article in journal (Refereed) Published
Abstract [en]

Big time series are increasingly available from an ever wider range of IoT-enabled sensors deployed in various environments. Significant insights can be gained by mining temporal patterns from these time series. Temporal pattern mining (TPM) extends traditional pattern mining by adding event time intervals into extracted patterns, making them more expressive at the expense of increased time and space complexities. Besides frequent temporal patterns (FTPs), which occur frequently in the entire dataset, another useful type of temporal patterns are so-called rare temporal patterns (RTPs), which appear rarely but with high confidence. Mining rare temporal patterns yields additional challenges. For FTP mining, the temporal information and complex relations between events already create an exponential search space. For RTP mining, the support measure is set very low, leading to a further combinatorial explosion and potentially producing too many uninteresting patterns. Thus, there is a need for a better approach to mine frequent and rare temporal patterns. This paper presents our Generalized Temporal Pattern Mining from Time Series (GTPMfTS) approach that can mine both types of patterns, with the following specific contributions: (1) The end-to-end GTPMfTS process taking time series as input and producing frequent/rare temporal patterns as output. (2) The efficient Generalized Temporal Pattern Mining (GTPM) algorithm mines frequent and rare temporal patterns using efficient data structures for fast retrieval of events and patterns during the mining process, and employs effective pruning techniques for significantly faster mining. (3) An approximate version of GTPM that uses mutual information, a measure of data correlation, to prune unpromising time series from the search space. (4) An extensive experimental evaluation of GTPM for rare temporal pattern mining (RTPM) and frequent temporal pattern mining (FTPM), showing that RTPM and FTPM significantly outperform the baselines on runtime and memory consumption, and can scale to big datasets. The approximate RTPM is up to one order of magnitude, and the approximate FTPM is up to two orders of magnitude, faster than the baselines, while retaining high accuracy.

Keywords
Mutual Information, Rare Temporal Patterns, Temporal Pattern Mining, Time Series
National Category
Computer Sciences
Identifiers
urn:nbn:se:su:diva-240396 (URN)10.1109/TKDE.2025.3526800 (DOI)001439548100028 ()2-s2.0-86000437128 (Scopus ID)
Available from: 2025-03-10 Created: 2025-03-10 Last updated: 2025-04-08Bibliographically approved
Taheri, G., Szalai, M., Habibi, M. & Papapetrou, P. (2025). Unveiling Driver Modules in Lung Cancer: A Clustering-Based Gene-Gene Interaction Network Analysis. In: Rosa Meo, Fabrizio Silvestri (Ed.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV. Paper presented at International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023 (pp. 41-58). Springer
Open this publication in new window or tab >>Unveiling Driver Modules in Lung Cancer: A Clustering-Based Gene-Gene Interaction Network Analysis
2025 (English)In: Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV / [ed] Rosa Meo, Fabrizio Silvestri, Springer, 2025, p. 41-58Conference paper, Published paper (Refereed)
Abstract [en]

Lung cancer, which is the leading cause of cancer-related death worldwide and is characterized by genetic changes and heterogeneity, presents a significant treatment challenge. Existing approaches utilizing Machine Learning (ML) techniques for identifying driver modules lack specificity, particularly for lung cancer. This study addresses this limitation by proposing a novel method that combines gene-gene interaction network construction with ML-based clustering to identify lung cancer-specific driver modules. The methodology involves mapping biological processes to genes and constructing a weighted gene-gene interaction network to identify correlations within gene clusters. A clustering algorithm is then applied to identify potential cancer-driver modules, focusing on biologically relevant modules that contribute to lung cancer development. The results highlight the effectiveness and robustness of the clustering approach, identifying 110 unique clusters ranging in size from 4 to 10. These clusters surpass evaluation requirements and demonstrate significant relevance to critical cancer-related pathways. The identified driver modules hold promise for influencing future approaches to lung cancer diagnosis, prognosis, and treatment. This research expands our understanding of lung cancer and sets the stage for further investigations and potential clinical advancements.

Place, publisher, year, edition, pages
Springer, 2025
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 2136 CCIS
Keywords
Driver Modules, Gene-Gene Interaction Network, Lung Cancer, Machine Learning
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:su:diva-240214 (URN)10.1007/978-3-031-74640-6_4 (DOI)2-s2.0-85216025737 (Scopus ID)9783031746390 (ISBN)
Conference
International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023
Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-06Bibliographically approved
Miliou, I., Piatkowski, N. & Papapetrou, P. (Eds.). (2024). Advances in Intelligent Data Analysis XXII: 22nd International Symposium on Intelligent Data Analysis, IDA 2024, Stockholm, Sweden, April 24–26, 2024, Proceedings, Part I (1ed.). Springer Nature
Open this publication in new window or tab >>Advances in Intelligent Data Analysis XXII: 22nd International Symposium on Intelligent Data Analysis, IDA 2024, Stockholm, Sweden, April 24–26, 2024, Proceedings, Part I
2024 (English)Conference proceedings (editor) (Other academic)
Place, publisher, year, edition, pages
Springer Nature, 2024. p. 268 Edition: 1
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-228399 (URN)10.1007/978-3-031-58547-0 (DOI)978-3-031-58546-3 (ISBN)978-3-031-58547-0 (ISBN)
Available from: 2024-04-16 Created: 2024-04-16 Last updated: 2024-04-17Bibliographically approved
Kreuzer, T., Papapetrou, P. & Zdravkovic, J. (2024). Artificial intelligence in digital twins—A systematic literature review. Data & Knowledge Engineering, 151, Article ID 102304.
Open this publication in new window or tab >>Artificial intelligence in digital twins—A systematic literature review
2024 (English)In: Data & Knowledge Engineering, ISSN 0169-023X, E-ISSN 1872-6933, Vol. 151, article id 102304Article, review/survey (Refereed) Published
Abstract [en]

Artificial intelligence and digital twins have become more popular in recent years and have seen usage across different application domains for various scenarios. This study reviews the literature at the intersection of the two fields, where digital twins integrate an artificial intelligence component. We follow a systematic literature review approach, analyzing a total of 149 related studies. In the assessed literature, a variety of problems are approached with an artificial intelligence-integrated digital twin, demonstrating its applicability across different fields. Our findings indicate that there is a lack of in-depth modeling approaches regarding the digital twin, while many articles focus on the implementation and testing of the artificial intelligence component. The majority of publications do not demonstrate a virtual-to-physical connection between the digital twin and the real-world system. Further, only a small portion of studies base their digital twin on real-time data from a physical system, implementing a physical-to-virtual connection.

Keywords
Artificial intelligence, Digital twin, Machine learning, Literature review, Business intelligence, Data mining
National Category
Computer Sciences
Identifiers
urn:nbn:se:su:diva-231188 (URN)10.1016/j.datak.2024.102304 (DOI)001234973200001 ()2-s2.0-85190070128 (Scopus ID)
Available from: 2024-06-24 Created: 2024-06-24 Last updated: 2024-06-24Bibliographically approved
Wang, Z., Samsten, I., Miliou, I. & Papapetrou, P. (2024). COMET: Constrained Counterfactual Explanations for Patient Glucose Multivariate Forecasting. In: Annual IEEE Symposium on Computer-Based Medical Systems: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024. Paper presented at 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, Guadalajara, Mexico. (pp. 502-507). IEEE (Institute of Electrical and Electronics Engineers)
Open this publication in new window or tab >>COMET: Constrained Counterfactual Explanations for Patient Glucose Multivariate Forecasting
2024 (English)In: Annual IEEE Symposium on Computer-Based Medical Systems: 2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, IEEE (Institute of Electrical and Electronics Engineers) , 2024, p. 502-507Conference paper, Published paper (Refereed)
Abstract [en]

Applying deep learning models for healthcare-related forecasting applications has been widely adopted, such as leveraging glucose monitoring data of diabetes patients to predict hyperglycaemic or hypoglycaemic events. However, most deep learning models are considered black-boxes; hence, the model predictions are not interpretable and may not offer actionable insights into medical practitioners’ decisions. Previous work has shown that counterfactual explanations can be applied in forecasting tasks by suggesting counterfactual changes in time series inputs to achieve the desired forecasting outcome. This study proposes a generalized multivariate forecasting setup of counterfactual generation by introducing a novel approach, COMET, which imposes three domain-specific constraint mechanisms to provide counterfactual explanations for glucose forecasting. Moreover, we conduct the experimental evaluation using two diabetes patient datasets to demonstrate the effectiveness of our proposed approach in generating realistic counterfactual changes in comparison with a baseline approach. Our qualitative analysis evaluates examples to validate that the counterfactual samples are clinically relevant and can effectively lead the patients to achieve a normal range of predicted glucose levels by suggesting changes to the treatment variables.

Place, publisher, year, edition, pages
IEEE (Institute of Electrical and Electronics Engineers), 2024
Series
IEEE International Symposium on Computer-Based Medical Systems, ISSN 2372-918X, E-ISSN 2372-9198
Keywords
Comet, Deep learning, Patents, Time series analysis, Predictive models, Glucose, Diabetes, time series forecasting, blood glucose prediction, counterfactual explanations, deep learning
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-233744 (URN)10.1109/CBMS61543.2024.00089 (DOI)001284700700038 ()2-s2.0-85200437241 (Scopus ID)
Conference
2024 IEEE 37th International Symposium on Computer-Based Medical Systems (CBMS), 26-28 June 2024, Guadalajara, Mexico.
Available from: 2024-09-24 Created: 2024-09-24 Last updated: 2024-10-16Bibliographically approved
García, D., Pérez, D., Papapetrou, P., Díaz, I., A. Cuadrado, A., M. Enguita, J. & Domínguez, M. (2024). Conditioned fully convolutional denoising autoencoder for multi-target NILM. Neural Computing & Applications
Open this publication in new window or tab >>Conditioned fully convolutional denoising autoencoder for multi-target NILM
Show others...
2024 (English)In: Neural Computing & Applications, ISSN 0941-0643, E-ISSN 1433-3058Article in journal (Refereed) Epub ahead of print
Abstract [en]

Energy management requires reliable tools to support decisions aimed at optimising consumption. Advances in data-driven models provide techniques like Non-Intrusive Load Monitoring (NILM), which estimates the energy demand of appliances from total consumption. Common single-target NILM approaches perform energy disaggregation by using separate learned models for each device. However, the use of single-target systems in real scenarios is computationally expensive and can obscure the interpretation of the resulting feedback. This study assesses a conditioned deep neural network built upon a Fully Convolutional Denoising AutoEncoder (FCNdAE) as multi-target NILM model. The network performs multiple disaggregations using a conditioning input that allows the specification of the target appliance. Experiments compare this approach with several single-target and multi-target models using public residential data from households and non-residential data from a hospital facility. Results show that the multi-target FCNdAE model enhances the disaggregation accuracy compared to previous models, particularly in non-residential data, and improves computational efficiency by reducing the number of trainable weights below 2 million and inference time below 0.25 s for several sequence lengths. Furthermore, the conditioning input helps the user to interpret the model and gain insight into its internal behaviour when predicting the energy demand of different appliances.

Keywords
Non-intrusive Load Monitoring, Energy Efficiency, Deep Convolutional Neural Networks, Interpretability, Multi-target NILM models
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-237967 (URN)10.1007/s00521-024-10552-0 (DOI)2-s2.0-85211443979 (Scopus ID)
Available from: 2025-01-16 Created: 2025-01-16 Last updated: 2025-01-16
Wang, Z., Miliou, I., Samsten, I. & Papapetrou, P. (2024). Counterfactual Explanations for Time Series Forecasting. In: 2023 IEEE International Conference on Data Mining (ICDM): . Paper presented at IEEE International Conference on Data Mining (ICDM), 1-4 December 2023, Shanghai, China. (pp. 1391-1396). IEEE conference proceedings
Open this publication in new window or tab >>Counterfactual Explanations for Time Series Forecasting
2024 (English)In: 2023 IEEE International Conference on Data Mining (ICDM), IEEE conference proceedings , 2024, p. 1391-1396Conference paper, Published paper (Refereed)
Abstract [en]

Among recent developments in time series forecasting methods, deep forecasting models have gained popularity as they can utilize hidden feature patterns in time series to improve forecasting performance. Nevertheless, the majority of current deep forecasting models are opaque, hence making it challenging to interpret the results. While counterfactual explanations have been extensively employed as a post-hoc approach for explaining classification models, their application to forecasting models still remains underexplored. In this paper, we formulate the novel problem of counterfactual generation for time series forecasting, and propose an algorithm, called ForecastCF, that solves the problem by applying gradient-based perturbations to the original time series. The perturbations are further guided by imposing constraints to the forecasted values. We experimentally evaluate ForecastCF using four state-of-the-art deep model architectures and compare to two baselines. ForecastCF outperforms the baselines in terms of counterfactual validity and data manifold closeness, while generating meaningful and relevant counterfactuals for various forecasting tasks.

Place, publisher, year, edition, pages
IEEE conference proceedings, 2024
Series
IEEE International Conference on Data Mining. Proceedings, ISSN 1550-4786, E-ISSN 2374-8486
Keywords
Time series forecasting, Counterfactual explanations, Model interpretability, Deep learning
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-226602 (URN)10.1109/ICDM58522.2023.00180 (DOI)001165180100171 ()2-s2.0-85185401353 (Scopus ID)979-8-3503-0788-7 (ISBN)
Conference
IEEE International Conference on Data Mining (ICDM), 1-4 December 2023, Shanghai, China.
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-11-14Bibliographically approved
Kuratomi Hernandez, A., Lee, Z., Tsaparas, P., Junior, G. D., Pitoura, E., Lindgren, T. & Papapetrou, P. (2024). CounterFair: Group Counterfactuals for Bias Detection, Mitigation and Subgroup Identification. In: Elena Baralis; Kun Zhang; Ernesto Damiani; Meroane Debbah; Panos Kalnis; Xindong Wu (Ed.), Proceedings 24th IEEE International Conference on Data Mining: ICDM 2024. Paper presented at 24th IEEE International Conference on Data Mining (ICDM 2024), Abu Dhabi, United Arab Emirates, 9-12 December, 2024 (pp. 181-190). IEEE
Open this publication in new window or tab >>CounterFair: Group Counterfactuals for Bias Detection, Mitigation and Subgroup Identification
Show others...
2024 (English)In: Proceedings 24th IEEE International Conference on Data Mining: ICDM 2024 / [ed] Elena Baralis; Kun Zhang; Ernesto Damiani; Meroane Debbah; Panos Kalnis; Xindong Wu, IEEE, 2024, p. 181-190Conference paper, Published paper (Refereed)
Abstract [en]

Counterfactual explanations can be used as a means to explain a models decision process and to provide recommendations to users on how to improve their current status. The difficulty to apply these counterfactual recommendations from the users perspective, also known as burden, may be used to assess the models algorithmic fairness and to provide fair recommendations among different sensitive feature groups. We propose a novel model-agnostic, mathematical programming-based, group counterfactual algorithm that can: (1) detect biases via group counterfactual burden, (2) produce fair recommendations among sensitive groups and (3) identify relevant subgroups of instances through shared counterfactuals. We analyze these capabilities from the perspective of recourse fairness, and empirically compare our proposed method with the state-of-the-art algorithms for group counterfactual generation in order to assess the bias identification and the capabilities in group counterfactual effectiveness and burden minimization.

Place, publisher, year, edition, pages
IEEE, 2024
Keywords
Counterfactual explanations, Algorithmic Fairness, Group counterfactuals, Local explainability
National Category
Computer Systems
Identifiers
urn:nbn:se:su:diva-233353 (URN)10.1109/ICDM59182.2024.00025 (DOI)2-s2.0-86000228096 (Scopus ID)979-8-3315-0668-1 (ISBN)979-8-3315-0669-8 (ISBN)
Conference
24th IEEE International Conference on Data Mining (ICDM 2024), Abu Dhabi, United Arab Emirates, 9-12 December, 2024
Available from: 2024-09-09 Created: 2024-09-09 Last updated: 2025-04-28Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-4632-4815

Search in DiVA

Show all publications