Change search
Link to record
Permanent link

Direct link
Chaliane Junior, Guilherme DinisORCID iD iconorcid.org/0000-0001-8492-761X
Publications (6 of 6) Show all publications
Kuratomi Hernandez, A., Lee, Z., Tsaparas, P., Pitoura, E., Lindgren, T., Chaliane Junior, G. D. & Papapetrou, P. (2025). Subgroup fairness based on shared counterfactuals. Knowledge and Information Systems
Open this publication in new window or tab >>Subgroup fairness based on shared counterfactuals
Show others...
2025 (English)In: Knowledge and Information Systems, ISSN 0219-1377, E-ISSN 0219-3116Article in journal (Refereed) Epub ahead of print
Abstract [en]

CounterFair is a group counterfactual search algorithm that detects and minimizes biases among sensitive groups and identifies relevant subgroups inside these sensitive groups based on shared counterfactual instances. We investigate the latter capability, analyzing the found subgroups from the perspective of fairness based on counterfactual reasoning, in order to evaluate whether they present different biases with respect to each other and to the sensitive feature groups they belong to. We perform these measurements on the subgroups extracted by CounterFair over six binary classification datasets, providing figures and their respective analysis on the presence of bias.

Keywords
Bias, Counterfactual, Explainability, Fairness, Subgroups
National Category
Human Computer Interaction
Identifiers
urn:nbn:se:su:diva-247063 (URN)10.1007/s10115-025-02555-7 (DOI)001551587300001 ()2-s2.0-105013550551 (Scopus ID)
Available from: 2025-09-25 Created: 2025-09-25 Last updated: 2025-09-25
Kuratomi Hernandez, A., Lee, Z., Tsaparas, P., Junior, G. D., Pitoura, E., Lindgren, T. & Papapetrou, P. (2024). CounterFair: Group Counterfactuals for Bias Detection, Mitigation and Subgroup Identification. In: Elena Baralis; Kun Zhang; Ernesto Damiani; Meroane Debbah; Panos Kalnis; Xindong Wu (Ed.), Proceedings 24th IEEE International Conference on Data Mining: ICDM 2024. Paper presented at 24th IEEE International Conference on Data Mining (ICDM 2024), Abu Dhabi, United Arab Emirates, 9-12 December, 2024 (pp. 181-190). IEEE
Open this publication in new window or tab >>CounterFair: Group Counterfactuals for Bias Detection, Mitigation and Subgroup Identification
Show others...
2024 (English)In: Proceedings 24th IEEE International Conference on Data Mining: ICDM 2024 / [ed] Elena Baralis; Kun Zhang; Ernesto Damiani; Meroane Debbah; Panos Kalnis; Xindong Wu, IEEE, 2024, p. 181-190Conference paper, Published paper (Refereed)
Abstract [en]

Counterfactual explanations can be used as a means to explain a models decision process and to provide recommendations to users on how to improve their current status. The difficulty to apply these counterfactual recommendations from the users perspective, also known as burden, may be used to assess the models algorithmic fairness and to provide fair recommendations among different sensitive feature groups. We propose a novel model-agnostic, mathematical programming-based, group counterfactual algorithm that can: (1) detect biases via group counterfactual burden, (2) produce fair recommendations among sensitive groups and (3) identify relevant subgroups of instances through shared counterfactuals. We analyze these capabilities from the perspective of recourse fairness, and empirically compare our proposed method with the state-of-the-art algorithms for group counterfactual generation in order to assess the bias identification and the capabilities in group counterfactual effectiveness and burden minimization.

Place, publisher, year, edition, pages
IEEE, 2024
Keywords
Counterfactual explanations, Algorithmic Fairness, Group counterfactuals, Local explainability
National Category
Computer Systems
Identifiers
urn:nbn:se:su:diva-233353 (URN)10.1109/ICDM59182.2024.00025 (DOI)2-s2.0-86000228096 (Scopus ID)979-8-3315-0668-1 (ISBN)979-8-3315-0669-8 (ISBN)
Conference
24th IEEE International Conference on Data Mining (ICDM 2024), Abu Dhabi, United Arab Emirates, 9-12 December, 2024
Available from: 2024-09-09 Created: 2024-09-09 Last updated: 2025-04-28Bibliographically approved
Chaliane Junior, G. D., Magnússon, S. & Hollmén, J. (2024). Policy Control with Delayed, Aggregate, and Anonymous Feedback. In: Albert Bifet, Jesse Davis, Tomas Krilavičius, Meelis Kull, Eirini Ntoutsi, Indrė Žliobaitė (Ed.), Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part VI: . Paper presented at Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024. (pp. 389-406). Springer Nature
Open this publication in new window or tab >>Policy Control with Delayed, Aggregate, and Anonymous Feedback
2024 (English)In: Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024, Proceedings, Part VI / [ed] Albert Bifet, Jesse Davis, Tomas Krilavičius, Meelis Kull, Eirini Ntoutsi, Indrė Žliobaitė, Springer Nature , 2024, p. 389-406Conference paper, Published paper (Refereed)
Abstract [sv]

Reinforcement learning algorithms have a dependency on observing rewards for actions taken. The relaxed setting of having fully observable rewards, however, can be infeasible in certain scenarios, due to either cost or the nature of the problem. Of specific interest here is the challenge of learning a policy when rewards are delayed, aggregated, and anonymous (DAAF). A problem which has been addressed in bandits literature and, to the best of our knowledge, to a lesser extent in the more general reinforcement learning (RL) setting. We introduce a novel formulation that mirrors scenarios encountered in real-world applications, characterized by intermittent and aggregated reward observations. To address these constraints, we develop four new algorithms: one employs least squares for true reward estimation; two and three adapt Q-learning and SARSA, to deal with our unique setting; and the fourth leverages a policy with options framework. Through a thorough and methodical experimental analysis, we compare these methodologies, demonstrating that three of them can approximate policies nearly as effectively as those derived from complete information scenarios, albeit with minimal performance degradation due to informational constraints. Our findings pave the way for more robust RL applications in environments with limited reward feedback.

Place, publisher, year, edition, pages
Springer Nature, 2024
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-237094 (URN)10.1007/978-3-031-70365-2_23 (DOI)001330395900023 ()2-s2.0-85203879812 (Scopus ID)978-3-031-70364-5 (ISBN)978-3-031-70365-2 (ISBN)
Conference
Machine Learning and Knowledge Discovery in Databases. Research Track: European Conference, ECML PKDD 2024, Vilnius, Lithuania, September 9–13, 2024.
Available from: 2024-12-10 Created: 2024-12-10 Last updated: 2025-02-06Bibliographically approved
Baran, B., Chaliane Junior, G. D., Danylenko, A., Sunday Folorunso, O., Forsum, G., Lefarov, M., . . . Zhao, Y. (2023). Accelerating Creator Audience Building through Centralized Exploration. In: Jie Zhang; Li Chen; Shlomo Berkovsky; Min Zhang; Tommaso di Noia; Justin Basilico; Luiz Pizzato; Yang Song (Ed.), RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems. Paper presented at RecSys '23: Seventeenth ACM Conference on Recommender Systems, 18-22 September 2023, Singapore, Singapore. (pp. 70-73). Association for Computing Machinery (ACM)
Open this publication in new window or tab >>Accelerating Creator Audience Building through Centralized Exploration
Show others...
2023 (English)In: RecSys '23: Proceedings of the 17th ACM Conference on Recommender Systems / [ed] Jie Zhang; Li Chen; Shlomo Berkovsky; Min Zhang; Tommaso di Noia; Justin Basilico; Luiz Pizzato; Yang Song, Association for Computing Machinery (ACM) , 2023, p. 70-73Conference paper, Published paper (Refereed)
Abstract [en]

On Spotify, multiple recommender systems enable personalized user experiences across a wide range of product features. These systems are owned by different teams and serve different goals, but all of these systems need to explore and learn about new content as it appears on the platform. In this work, we describe ongoing efforts at Spotify to develop an efficient solution to this problem, by centralizing content exploration and providing signals to existing, decentralized recommendation systems (a.k.a. exploitation systems). We take a creator-centric perspective, and argue that this approach can dramatically reduce the time it takes for new content to reach its full potential.

Place, publisher, year, edition, pages
Association for Computing Machinery (ACM), 2023
National Category
Information Systems
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-238222 (URN)10.1145/3604915.3608880 (DOI)001156630300006 ()2-s2.0-85174486568 (Scopus ID)9798400702419 (ISBN)
Conference
RecSys '23: Seventeenth ACM Conference on Recommender Systems, 18-22 September 2023, Singapore, Singapore.
Available from: 2025-01-17 Created: 2025-01-17 Last updated: 2025-01-20Bibliographically approved
Movin, M., Chaliane Junior, G. D., Hollmén, J. & Papapetrou, P. (2023). Explaining Black Box Reinforcement Learning Agents Through Counterfactual Policies. In: Bruno Crémilleux; Sibylle Hess; Siegfried Nijssen (Ed.), Advances in Intelligent Data Analysis XXI: 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023, Proceedings. Paper presented at Advances in Intelligent Data Analysis XXI, 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023. (pp. 314-326). Springer
Open this publication in new window or tab >>Explaining Black Box Reinforcement Learning Agents Through Counterfactual Policies
2023 (English)In: Advances in Intelligent Data Analysis XXI: 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023, Proceedings / [ed] Bruno Crémilleux; Sibylle Hess; Siegfried Nijssen, Springer , 2023, p. 314-326Conference paper, Published paper (Refereed)
Abstract [en]

Despite the increased attention to explainable AI, explainability methods for understanding reinforcement learning (RL) agents have not been extensively studied. Failing to understand the agent’s behavior may cause reduced productivity in human-agent collaborations, or mistrust in automated RL systems. RL agents are trained to optimize a long term cumulative reward, and in this work we formulate a novel problem on how to generate explanations on when an agent could have taken another action to optimize an alternative reward. More concretely, we aim at answering the question: What does an RL agent need to do differently to achieve an alternative target outcome? We introduce the concept of a counterfactual policy, as a policy trained to explain in which states a black box agent could have taken an alternative action to achieve another desired outcome. The usefulness of counterfactual policies is demonstrated in two experiments with different use-cases, and the results suggest that our solution can provide interpretable explanations.

Place, publisher, year, edition, pages
Springer, 2023
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349
Keywords
Explainable AI (XAI), Reinforcement Learning, Counterfactual Explanations
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-225173 (URN)10.1007/978-3-031-30047-9_25 (DOI)2-s2.0-85152589358 (Scopus ID)978-3-031-30046-2 (ISBN)978-3-031-30047-9 (ISBN)
Conference
Advances in Intelligent Data Analysis XXI, 21st International Symposium on Intelligent Data Analysis, IDA 2023, Louvain-la-Neuve, Belgium, April 12–14, 2023.
Available from: 2024-01-09 Created: 2024-01-09 Last updated: 2024-01-10Bibliographically approved
Chaliane Junior, G. D., Magnússon, S. & Hollmén, J. (2022). Policy Evaluation with Delayed, Aggregated Anonymous Feedback. In: Poncelet Pascal; Dino Ienco (Ed.), Discovery Science: 25th International Conference, DS 2022, Montpellier, France, October 10–12, 2022, Proceedings. Paper presented at 25th International Conference, DS 2022, Montpellier, France, October 10–12, 2022 (pp. 114-123). Springer Nature
Open this publication in new window or tab >>Policy Evaluation with Delayed, Aggregated Anonymous Feedback
2022 (English)In: Discovery Science: 25th International Conference, DS 2022, Montpellier, France, October 10–12, 2022, Proceedings / [ed] Poncelet Pascal; Dino Ienco, Springer Nature , 2022, p. 114-123Conference paper, Published paper (Refereed)
Abstract [en]

In reinforcement learning, an agent makes decisions to maximize rewards in an environment. Rewards are an integral part of the reinforcement learning as they guide the agent towards its learning objective. However, having consistent rewards can be infeasible in certain scenarios, due to either cost, the nature of the problem or other constraints. In this paper, we investigate the problem of delayed, aggregated, and anonymous rewards. We propose and analyze two strategies for conducting policy evaluation under cumulative periodic rewards, and study them by making use of simulation environments. Our findings indicate that both strategies can achieve similar sample efficiency as when we have consistent rewards.

Place, publisher, year, edition, pages
Springer Nature, 2022
Series
Lecture Notes in Computer Science (LNCS), ISSN 0302-9743, E-ISSN 1611-3349 ; 13601
Keywords
Reinforcement learning, Markov Decision Process (MDP), Reward estimation
National Category
Computer Sciences
Research subject
Computer and Systems Sciences
Identifiers
urn:nbn:se:su:diva-213202 (URN)10.1007/978-3-031-18840-4_9 (DOI)2-s2.0-85142725312 (Scopus ID)978-3-031-18839-8 (ISBN)978-3-031-18840-4 (ISBN)
Conference
25th International Conference, DS 2022, Montpellier, France, October 10–12, 2022
Available from: 2022-12-22 Created: 2022-12-22 Last updated: 2023-01-04Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-8492-761X

Search in DiVA

Show all publications