Change search
Link to record
Permanent link

Direct link
Publications (10 of 42) Show all publications
Andersson, P. & Lindholm, M. (2026). Mortality Forecasting Using Variational Inference. Journal of Forecasting, 45(3), 1069-1076
Open this publication in new window or tab >>Mortality Forecasting Using Variational Inference
2026 (English)In: Journal of Forecasting, ISSN 0277-6693, E-ISSN 1099-131X, Vol. 45, no 3, p. 1069-1076Article in journal (Refereed) Published
Abstract [en]

This paper considers the problem of forecasting mortality rates. A large number of models have already been proposed for this task, but they generally have the disadvantage of either estimating the model in a two-step process, possibly losing efficiency, or relying on methods that are cumbersome for the practitioner to use. We instead propose using variational inference and the probabilistic programming library Pyro for estimating the model. This allows for flexibility in modelling assumptions while still being able to estimate the full model in one step. The models are fitted on Swedish mortality data, and we find that the in-sample fit is good and that the forecasting performance is better than other popular models. Code is available online (https://github.com/LPAndersson/VImortality).

Keywords
hidden Markov model, mortality forecasting, nonlinear state-space models, variational inference
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-251011 (URN)10.1002/for.70078 (DOI)001631210500001 ()2-s2.0-105023975704 (Scopus ID)
Available from: 2026-01-21 Created: 2026-01-21 Last updated: 2026-03-25Bibliographically approved
Lindholm, M., Richman, R., Tsanakas, A. & Wüthrich, M. V. (2026). Sensitivity-based measures of discrimination in insurance pricing. European Journal of Operational Research, 333(2), 601-615
Open this publication in new window or tab >>Sensitivity-based measures of discrimination in insurance pricing
2026 (English)In: European Journal of Operational Research, ISSN 0377-2217, E-ISSN 1872-6860, Vol. 333, no 2, p. 601-615Article in journal (Refereed) Published
Abstract [en]

Different notions of fairness and discrimination have been extensively discussed in the machine learning, operations research, and insurance pricing literatures. As not all fairness criteria can be concurrently satisfied, metrics are needed that allow assessing the materiality of discriminatory effects and the trade-offs between various criteria. Methods from sensitivity analysis have been deployed for the measurement of demographic unfairness, that is, the statistical dependence of risk predictions on protected attributes. We produce a sensitivity-based measure for the distinct phenomenon of proxy discrimination, referring to the implicit inference of protected attributes from other covariates. For this, we first define a set of admissible prices that avoid proxy discrimination. Then, the measure is defined as the normalised L2-distance of a price from the closest element in that set. We use variance-based sensitivity analysis and Shapley values to attribute the proxy discrimination measure to individual (or subsets of) covariates and investigate how properties of the data generating process are reflected in those metrics. Furthermore, we build on the global (i.e., portfolio-wide) measures of demographic unfairness and proxy discrimination to propose local (i.e., instance- or policyholder-specific) measures, which allow a fine-grained understanding of discriminatory effects. Finally, we apply the methods developed in the paper to a real-world insurance dataset, where ethnicity is a protected variable. We observe substantial proxy-discriminatory effects for one ethnic group and identify the key variables driving this.

Keywords
Algorithmic fairness, Demographic parity, Insurance pricing, Proxy discrimination, Sensitivity analysis
National Category
Other Mathematics
Identifiers
urn:nbn:se:su:diva-252482 (URN)10.1016/j.ejor.2026.01.021 (DOI)2-s2.0-105028451341 (Scopus ID)
Available from: 2026-02-12 Created: 2026-02-12 Last updated: 2026-05-27Bibliographically approved
Zakrisson, H. & Lindholm, M. (2025). A tree-based varying coefficient model. Computational Statistics, 40, 5105-5134
Open this publication in new window or tab >>A tree-based varying coefficient model
2025 (English)In: Computational Statistics, ISSN 0943-4062, E-ISSN 1613-9658, Vol. 40, p. 5105-5134Article in journal (Refereed) Published
Abstract [en]

The paper introduces a tree-based varying coefficient model (VCM) where the varying coefficients are modelled using the cyclic gradient boosting machine (CGBM) from Delong et al. (On cyclic gradient boosting machines, 2023). Modelling the coefficient functions using a CGBM allows for dimension-wise early stopping and feature importance scores. The dimension-wise early stopping not only reduces the risk of dimension-specific overfitting, but also reveals differences in model complexity across dimensions. The use of feature importance scores allows for simple feature selection and easy model interpretation. The model is evaluated on the same simulated and real data examples as those used in Richman and Wüthrich (Scand Actuar J 2023:71–95, 2023), and the results show that it produces results in terms of out of sample loss that are comparable to those of their neural network-based VCM called LocalGLMnet.

Keywords
Generalised linear models, Multivariate gradient boosting, Feature selection, Interaction efects, Early stopping
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-226747 (URN)10.1007/s00180-025-01603-8 (DOI)001412768400001 ()2-s2.0-85217704687 (Scopus ID)
Available from: 2024-02-19 Created: 2024-02-19 Last updated: 2026-03-19Bibliographically approved
Lindholm, M. & Wüthrich, M. V. (2025). The balance property in insurance pricing. Scandinavian Actuarial Journal
Open this publication in new window or tab >>The balance property in insurance pricing
2025 (English)In: Scandinavian Actuarial Journal, ISSN 0346-1238, E-ISSN 1651-2030Article in journal (Refereed) Epub ahead of print
Abstract [en]

Unbiasedness, auto-calibration and the balance property are three important features that actuarial pricing algorithms should satisfy. Unbiasedness is a global property that describes price levels on average across the entire insurance portfolio. Auto-calibration is a local unbiasedness property that describes price levels on average on price cohorts. The balance property is a global property that describes price levels point-wise. This paper focuses on the balance property. It is the easiest of the three to verify and to rectify, and it has the nice interpretation in terms of an allocation of the totally observed claims cost. We provide different methods of rectifying the balance property, and we discuss how these methods impact prediction accuracy. We find that there is not one method that is generally better than the others, thus, the chosen optimal method is a case by case choice.

Keywords
actuarial pricing, auto-calibration, balance property, generalized linear model, GLM, maximum likelihood estimation, MLE, regression, Unbiasedness
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-247921 (URN)10.1080/03461238.2025.2552909 (DOI)001567996900001 ()2-s2.0-105016650872 (Scopus ID)
Available from: 2025-10-08 Created: 2025-10-08 Last updated: 2025-11-10
Lindholm, M., Richman, R., Tsanakas, A. & Wuthrich, M. V. (2024). A multi-task network approach for calculating discrimination-free insurance prices. European Actuarial Journal, 14, 329-369
Open this publication in new window or tab >>A multi-task network approach for calculating discrimination-free insurance prices
2024 (English)In: European Actuarial Journal, ISSN 2190-9733, E-ISSN 2190-9741, Vol. 14, p. 329-369Article in journal (Refereed) Published
Abstract [en]

In applications of predictive modeling, such as insurance pricing, indirect or proxy discrimination is an issue of major concern. Namely, there exists the possibility that protected policyholder characteristics are implicitly inferred from non-protected ones by predictive models and are thus having an undesirable (and possibly illegal) impact on prices. A technical solution to this problem relies on building a best-estimate model using all policyholder characteristics (including protected ones) and then averaging out the protected characteristics for calculating individual prices. However, such an approach requires full knowledge of policyholders' protected characteristics, which may in itself be problematic. Here, we address this issue by using a multi-task neural network architecture for claim predictions, which can be trained using only partial information on protected characteristics and produces prices that are free from proxy discrimination. We demonstrate the proposed method on both synthetic data and a real-world motor claims dataset, in which proxy discrimination can be observed. In both examples we find that the predictive accuracy of the multi-task network is comparable to a conventional feed-forward neural network, when the protected information is available for at least half of the insurance policies. However, the multi-task network has superior performance in the case when the protected information is known for less than half of the insurance policyholders.

Keywords
Indirect discrimination, Proxy discrimination, Discrimination-free insurance pricing, Unawareness price, Best-estimate price, Protected information, Discriminatory covariates, Fairness, Incomplete information, Multi-task learning, Multi-output network
National Category
Business Administration
Identifiers
urn:nbn:se:su:diva-224229 (URN)10.1007/s13385-023-00367-z (DOI)001098104000001 ()2-s2.0-85175970588 (Scopus ID)
Available from: 2023-12-05 Created: 2023-12-05 Last updated: 2025-02-20Bibliographically approved
Lindholm, M. & Palmquist, J. (2024). Black-box guided generalised linear model building with non-life pricing applications. Annals of Actuarial Science, 18(3), 675-691
Open this publication in new window or tab >>Black-box guided generalised linear model building with non-life pricing applications
2024 (English)In: Annals of Actuarial Science, ISSN 1748-4995, E-ISSN 1748-5002, Vol. 18, no 3, p. 675-691Article in journal (Refereed) Published
Abstract [en]

The paper introduces a method for creating a categorical generalized linear model (GLM) based on information extracted from a given black-box predictor. The procedure for creating the guided GLM is as follows: For each covariate, including interactions, a covariate partition is created using partial dependence functions calculated based on the given black-box predictor. In order to enhance the predictive performance, an auto-calibration step is used to determine which parts of each covariate partition should be kept, and which parts should be merged. Given the covariate and interaction partitions, a standard categorical GLM is fitted using a lasso penalty. The performance of the proposed method is illustrated using a number of real insurance data sets where gradient boosting machine (GBM) models are used as black-box reference models. From these examples, it is seen that the predictive performance of the guided GLMs is very close to that of the corresponding reference GBMs. Further, in the examples, the guided GLMs have few parameters, making the resulting models easy to interpret. In the numerical illustrations techniques are used to, e.g., identify important interactions both locally and globally, which is essential when, e.g., constructing a tariff.

Keywords
auto-calibration, black-box models, categorical GLM, feature extraction, regularisation
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-240855 (URN)10.1017/S1748499524000265 (DOI)001370873700001 ()2-s2.0-85211630562 (Scopus ID)
Available from: 2025-03-17 Created: 2025-03-17 Last updated: 2025-03-17Bibliographically approved
Fahrenwaldt, M., Furrer, C., Hiabu, M. E., Huang, F., Jørgensen, F. H., Lindholm, M., . . . Tsanakas, A. (2024). Fairness: plurality, causality, and insurability. European Actuarial Journal, 14(2), 317-328
Open this publication in new window or tab >>Fairness: plurality, causality, and insurability
Show others...
2024 (English)In: European Actuarial Journal, ISSN 2190-9733, E-ISSN 2190-9741, Vol. 14, no 2, p. 317-328Article in journal (Refereed) Published
Abstract [en]

This article summarizes the main topics, findings, and avenues for future work from the workshop Fairness with a view towards insurance held August 2023 in Copenhagen, Denmark.

Keywords
Artificial intelligence, Discrimination, Insurance, Machine learning
National Category
Mathematics Computer Sciences
Identifiers
urn:nbn:se:su:diva-235573 (URN)10.1007/s13385-024-00387-3 (DOI)001250260500001 ()2-s2.0-85196256546 (Scopus ID)
Available from: 2024-11-18 Created: 2024-11-18 Last updated: 2024-11-18Bibliographically approved
Lindholm, M. & Nazar, T. (2024). On duration effects in non-life insurance pricing. European Actuarial Journal, 14, 809-832
Open this publication in new window or tab >>On duration effects in non-life insurance pricing
2024 (English)In: European Actuarial Journal, ISSN 2190-9733, E-ISSN 2190-9741, Vol. 14, p. 809-832Article in journal (Refereed) Published
Abstract [en]

The paper discusses duration effects on the consistency of mean parameter and dispersion parameter estimators in exponential dispersion families (EDFs) that are the standard models used for non-life insurance pricing. Focus is on the standard generalised linear model assumptions where both the mean and variance, conditional on duration, are linear functions in terms of duration. We derive simple convergence results that highlight consequences when the linear conditional moment assumptions are not satisfied. These results illustrate that: (i) the resulting mean estimators always have a relevant asymptotic interpretation in terms of the duration adjusted actuarially fair premium—a premium that only agrees with the standard actuarial premium using a duration equal to one, given that the expected value is linear in the duration; (ii) deviance based estimators of the dispersion parameter in an EDF should be avoided in favour of Pearson estimators; (iii) unless the linear moment assumptions are satisfied, consistency of dispersion and plug-in variance estimators can not be guaranteed and may result in spurious over-dispersion. The results provide explicit conditions on the underlying data generating process that will lead to spurious over-dispersion that can be used for model checking. This is illustrated based on real insurance data, where it is concluded that the linear moment assumptions are violated, which results in non-negligible spurious over-dispersion. 

Keywords
Consistency, Duration, Exponential dispersion family, Generalised linear model, Over-dispersion, Dispersion estimators
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-231237 (URN)10.1007/s13385-024-00385-5 (DOI)001232118300001 ()2-s2.0-85194554249 (Scopus ID)
Available from: 2024-06-24 Created: 2024-06-24 Last updated: 2025-02-21Bibliographically approved
Lindholm, M., Richman, R., Tsanakas, A. & Wüthrich, M. V. (2024). What is fair? Proxy discrimination vs. demographic disparities in insurance pricing. Scandinavian Actuarial Journal, 2024(9), 935-970
Open this publication in new window or tab >>What is fair? Proxy discrimination vs. demographic disparities in insurance pricing
2024 (English)In: Scandinavian Actuarial Journal, ISSN 0346-1238, E-ISSN 1651-2030, Vol. 2024, no 9, p. 935-970Article in journal (Refereed) Published
Abstract [en]

Discrimination and fairness are major concerns in algorithmic models. Thisis particularly true in insurance, where protected policyholder attributes arenot allowed to be used for insurance pricing. Simply disregarding protectedpolicyholder attributes is not an appropriate solution as this still allows forthe possibility of inferring protected attributes from non-protected covari-ates, leading to the phenomenon of proxy discrimination. Although proxydiscrimination is qualitatively different from the group fairness conceptsdiscussed in the machine learning and actuarial literature, group fairnesscriteria have been proposed to control the impact of protected attributeson the calculation of insurance prices. The purpose of this paper is to discussthe relationship between direct and proxy discrimination in insurance andthe most popular group fairness axioms. We provide a technical definitionof proxy discrimination and derive incompatibility results, showing thatavoiding proxy discrimination does not imply satisfying group fairness andvice versa. This shows that the two concepts are materially different. Fur-thermore, we discuss input data pre-processing and model post-processingmethods that achieve group fairness in the sense of demographic parity.As these methods induce transformations that explicitly depend on poli-cyholders’ protected attributes, it becomes ambiguous whether direct andproxy discrimination is, in fact, avoided.

Keywords
Discrimination, fairness, indirect discrimination, input preprocessing, optimal transport, output post-processing, proxy discrimination, Wasserstein distance
National Category
Economics
Identifiers
urn:nbn:se:su:diva-236066 (URN)10.1080/03461238.2024.2364741 (DOI)001250509200001 ()2-s2.0-85196401999 (Scopus ID)
Available from: 2024-12-06 Created: 2024-12-06 Last updated: 2024-12-06Bibliographically approved
Lindholm, M., Lindskog, F. & Palmquist, J. (2023). Local bias adjustment, duration-weighted probabilities, and automatic construction of tariff cells. Scandinavian Actuarial Journal, 2023(10), 946-973
Open this publication in new window or tab >>Local bias adjustment, duration-weighted probabilities, and automatic construction of tariff cells
2023 (English)In: Scandinavian Actuarial Journal, ISSN 0346-1238, E-ISSN 1651-2030, Vol. 2023, no 10, p. 946-973Article in journal (Refereed) Published
Abstract [en]

We study non-life insurance pricing and present a general procedure for constructing a distribution-free locally unbiased predictor of the risk premium based on any initially suggested predictor. The resulting predictor is piecewise constant, corresponding to a partition of the covariate space, and by construction auto-calibrated. Two key issues are the appropriate partitioning of the covariate space and the handling of randomly varying durations, acknowledging possible early termination of contracts. A basic idea in the present paper is to partition the predictions from the initial predictor, which as a by-product defines a partition of the covariate space. Two different approaches to create partitions are discussed in detail using (i) duration-weighted equal-probability binning, and (ii) binning by duration-weighted regression trees. Given a partitioning procedure, the size of the partition to be used is obtained using cross-validation. In this way we obtain an automatic data-driven tariffication procedure, where the number of tariff cells corresponds to the size of the partition. We illustrate the procedure based on both simulated and real insurance data, using both simple GLMs and GBMs as initial predictors. The resulting tariffs are shown to have a rather small number of tariff cells while maintaining or improving the predictive performance compared to the initial predictors.

Keywords
Local bias adjustment, duration-weighted probabilities, non-life pricing, automatic tariffication
National Category
Mathematics Sociology
Identifiers
urn:nbn:se:su:diva-215531 (URN)10.1080/03461238.2023.2176251 (DOI)000935519500001 ()2-s2.0-85148100276 (Scopus ID)
Available from: 2023-03-16 Created: 2023-03-16 Last updated: 2023-10-12Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0001-7235-384x

Search in DiVA

Show all publications