Change search
Link to record
Permanent link

Direct link
Publications (10 of 11) Show all publications
Dörr, P. & Heiny, J. (2025). Joint Extremes of Inversions and Descents of Random Permutations. Journal of theoretical probability, 38(2), Article ID 42.
Open this publication in new window or tab >>Joint Extremes of Inversions and Descents of Random Permutations
2025 (English)In: Journal of theoretical probability, ISSN 0894-9840, E-ISSN 1572-9230, Vol. 38, no 2, article id 42Article in journal (Refereed) Published
Abstract [en]

We provide asymptotic theory for the joint distribution of Xinv and Xdes, the numbers of inversions and descents of random permutations. Recently, [14] proved that Xinv, respectively, Xdes, is in the maximum domain of attraction of the Gumbel distribution. To tackle the dependency between these two permutation statistics, we use Hájek projections and a suitable quantitative Gaussian approximation. We show that (Xinv,Xdes) is in the maximum domain of attraction of the two-dimensional Gumbel distribution with independent margins. This result can be stated in the broader combinatorial framework of finite Coxeter groups, on which our method also yields the central limit theorem for (Xinv,Xdes) and various other permutation statistics as a novel contribution. In particular, signed permutation groups with random biased signs and products of classical Weyl groups are investigated.

Keywords
Central limit theorem, Coxeter group, Extreme value theory, Joint distribution, Maximum, Permutation statistics
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-241846 (URN)10.1007/s10959-025-01407-y (DOI)001448979100002 ()2-s2.0-105000226546 (Scopus ID)
Available from: 2025-04-09 Created: 2025-04-09 Last updated: 2025-04-09Bibliographically approved
Dörnemann, N. & Heiny, J. (2025). Limiting spectral distribution for large sample correlation matrices. The Annals of Applied Probability, 35(4), 2570-2603
Open this publication in new window or tab >>Limiting spectral distribution for large sample correlation matrices
2025 (English)In: The Annals of Applied Probability, ISSN 1050-5164, E-ISSN 2168-8737, Vol. 35, no 4, p. 2570-2603Article in journal (Refereed) Published
Abstract [en]

In this paper, we consider the empirical spectral distribution of the sample correlation matrix and investigate its asymptotic behavior under mild assumptions on the data’s distribution, when dimension and sample size increase at the same rate. First, we give a characterization for the limiting spectral distribution to follow a Marčenko–Pastur law assuming that the underlying data matrix consists of i.i.d. entries. Subsequently, we provide the limiting spectral distribution of the sample correlation matrix when allowing for a dependence structure within the columns of the data matrix. In contrast to previous works, both the fourth moment of the data and the largest eigenvalue of the population correlation matrix may be unbounded, resulting in a fundamental structural difference. More precisely, the standard argument of approximating the sample correlation matrix by its sample covariance companion breaks down and novel techniques for tackling the challenging dependency structure of the sample correlation matrix are introduced.

Keywords
dependent data, limiting spectral distribution, Marčenko–Pastur law, Sample correlation matrix
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-246717 (URN)10.1214/25-AAP2181 (DOI)2-s2.0-105014127608 (Scopus ID)
Available from: 2025-09-11 Created: 2025-09-11 Last updated: 2025-09-11Bibliographically approved
Heiny, J. & Kleemann, C. (2025). Maximum interpoint distance of high-dimensional random vectors. Bernoulli, 31(1), 537-560
Open this publication in new window or tab >>Maximum interpoint distance of high-dimensional random vectors
2025 (English)In: Bernoulli, ISSN 1350-7265, E-ISSN 1573-9759, Vol. 31, no 1, p. 537-560Article in journal (Refereed) Published
Abstract [en]

A limit theorem for the largest interpoint distance of p independent and identically distributed points in Rn to the Gumbel distribution is proved, where the number of points p = pn tends to infinity as the dimension of the points n → ∞. The theorem holds under moment assumptions and corresponding conditions on the growth rate of p. We obtain a plethora of ancillary results such as the joint convergence of maximum and minimum interpoint dis-tances. Using the inherent sum structure of interpoint distances, our result is generalized to maxima of dependent random walks with non-decaying correlations and we also derive point process convergence. An application of the maximum interpoint distance to testing the equality of means for high-dimensional random vectors is presented. Moreover, we study the largest off-diagonal entry of a sample covariance matrix. The proofs are based on the Chen-Stein Poisson approximation method and Gaussian approximation to large deviation probabilities.

Keywords
Extreme value theory, Gumbel distribution, high dimension, independence test, maximum under dependence, p-norms
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-239957 (URN)10.3150/24-BEJ1738 (DOI)001392889900022 ()2-s2.0-85208722227 (Scopus ID)
Available from: 2025-02-28 Created: 2025-02-28 Last updated: 2025-02-28Bibliographically approved
Heiny, J. & Parolya, N. (2024). Log determinant of large correlation matrices under infinite fourth moment. Annales de l'I.H.P. Probabilites et statistiques, 60(2), 1048-1076
Open this publication in new window or tab >>Log determinant of large correlation matrices under infinite fourth moment
2024 (English)In: Annales de l'I.H.P. Probabilites et statistiques, ISSN 0246-0203, E-ISSN 1778-7017, Vol. 60, no 2, p. 1048-1076Article in journal (Refereed) Published
Abstract [en]

In this paper, we show the central limit theorem for the logarithmic determinant of the sample correlation matrix R constructed from the (p×n)(p×n)-dimensional data matrix X containing independent and identically distributed random entries with mean zero, variance one and infinite fourth moments. Precisely, we show that for p/n→γ∈(0,1) as n,p→∞ the logarithmic law

is still valid if the entries of the data matrix X follow a symmetric distribution with a regularly varying tail of index α∈(3,4). The latter assumptions seem to be crucial, which is justified by the simulations: if the entries of X have the infinite absolute third moment and/or their distribution is not symmetric, the logarithmic law is not valid anymore. The derived results highlight that the logarithmic determinant of the sample correlation matrix is a very stable and flexible statistic for heavy-tailed big data and open a novel way of analysis of high-dimensional random matrices with self-normalized entries.

Abstract [fr]

Dans cet article, nous démontrons le théorème de la limite centrale pour le déterminant logarithmique d’une matrice de corrélation R construite à partir d’une matrice de données X de taille (p×n)(p×n) contenant des entrées indépendantes d’espérance 0, variance 1 et quatrième moment infini. Plus précisément, nous démontrons que dans le régime p/n→γ∈(0,1) quand n,p→∞ la loi logarithmique

est toujours valable si les entrées de la matrice de données X suivent une distribution symétrique avec une queue à variation régulière d’indice α∈(3,4). Ces dernières conditions semblent être cruciales, ce qui est justifié par les simulations : si les entrées de X n’ont pas de troisième moment et/ou si leur distribution n’est pas symétrique, la loi logarithmique n’est plus valable. Les résultats obtenus mettent en évidence que le déterminant logarithmique d’une matrice de corrélation est une statistique très stable et flexible pour les données massives à queue lourde et ouvrent une nouvelle voie pour analyser les grandes matrices aléatoires avec entrées auto-normalisées.

Keywords
Sample correlation matrix, Logarithmic determinant, Random matrix theory, Heavy tails, Infinite fourth moment
National Category
Probability Theory and Statistics
Research subject
Mathematical Statistics
Identifiers
urn:nbn:se:su:diva-226669 (URN)10.1214/23-AIHP1368 (DOI)001250209000012 ()2-s2.0-85196278449 (Scopus ID)
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-11-22Bibliographically approved
Parolya, N., Heiny, J. & Kurowicka, D. (2024). Logarithmic law of large random correlation matrices. Bernoulli, 30(1), 346-370
Open this publication in new window or tab >>Logarithmic law of large random correlation matrices
2024 (English)In: Bernoulli, ISSN 1350-7265, E-ISSN 1573-9759, Vol. 30, no 1, p. 346-370Article in journal (Refereed) Published
Abstract [en]

Consider a random vector y=Σ1∕2x, where the p elements of the vector x are i.i.d. real-valued random variables with zero mean and finite fourth moment, and Σ1∕2 is a deterministic p×p matrix such that the eigenvalues of the population correlation matrix R of y are uniformly bounded away from zero and infinity. In this paper, we find that the log determinant of the sample correlation matrix based on a sample of size n from the distribution of y satisfies a CLT (central limit theorem) for pnγ∈(0,1] and pn. Explicit formulas for the asymptotic mean and variance are provided. In case the mean of y is unknown, we show that after re-centering by the empirical mean the obtained CLT holds with a shift in the asymptotic mean. This result is of independent interest in both large dimensional random matrix theory and high-dimensional statistical literature of large sample correlation matrices for non-normal data. Finally, the obtained findings are applied for testing of uncorrelatedness of p random variables. Surprisingly, in the null case R=I, the test statistic becomes distribution-free and the extensive simulations show that the obtained CLT also holds if the moments of order four do not exist at all, which conjectures a promising and robust test statistic for heavy-tailed high-dimensional data.

Keywords
CLT, dependent data, large-dimensional asymptotic, log determinant, random matrix theory, sample correlation matrix
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-226666 (URN)10.3150/23-bej1600 (DOI)001171838900017 ()2-s2.0-85177172141 (Scopus ID)
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-10-14Bibliographically approved
Heiny, J. & Kleemann, C. (2024). Point process convergence for symmetric functions of high-dimensional random vectors. Extremes, 27, 185-217
Open this publication in new window or tab >>Point process convergence for symmetric functions of high-dimensional random vectors
2024 (English)In: Extremes, ISSN 1386-1999, E-ISSN 1572-915X, Vol. 27, p. 185-217Article in journal (Refereed) Published
Abstract [en]

The convergence of a sequence of point processes with dependent points, defined by a symmetric function of iid high-dimensional random vectors, to a Poisson random measure is proved. This also implies the convergence of the joint distribution of a fixed number of upper order statistics. As applications of the result a generalization of maximum convergence to point process convergence is given for simple linear rank statistics, rank-type U-statistics and the entries of sample covariance matrices.

Keywords
Point process convergence, Extreme value theory, Poisson process, Gumbel distribution, High-dimensional data, U-statistics, Kendall's tau, Spearman's rho
National Category
Mathematics
Identifiers
urn:nbn:se:su:diva-225460 (URN)10.1007/s10687-023-00482-w (DOI)001126975800001 ()2-s2.0-85180258606 (Scopus ID)
Available from: 2024-01-16 Created: 2024-01-16 Last updated: 2024-10-14Bibliographically approved
Bastian, P., Dette, H. & Heiny, J. (2024). Testing for practically significant dependencies in high dimensions via bootstrapping maxima of U-statistics. Annals of Statistics, 52(2), 628-653
Open this publication in new window or tab >>Testing for practically significant dependencies in high dimensions via bootstrapping maxima of U-statistics
2024 (English)In: Annals of Statistics, ISSN 0090-5364, E-ISSN 2168-8966, Vol. 52, no 2, p. 628-653Article in journal (Refereed) Published
Abstract [en]

This paper takes a different look on the problem of testing the mutual independence of the components of a high-dimensional vector. Instead of testing if all pairwise associations (e.g., all pairwise Kendall’s τ) between the components vanish, we are interested in the (null) hypothesis that all pairwise associations do not exceed a certain threshold in absolute value. The consideration of these hypotheses is motivated by the observation that in the high-dimensional regime, it is rare, and perhaps impossible, to have a null hypothesis that can be exactly modeled by assuming that all pairwise associations are precisely equal to zero. The formulation of the null hypothesis as a composite hypothesis makes the problem of constructing tests nonstandard and in this paper we provide a solution for a broad class of dependence measures, which can be estimated by U-statistics. In particular, we develop an asymptotic and a bootstrap level α-test for the new hypotheses in the high-dimensional regime. We also prove that the new tests are minimax-optimal and investigate their finite sample properties by means of a small simulation study and a data example.

Keywords
bootstrap, gaussian approximation, Independence testing, minimax optimality, relevant association, U-statistics
National Category
Probability Theory and Statistics
Identifiers
urn:nbn:se:su:diva-235945 (URN)10.1214/24-AOS2361 (DOI)001244466200009 ()2-s2.0-85193707735 (Scopus ID)
Available from: 2024-11-27 Created: 2024-11-27 Last updated: 2024-11-27Bibliographically approved
Fleermann, M. & Heiny, J. (2023). Large sample covariance matrices of Gaussian observations with uniform correlation decay. Stochastic Processes and their Applications, 162, 456-480
Open this publication in new window or tab >>Large sample covariance matrices of Gaussian observations with uniform correlation decay
2023 (English)In: Stochastic Processes and their Applications, ISSN 0304-4149, E-ISSN 1879-209X, Vol. 162, p. 456-480Article in journal (Refereed) Published
Abstract [en]

We derive the Marchenko–Pastur (MP) law for sample covariance matrices of the form , where X is a p × n data matrix and p/ny ∈ (0,∞) as n, p → ∞. We assume the data in X stems from a correlated joint normal distribution. In particular, the correlation acts both across rows and across columns of X, and we do not assume a specific correlation structure, such as separable dependencies. Instead, we assume that correlations converge uniformly to zero at a speed of an/n, where an may grow mildly to infinity. We employ the method of moments tightly: We identify the exact condition on the growth of an which will guarantee that the moments of the empirical spectral distributions (ESDs) converge to the MP moments. If the condition is not met, we can construct an ensemble for which all but finitely many moments of the ESDs diverge. We also investigate the operator norm of Vn under a uniform correlation bound of C/nδ, where C, δ > 0 are fixed, and observe a phase transition at δ = 1. In particular, convergence of the operator norm to the maximum of the support of the MP distribution can only be guaranteed if δ > 1. The analysis leads to an example for which the MP law holds almost surely, but the operator norm remains stochastic in the limit, and we provide its exact limiting distribution.

Keywords
Sample covariance matrices, Marchenko–Pastur law, Correlated Gaussian, Operator norm
National Category
Probability Theory and Statistics
Research subject
Mathematical Statistics
Identifiers
urn:nbn:se:su:diva-226668 (URN)10.1016/j.spa.2023.04.020 (DOI)001008675100001 ()2-s2.0-85162797843 (Scopus ID)
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-02-20Bibliographically approved
Gusakova, A., Heiny, J. & Thäle, C. (2023). The volume of random simplices from elliptical distributions in high dimension. Stochastic Processes and their Applications, 164, 357-382
Open this publication in new window or tab >>The volume of random simplices from elliptical distributions in high dimension
2023 (English)In: Stochastic Processes and their Applications, ISSN 0304-4149, E-ISSN 1879-209X, Vol. 164, p. 357-382Article in journal (Refereed) Published
Abstract [en]

Random simplices and more general random convex bodies of dimension p in with pn are considered, which are generated by random vectors having an elliptical distribution. In the high-dimensional regime, that is, if p → ∞ and n → ∞ in such a way that p/nγ ∈ (0, 1), a central and a stable limit theorem for the logarithmic volume of random simplices and random convex bodies is shown. The result follows from a related central limit theorem for the log-determinant of p × n random matrices whose rows are copies of a random vector with an elliptical distribution, which is established as well.

National Category
Probability Theory and Statistics
Research subject
Mathematics
Identifiers
urn:nbn:se:su:diva-226667 (URN)10.1016/j.spa.2023.07.012 (DOI)001053014800001 ()2-s2.0-85166740319 (Scopus ID)
Funder
German Research Foundation (DFG), EXC 2044 – 390685587
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-02-20Bibliographically approved
Heiny, J. & Kleemann, C.Asymptotic independence of point process and Frobenius norm of a large sample covariance matrix.
Open this publication in new window or tab >>Asymptotic independence of point process and Frobenius norm of a large sample covariance matrix
(English)Manuscript (preprint) (Other academic)
Abstract [en]

A joint limit theorem for the point process of the off-diagonal entries of a sample covariance matrix S, constructed from n observations of a p-dimensional random vector with iid components, and the Frobenius norm of S is proved. In particular, assuming that p and n tend to infinity we obtain a central limit theorem for the Frobenius norm in the case of finite fourth moment of the components and an infinite variance stable law in the case of infinite fourth moment. Extending a theorem of Kallenberg, we establish asymptotic independence of the point process and the Frobenius norm of S. To the best of our knowledge, this is the first result about joint convergence of a point process of dependent points and their sum in the non-Gaussian case. 

National Category
Probability Theory and Statistics
Research subject
Mathematical Statistics
Identifiers
urn:nbn:se:su:diva-226674 (URN)
Available from: 2024-02-14 Created: 2024-02-14 Last updated: 2024-02-26Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-4680-8836

Search in DiVA

Show all publications