Change search
Link to record
Permanent link

Direct link
Balabanov, OleksandrORCID iD iconorcid.org/0000-0003-0417-9856
Publications (5 of 5) Show all publications
Balabanov, O., Mehlig, B. & Linander, H. (2023). Bayesian posterior approximation with stochastic ensembles. In: 2023 IEEE/CVF Conference on Computer Visionand Pattern Recognition: CVPR 2023. Paper presented at 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 18-22 June, 2023 (pp. 13701-13711). Los Alamitos: IEEE Computer Society
Open this publication in new window or tab >>Bayesian posterior approximation with stochastic ensembles
2023 (English)In: 2023 IEEE/CVF Conference on Computer Visionand Pattern Recognition: CVPR 2023, Los Alamitos: IEEE Computer Society, 2023, p. 13701-13711Conference paper, Published paper (Refereed)
Abstract [en]

We introduce ensembles of stochastic neural networks to approximate the Bayesian posterior, combining stochastic methods such as dropout with deep ensembles. The stochastic ensembles are formulated as families of distributions and trained to approximate the Bayesian posterior with variational inference. We implement stochastic ensembles based on Monte Carlo dropout, DropConnect and a novel non-parametric version of dropout and evaluate them on a toy problem and CIFAR image classification. For both tasks, we test the quality of the posteriors directly against Hamiltonian Monte Carlo simulations. Our results show that stochastic ensembles provide more accurate posterior estimates than other popular baselines for Bayesian inference.

Place, publisher, year, edition, pages
Los Alamitos: IEEE Computer Society, 2023
Series
Conference on Computer Vision and Pattern Recognition (CVPR), ISSN 1063-6919, E-ISSN 2575-7075
National Category
Computer Sciences
Identifiers
urn:nbn:se:su:diva-235300 (URN)10.1109/CVPR52729.2023.01317 (DOI)001062522106003 ()2-s2.0-85173970846 (Scopus ID)979-8-3503-0130-4 (ISBN)979-8-3503-0129-8 (ISBN)
Conference
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 18-22 June, 2023
Available from: 2024-11-05 Created: 2024-11-05 Last updated: 2024-11-05Bibliographically approved
Linander, H., Balabanov, O., Yang, H. & Mehlig, B. (2023). Looking at the posterior: accuracy and uncertainty of neural-network predictions. Machine Learning: Science and Technology, 4(4), Article ID 045032.
Open this publication in new window or tab >>Looking at the posterior: accuracy and uncertainty of neural-network predictions
2023 (English)In: Machine Learning: Science and Technology, E-ISSN 2632-2153, Vol. 4, no 4, article id 045032Article in journal (Refereed) Published
Abstract [en]

Bayesian inference can quantify uncertainty in the predictions of neural networks using posterior distributions for model parameters and network output. By looking at these posterior distributions, one can separate the origin of uncertainty into aleatoric and epistemic contributions. One goal of uncertainty quantification is to inform on prediction accuracy. Here we show that prediction accuracy depends on both epistemic and aleatoric uncertainty in an intricate fashion that cannot be understood in terms of marginalized uncertainty distributions alone. How the accuracy relates to epistemic and aleatoric uncertainties depends not only on the model architecture, but also on the properties of the dataset. We discuss the significance of these results for active learning and introduce a novel acquisition function that outperforms common uncertainty-based methods. To arrive at our results, we approximated the posteriors using deep ensembles, for fully-connected, convolutional and attention-based neural networks.

Keywords
deep learning, uncertainty quantification, bayesian inference, neural networks, active learning
National Category
Computer Sciences
Identifiers
urn:nbn:se:su:diva-224620 (URN)10.1088/2632-2153/ad0ab4 (DOI)001103803300001 ()2-s2.0-85178128076 (Scopus ID)
Available from: 2023-12-20 Created: 2023-12-20 Last updated: 2023-12-20Bibliographically approved
Balabanov, O., Ortega-Taberner, C. & Hermanns, M. (2022). Quantization of topological indices in critical chains at low temperatures. Physical Review B, 106(4), Article ID 045116.
Open this publication in new window or tab >>Quantization of topological indices in critical chains at low temperatures
2022 (English)In: Physical Review B, ISSN 2469-9950, E-ISSN 2469-9969, Vol. 106, no 4, article id 045116Article in journal (Refereed) Published
Abstract [en]

Various types of topological phenomena at criticality are currently under active research. In this paper we suggest to generalize the known topological quantities to finite temperatures, allowing us to consider gapped and critical (gapless) systems on the same footing. It is then discussed that the quantization of the topological indices, also at critically, is retrieved by taking the low-temperature limit. This idea is explicitly illustrated on a simple case study of chiral critical chains where the quantization is shown analytically and verified numerically. The formalism is also applied for studying robustness of the topological indices to various types of disordering perturbations.

National Category
Condensed Matter Physics
Identifiers
urn:nbn:se:su:diva-208403 (URN)10.1103/PhysRevB.106.045116 (DOI)000834338800001 ()2-s2.0-85134877094 (Scopus ID)
Available from: 2022-08-29 Created: 2022-08-29 Last updated: 2022-12-08Bibliographically approved
Balabanov, O., Erkensten, D. & Johannesson, H. (2021). Topology of critical chiral phases: Multiband insulators and superconductors. Physical Review Research, 3(4), Article ID 043048.
Open this publication in new window or tab >>Topology of critical chiral phases: Multiband insulators and superconductors
2021 (English)In: Physical Review Research, E-ISSN 2643-1564, Vol. 3, no 4, article id 043048Article in journal (Refereed) Published
Abstract [en]

Recent works have proved the existence of symmetry-protected edge states in certain one-dimensional topological band insulators and superconductors at the gap-closing points which define quantum phase transitions between two topologically nontrivial phases. We show how this picture generalizes to multiband critical models belonging to any of the chiral symmetry classes AIII, BDI, or CII of noninteracting fermions in one dimension.

National Category
Physical Sciences
Identifiers
urn:nbn:se:su:diva-199490 (URN)10.1103/PhysRevResearch.3.043048 (DOI)000708677100004 ()
Available from: 2021-12-09 Created: 2021-12-09 Last updated: 2021-12-09Bibliographically approved
Balabanov, O. & Granath, M. (2021). Unsupervised interpretable learning of topological indices invariant under permutations of atomic bands. Machine Learning: Science and Technology, 2(2), Article ID 025008.
Open this publication in new window or tab >>Unsupervised interpretable learning of topological indices invariant under permutations of atomic bands
2021 (English)In: Machine Learning: Science and Technology, E-ISSN 2632-2153, Vol. 2, no 2, article id 025008Article in journal (Refereed) Published
Abstract [en]

Multi-band insulating Bloch Hamiltonians with internal or spatial symmetries, such as particle-hole or inversion, may have topologically disconnected sectors of trivial atomic-limit (momentum-independent) Hamiltonians. We present a neural-network-based protocol for finding topologically relevant indices that are invariant under transformations between such trivial atomic-limit Hamiltonians, thus corresponding to the standard classification of band insulators. The work extends the method of 'topological data augmentation' for unsupervised learning introduced (2020 Phys. Rev. Res. 2 013354) by also generalizing and simplifying the data generation scheme and by introducing a special 'mod' layer of the neural network appropriate for Z ( n ) classification. Ensembles of training data are generated by deforming seed objects in a way that preserves a discrete representation of continuity. In order to focus the learning on the topologically relevant indices, prior to the deformation procedure we stack the seed Bloch Hamiltonians with a complete set of symmetry-respecting trivial atomic bands. The obtained datasets are then used for training an interpretable neural network specially designed to capture the topological properties by learning physically relevant momentum space quantities, even in crystalline symmetry classes.

Keywords
machine learning, topological phases, unsupervised learning, interpretable neural networks
National Category
Physical Sciences
Identifiers
urn:nbn:se:su:diva-196337 (URN)10.1088/2632-2153/abcc43 (DOI)000660867900001 ()
Available from: 2021-09-09 Created: 2021-09-09 Last updated: 2022-12-08Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-0417-9856

Search in DiVA

Show all publications