Open this publication in new window or tab >>2026 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 42, no 4, article id btag136Article in journal (Refereed) Published
Abstract [en]
Motivation: AlphaFold2 significantly improved the prediction of protein complex structures. However, its accuracy is lower for interactions without coevolutionary signals, such as host–pathogen and antibody–antigen interactions. Two strategies have been developed to address this limitation: massive sampling and replacing the evoformer with the pairformer, which does not rely on coevolution, as introduced in AlphaFold3, thereby enabling more structural reasoning by the network.
Results: In this study, we benchmark structure prediction methods on unseen antibody–antigen complexes. We found that increased sampling improves the chances of generating a correct protein model, roughly in a log-linear manner. However, the internal quality estimates by AlphaFold often cannot identify the best predicted structures for each target, resulting in a significant loss of performance for the top-ranked protein model compared with the best model. For all methods, a significant challenge remains the identification of the best model. We also show that AlphaFold3 outperforms AlphaFold2, Boltz-1, and Chai-1. Furthermore, AlphaFold3 performance declines significantly for complexes that lack structural similarity to the training set, indicating that it has to some extent learned to detect remote structural similarities.
Availability and implementation: All code is available from github.com/samuelfromm/abag-benchmark-set/ and all data from DOI: 10.5281/zenodo.17978681. The latter repository also contains the code.
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:su:diva-254524 (URN)10.1093/bioinformatics/btag136 (DOI)001736660100001 ()41863324 (PubMedID)2-s2.0-105035306566 (Scopus ID)
2026-05-052026-05-052026-05-05Bibliographically approved