Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Improved prediction of protein-protein interactions using AlphaFold2
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0003-3439-1866
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0002-4303-9939
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0002-7115-9751
Number of Authors: 32022 (English)In: Nature Communications, E-ISSN 2041-1723, Vol. 13, no 1, article id 1265Article in journal (Refereed) Published
Abstract [en]

Predicting the structure of interacting protein chains is a fundamental step towards understanding protein function. Unfortunately, no computational method can produce accurate structures of protein complexes. AlphaFold2, has shown unprecedented levels of accuracy in modelling single chain protein structures. Here, we apply AlphaFold2 for the prediction of heterodimeric protein complexes. We find that the AlphaFold2 protocol together with optimised multiple sequence alignments, generate models with acceptable quality (DockQ >= 0.23) for 63% of the dimers. From the predicted interfaces we create a simple function to predict the DockQ score which distinguishes acceptable from incorrect models as well as interacting from non-interacting proteins with state-of-art accuracy. We find that, using the predicted DockQ scores, we can identify 51% of all interacting pairs at 1% FPR. Predicting the structure of protein complexes is extremely difficult. Here, authors apply AlphaFold2 with optimized multiple sequence alignments to model complexes of interacting proteins, enabling prediction of both if and how proteins interact with state-of-art accuracy.

Place, publisher, year, edition, pages
2022. Vol. 13, no 1, article id 1265
National Category
Biological Sciences
Identifiers
URN: urn:nbn:se:su:diva-203709DOI: 10.1038/s41467-022-28865-wISI: 000767467900005PubMedID: 35273146Scopus ID: 2-s2.0-85126195059OAI: oai:DiVA.org:su-203709DiVA, id: diva2:1650764
Note

For correction, see: Bryant, P., Pozzati, G. & Elofsson, A. Author Correction: Improved prediction of protein-protein interactions using AlphaFold2. Nat Commun 13, 1694 (2022). DOI: 10.1038/s41467-022-29480-5

Available from: 2022-04-08 Created: 2022-04-08 Last updated: 2023-08-11Bibliographically approved
In thesis
1. Learning Protein Evolution and Structure
Open this publication in new window or tab >>Learning Protein Evolution and Structure
2022 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

By analysing the structure of a protein it is possible to draw conclusions about its function. Obtaining the structure of a protein experimentally is however a time consuming and expensive process. By using evolution it is possible to infer the structure of a protein. AlphaFold2 (AF), the latest AI technology for protein structure prediction, uses evolutionary information to obtain protein structures in minutes instead of years at a fraction of the experimental cost. Here, we develop this technology further to predict the structure of interacting proteins. We create a confidence score, pDockQ, and show that this score rivals high-throughput experiments in distinguishing true and false protein-protein interactions (PPIs). Applying AF and the pDockQ score to a set of 65484 human PPIs we identify 1371 new high-confidence models. These models expand the structural knowledge of human protein complexes and can be used to e.g. develop new drugs or evaluate biological pathways. One limitation of AF is that the accuracy decreases with the number of proteins being predicted together and that the biggest protein complexes do not fit in the memory of the latest GPUs. To circumvent these issues, we predict subcomponents of protein complexes and assemble these together with Monte Carlo Tree search (MCTS). MCTS enables assembling some of the largest protein complexes using only sequence information and stoichiometry. Out of 175 protein complexes with 10-30 chains, 91 can be completely assembled with a median TM-score of 0.51. A third of these (30 complexes) are highly accurate (TM-score ≥0.8). The use of highly accurate protein structure prediction is revolutionising many fiends of biological research only one year after its realisation. Likely, this is only the beginning of a new era; the era of AI.  

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2022. p. 44
Keywords
Protein structure prediction, Evolution, AI, AlphaFold
National Category
Bioinformatics and Computational Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-207579 (URN)978-91-7911-952-2 (ISBN)978-91-7911-953-9 (ISBN)
Public defence
2022-09-26, Air & Fire, SciLifeLab, Tomtebodavägen 23A, Solna, 14:00 (English)
Opponent
Supervisors
Available from: 2022-09-01 Created: 2022-07-29 Last updated: 2025-02-07Bibliographically approved
2. Deep learning solutions to protein quaternary structure
Open this publication in new window or tab >>Deep learning solutions to protein quaternary structure
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Interactions between proteins are directly involved in most biological processes and are essential for the correct functioning of every form of life. The nature of protein-protein interactions allows functional assemblies of hundreds of protein chains. Given the enormous complexity and the pivotal role of protein interactions in life’s mechanics, the necessity to obtain a complete comprehension of such mechanisms is just as big as the challenge to achieve such knowledge. In the last few decades, experimental procedures constantly improved, dramatically increasing the available structural data for protein interactions. Unfortunately, experimental methods require a lot of time and resources and cannot always be applied with the same degree of success. Several computational methods have been developed in parallel with experimental procedures to overcome such limitations. Therefore, this thesis focused on screening existing computational methods and adopting them to improve the overall accuracy in solving structures of protein-complexes. In the first paper, I propose a simple rigid-body docking framework to test several interface predictors and their ability to drive a protein-protein docking procedure. Next, in the second paper, I display a method to adapt the trRosetta deep neural network to predict inter-residues distances and dihedral angle constraints for full protein complexes. The same concept is then improved in the third paper with FoldDock, an adaptation of Alphafold2 to work on multiple protein sequences and produce the corresponding complex. Finally, in the fourth paper, the FoldDock pipeline is applied to a large dataset of protein pairwise interactions derived from the hu.MAP and HuRI datasets, resulting in the characterization of more than 3000 high-confidence structural models.

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2023. p. 78
Keywords
protein interactions, interface prediction, structure prediction, docking, deep learning
National Category
Bioinformatics and Computational Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-219990 (URN)978-91-8014-450-6 (ISBN)978-91-8014-451-3 (ISBN)
Public defence
2023-10-06, Gamma2 - Air&Fire - G2690, SciLifeLab, Tomtebodavägen 23A, Solna, 14:00 (English)
Opponent
Supervisors
Available from: 2023-09-13 Created: 2023-08-11 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Bryant, PatrickPozzati, GabrieleElofsson, Arne

Search in DiVA

By author/editor
Bryant, PatrickPozzati, GabrieleElofsson, Arne
By organisation
Department of Biochemistry and BiophysicsScience for Life Laboratory (SciLifeLab)
In the same journal
Nature Communications
Biological Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 294 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf