Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Limits and potential of combined folding and docking
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0002-4303-9939
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0001-7161-9028
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0003-0568-8281
Show others and affiliations
Number of Authors: 62022 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 38, no 4, p. 954-961Article in journal (Refereed) Published
Abstract [en]

Motivation: In the last decade, de novo protein structure prediction accuracy for individual proteins has improved significantly by utilising deep learning (DL) methods for harvesting the co-evolution information from large multiple sequence alignments (MSAs). The same approach can, in principle, also be used to extract information about evolutionary-based contacts across protein-protein interfaces. However, most earlier studies have not used the latest DL methods for inter-chain contact distance prediction. This article introduces a fold-and-dock method based on predicted residue-residue distances with trRosetta.

Results: The method can simultaneously predict the tertiary and quaternary structure of a protein pair, even when the structures of the monomers are not known. The straightforward application of this method to a standard dataset for protein-protein docking yielded limited success. However, using alternative methods for generating MSAs allowed us to dock accurately significantly more proteins. We also introduced a novel scoring function, PconsDock, that accurately separates 98% of correctly and incorrectly folded and docked proteins. The average performance of the method is comparable to the use of traditional, template-based or ab initio shape-complementarity-only docking methods. Moreover, the results of conventional and fold-and-dock approaches are complementary, and thus a combined docking pipeline could increase overall docking success significantly. This methodology contributed to the best model for one of the CASP14 oligomeric targets, H1065.

Place, publisher, year, edition, pages
2022. Vol. 38, no 4, p. 954-961
National Category
Biological Sciences Computer and Information Sciences
Identifiers
URN: urn:nbn:se:su:diva-202237DOI: 10.1093/bioinformatics/btab760ISI: 000747962400010PubMedID: 34788800OAI: oai:DiVA.org:su-202237DiVA, id: diva2:1640110
Available from: 2022-02-23 Created: 2022-02-23 Last updated: 2023-08-11Bibliographically approved
In thesis
1. Decipher protein complex structures from sequence
Open this publication in new window or tab >>Decipher protein complex structures from sequence
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Proteins are essential constituents of biological systems. A profound understanding of protein structure is significant for unraveling the intricate mechanisms of biological processes. The recent development of computational methods using AI technology is revolutionizing the structural biology field. Accurate predictions of three-dimentional protein structures can be generated from protein sequences, enabling rapid and accurate insights into protein interactions and functions. This thesis aims to investigate the applications of various cutting-edge methods in protein complex structure prediction. We first explore using trRosetta for dimeric protein complexes, and the study shows that the single-chain protein structure predictor is feasible for protein complexes. In light of the success of AlphaFold2, we use the pipeline FoldDock, which is an adaption of AlphaFold2 on protein complexes, for protein-protein interactions (PPIs) of two human interactome datasets and construct a PPI network. Next, we conduct a benchmark study of AlphaFold-Multimer in multi-chain protein complexes with 2 to 6 chains and examine how different evaluation scores affect the prediction assessment. In the last paper, we predict the large protein complexes starting from subcomponents using AlphaFold2 and a Monte Carlo Tree Search algorithm. The studies in this thesis show that deep learning approaches can yield reliable results in predicting protein complex structures, and there is ample potential for further improvement. 

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2023. p. 64
Keywords
Protein complex structure prediction, protein interaction, AI, AlphaFold
National Category
Bioinformatics (Computational Biology) Bioinformatics and Computational Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-219975 (URN)978-91-8014-414-8 (ISBN)978-91-8014-415-5 (ISBN)
Public defence
2023-09-25, Air & Fire, SciLifeLab, Tomtebodavägen 23A, Solna, 14:00 (English)
Opponent
Supervisors
Available from: 2023-08-31 Created: 2023-08-10 Last updated: 2025-02-05Bibliographically approved
2. Deep learning solutions to protein quaternary structure
Open this publication in new window or tab >>Deep learning solutions to protein quaternary structure
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Interactions between proteins are directly involved in most biological processes and are essential for the correct functioning of every form of life. The nature of protein-protein interactions allows functional assemblies of hundreds of protein chains. Given the enormous complexity and the pivotal role of protein interactions in life’s mechanics, the necessity to obtain a complete comprehension of such mechanisms is just as big as the challenge to achieve such knowledge. In the last few decades, experimental procedures constantly improved, dramatically increasing the available structural data for protein interactions. Unfortunately, experimental methods require a lot of time and resources and cannot always be applied with the same degree of success. Several computational methods have been developed in parallel with experimental procedures to overcome such limitations. Therefore, this thesis focused on screening existing computational methods and adopting them to improve the overall accuracy in solving structures of protein-complexes. In the first paper, I propose a simple rigid-body docking framework to test several interface predictors and their ability to drive a protein-protein docking procedure. Next, in the second paper, I display a method to adapt the trRosetta deep neural network to predict inter-residues distances and dihedral angle constraints for full protein complexes. The same concept is then improved in the third paper with FoldDock, an adaptation of Alphafold2 to work on multiple protein sequences and produce the corresponding complex. Finally, in the fourth paper, the FoldDock pipeline is applied to a large dataset of protein pairwise interactions derived from the hu.MAP and HuRI datasets, resulting in the characterization of more than 3000 high-confidence structural models.

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2023. p. 78
Keywords
protein interactions, interface prediction, structure prediction, docking, deep learning
National Category
Bioinformatics and Computational Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-219990 (URN)978-91-8014-450-6 (ISBN)978-91-8014-451-3 (ISBN)
Public defence
2023-10-06, Gamma2 - Air&Fire - G2690, SciLifeLab, Tomtebodavägen 23A, Solna, 14:00 (English)
Opponent
Supervisors
Available from: 2023-09-13 Created: 2023-08-11 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMed

Authority records

Pozzati, GabrieleZhu, WensiBassot, ClaudioLamb, JohnKundrotas, PetrasElofsson, Arne

Search in DiVA

By author/editor
Pozzati, GabrieleZhu, WensiBassot, ClaudioLamb, JohnKundrotas, PetrasElofsson, Arne
By organisation
Department of Biochemistry and BiophysicsScience for Life Laboratory (SciLifeLab)
In the same journal
Bioinformatics
Biological SciencesComputer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 75 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf