Ändra sökning
Länk till posten
Permanent länk

Direktlänk
Alternativa namn
Publikationer (2 of 2) Visa alla publikationer
Uziela, K., Menéndez Hurtado, D., Shu, N., Wallner, B. & Elofsson, A. (2018). Improved protein model quality assessments by changing the target function. Proteins: Structure, Function, and Bioinformatics, 86(6), 654-663
Öppna denna publikation i ny flik eller fönster >>Improved protein model quality assessments by changing the target function
Visa övriga...
2018 (Engelska)Ingår i: Proteins: Structure, Function, and Bioinformatics, ISSN 0887-3585, E-ISSN 1097-0134, Vol. 86, nr 6, s. 654-663Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Protein modeling quality is an important part of protein structure prediction. We have for more than a decade developed a set of methods for this problem. We have used various types of description of the protein and different machine learning methodologies. However, common to all these methods has been the target function used for training. The target function in ProQ describes the local quality of a residue in a protein model. In all versions of ProQ the target function has been the S-score. However, other quality estimation functions also exist, which can be divided into superposition- and contact-based methods. The superposition-based methods, such as S-score, are based on a rigid body superposition of a protein model and the native structure, while the contact-based methods compare the local environment of each residue. Here, we examine the effects of retraining our latest predictor, ProQ3D, using identical inputs but different target functions. We find that the contact-based methods are easier to predict and that predictors trained on these measures provide some advantages when it comes to identifying the best model. One possible reason for this is that contact based methods are better at estimating the quality of multi-domain targets. However, training on the S-score gives the best correlation with the GDT_TS score, which is commonly used in CASP to score the global model quality. To take the advantage of both of these features we provide an updated version of ProQ3D that predicts local and global model quality estimates based on different quality estimates.

Nyckelord
CASP, deep learning, estimation of model accuracy, model quality assessments, protein structure prediction
Nationell ämneskategori
Biologiska vetenskaper
Forskningsämne
biokemi med inriktning mot bioinformatik
Identifikatorer
urn:nbn:se:su:diva-156779 (URN)10.1002/prot.25492 (DOI)000431734800006 ()29524250 (PubMedID)
Tillgänglig från: 2018-06-04 Skapad: 2018-06-04 Senast uppdaterad: 2022-02-26Bibliografiskt granskad
Elofsson, A., Joo, K., Keasar, C., Lee, J., Maghrabi, A. H. A., Manavalan, B., . . . Wallner, B. (2018). Methods for estimation of model accuracy in CASP12. Proteins: Structure, Function, and Bioinformatics, 86(S1), 361-373
Öppna denna publikation i ny flik eller fönster >>Methods for estimation of model accuracy in CASP12
Visa övriga...
2018 (Engelska)Ingår i: Proteins: Structure, Function, and Bioinformatics, ISSN 0887-3585, E-ISSN 1097-0134, Vol. 86, nr S1, s. 361-373Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Methods to reliably estimate the quality of 3D models of proteins are essential drivers for the wide adoption and serious acceptance of protein structure predictions by life scientists. In this article, the most successful groups in CASP12 describe their latest methods for estimates of model accuracy (EMA). We show that pure single model accuracy estimation methods have shown clear progress since CASP11; the 3 top methods (MESHI, ProQ3, SVMQA) all perform better than the top method of CASP11 (ProQ2). Although the pure single model accuracy estimation methods outperform quasi-single (ModFOLD6 variations) and consensus methods (Pcons, ModFOLDclust2, Pcomb-domain, and Wallner) in model selection, they are still not as good as those methods in absolute model quality estimation and predictions of local quality. Finally, we show that when using contact-based model quality measures (CAD, lDDT) the single model quality methods perform relatively better.

Nyckelord
CASP, consensus predictions, estimates of model accuracy, machine learning, protein structure prediction, quality assessment
Nationell ämneskategori
Biologiska vetenskaper
Forskningsämne
biokemi med inriktning mot bioinformatik
Identifikatorer
urn:nbn:se:su:diva-154838 (URN)10.1002/prot.25395 (DOI)000425523000031 ()28975666 (PubMedID)
Tillgänglig från: 2018-04-10 Skapad: 2018-04-10 Senast uppdaterad: 2022-02-26Bibliografiskt granskad
Organisationer
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-3772-8279

Sök vidare i DiVA

Visa alla publikationer