Change search
Link to record
Permanent link

Direct link
Publications (2 of 2) Show all publications
Kang, Y., Elofsson, A., Jiang, Y., Huang, W., Yu, M. & Li, Z. (2023). AFTGAN: prediction of multi-type PPI based on attention free transformer and graph attention network. Bioinformatics, 39(2), Article ID btad052.
Open this publication in new window or tab >>AFTGAN: prediction of multi-type PPI based on attention free transformer and graph attention network
Show others...
2023 (English)In: Bioinformatics, ISSN 1367-4803, E-ISSN 1367-4811, Vol. 39, no 2, article id btad052Article in journal (Refereed) Published
Abstract [en]

Motivation: Protein–protein interaction (PPI) networks and transcriptional regulatory networks are critical in regulating cells and their signaling. A thorough understanding of PPIs can provide more insights into cellular physiology at normal and disease states. Although numerous methods have been proposed to predict PPIs, it is still challenging for interaction prediction between unknown proteins. In this study, a novel neural network named AFTGAN was constructed to predict multi-type PPIs. Regarding feature input, ESM-1b embedding containing much biological information for proteins was added as a protein sequence feature besides amino acid co-occurrence similarity and one-hot coding. An ensemble network was also constructed based on a transformer encoder containing an AFT module (performing the weight operation on vital protein sequence feature information) and graph attention network (extracting the relational features of protein pairs) for the part of the network framework.

Results: The experimental results showed that the Micro-F1 of the AFTGAN based on three partitioning schemes (BFS, DFS and the random mode) on the SHS27K and SHS148K datasets was 0.685, 0.711 and 0.867, as well as 0.745, 0.819 and 0.920, respectively, all higher than that of other popular methods. In addition, the experimental comparisons confirmed the performance superiority of the proposed model for predicting PPIs of unknown proteins on the STRING dataset.

Availability and implementation: The source code is publicly available at https://github.com/1075793472/AFTGAN.

Supplementary information: Supplementary data are available at Bioinformatics online.

National Category
Biochemistry Molecular Biology Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:su:diva-222182 (URN)10.1093/bioinformatics/btad052 (DOI)001068097700001 ()36692145 (PubMedID)2-s2.0-85147457539 (Scopus ID)
Available from: 2023-10-18 Created: 2023-10-18 Last updated: 2025-02-20Bibliographically approved
Li, Z., Jiang, K., Qin, S., Zhong, Y. & Elofsson, A. (2021). GCSENet: A GCN, CNN and SENet ensemble model for microRNA-disease association prediction. PloS Computational Biology, 17(6), Article ID e1009048.
Open this publication in new window or tab >>GCSENet: A GCN, CNN and SENet ensemble model for microRNA-disease association prediction
Show others...
2021 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 17, no 6, article id e1009048Article in journal (Refereed) Published
Abstract [en]

Recently, an increasing number of studies have demonstrated that miRNAs are involved in human diseases, indicating that miRNAs might be a potential pathogenic factor for various diseases. Therefore, figuring out the relationship between miRNAs and diseases plays a critical role in not only the development of new drugs, but also the formulation of individualized diagnosis and treatment. As the prediction of miRNA-disease association via biological experiments is expensive and time-consuming, computational methods have a positive effect on revealing the association. In this study, a novel prediction model integrating GCN, CNN and Squeeze-and-Excitation Networks (GCSENet) was constructed for the identification of miRNA-disease association. The model first captured features by GCN based on a heterogeneous graph including diseases, genes and miRNAs. Then, considering the different effects of genes on each type of miRNA and disease, as well as the different effects of the miRNA-gene and disease-gene relationships on miRNA-disease association, a feature weight was set and a combination of miRNA-gene and disease-gene associations was added as feature input for the convolution operation in CNN. Furthermore, the squeeze and excitation blocks of SENet were applied to determine the importance of each feature channel and enhance useful features by means of the attention mechanism, thus achieving a satisfactory prediction of miRNA-disease association. The proposed method was compared against other state-of-the-art methods. It achieved an AUROC score of 95.02% and an AUPR score of 95.55% in a 10-fold cross-validation, which led to the finding that the proposed method is superior to these popular methods on most of the performance evaluation indexes. Author summary Identifying miRNA-disease associations accelerates the understanding towards pathogenicity, which is beneficial for the development of treatment tools for diseases. Different from existing methods, our GCSENet captures the deep relationship between miRNA and disease through three heterogeneous graphs (disease, gene and miRNA) to promote an accurate prediction result. We performed the 10-fold cross validation to evaluate the performance of GCSENet, which can outperform many classic methods. Furthermore, we carried out case studies on four important diseases, which were used to evaluate the performance of our model regarding to the associations with experimental evidences in literature. The result shows that most predicted miRNAs (48 for lung neoplasms, 48 for heart failure, 48 for breast cancer and 50 for glioblastoma) in the top 50 predictions were confirmed in HMDD v3.0. As a result, it shows that GCSENet can make reliable predictions and guide experiments to uncover more miRNA-disease associations.

National Category
Biological Sciences
Identifiers
urn:nbn:se:su:diva-196263 (URN)10.1371/journal.pcbi.1009048 (DOI)000664326400002 ()34081706 (PubMedID)
Available from: 2021-09-06 Created: 2021-09-06 Last updated: 2022-02-25Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-2730-6427

Search in DiVA

Show all publications