Change search
Link to record
Permanent link

Direct link
Publications (4 of 4) Show all publications
Taheri, G., Szalai, M., Habibi, M. & Papapetrou, P. (2025). Unveiling Driver Modules in Lung Cancer: A Clustering-Based Gene-Gene Interaction Network Analysis. In: Rosa Meo, Fabrizio Silvestri (Ed.), Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV. Paper presented at International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023 (pp. 41-58). Springer
Open this publication in new window or tab >>Unveiling Driver Modules in Lung Cancer: A Clustering-Based Gene-Gene Interaction Network Analysis
2025 (English)In: Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023, Revised Selected Papers, Part IV / [ed] Rosa Meo, Fabrizio Silvestri, Springer, 2025, p. 41-58Conference paper, Published paper (Refereed)
Abstract [en]

Lung cancer, which is the leading cause of cancer-related death worldwide and is characterized by genetic changes and heterogeneity, presents a significant treatment challenge. Existing approaches utilizing Machine Learning (ML) techniques for identifying driver modules lack specificity, particularly for lung cancer. This study addresses this limitation by proposing a novel method that combines gene-gene interaction network construction with ML-based clustering to identify lung cancer-specific driver modules. The methodology involves mapping biological processes to genes and constructing a weighted gene-gene interaction network to identify correlations within gene clusters. A clustering algorithm is then applied to identify potential cancer-driver modules, focusing on biologically relevant modules that contribute to lung cancer development. The results highlight the effectiveness and robustness of the clustering approach, identifying 110 unique clusters ranging in size from 4 to 10. These clusters surpass evaluation requirements and demonstrate significant relevance to critical cancer-related pathways. The identified driver modules hold promise for influencing future approaches to lung cancer diagnosis, prognosis, and treatment. This research expands our understanding of lung cancer and sets the stage for further investigations and potential clinical advancements.

Place, publisher, year, edition, pages
Springer, 2025
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 2136 CCIS
Keywords
Driver Modules, Gene-Gene Interaction Network, Lung Cancer, Machine Learning
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:su:diva-240214 (URN)10.1007/978-3-031-74640-6_4 (DOI)2-s2.0-85216025737 (Scopus ID)9783031746390 (ISBN)
Conference
International Workshops of ECML PKDD 2023, Turin, Italy, September 18–22, 2023
Available from: 2025-03-06 Created: 2025-03-06 Last updated: 2025-03-06Bibliographically approved
Taheri, G. & Habibi, M. (2024). Uncovering driver genes in breast cancer through an innovative machine learning mutational analysis method. Computers in Biology and Medicine, 171, Article ID 108234.
Open this publication in new window or tab >>Uncovering driver genes in breast cancer through an innovative machine learning mutational analysis method
2024 (English)In: Computers in Biology and Medicine, ISSN 0010-4825, E-ISSN 1879-0534, Vol. 171, article id 108234Article in journal (Refereed) Published
Abstract [en]

Breast cancer has become a severe public health concern and one of the leading causes of cancer-related death in women worldwide. Several genes and mutations in these genes linked to breast cancer have been identified using sophisticated techniques, despite the fact that the exact cause of breast cancer is still unknown. A commonly used feature for identifying driver mutations is the recurrence of a mutation in patients. Nevertheless, some mutations are more likely to occur than others for various reasons. Sequencing analysis has shown that cancer-driving genes operate across complex networks, often with mutations appearing in a modular pattern. In this work, as a retrospective study, we used TCGA data, which is gathered from breast cancer patients. We introduced a new machine-learning approach to examine gene functionality in networks derived from mutation associations, gene-gene interactions, and graph clustering for breast cancer analysis. These networks have uncovered crucial biological components in critical pathways, particularly those that exhibit low-frequency mutations. The statistical strength of the clinical study is significantly boosted by evaluating the network as a whole instead of just single gene effects. Our method successfully identified essential driver genes with diverse mutation frequencies. We then explored the functions of these potential driver genes and their related pathways. By uncovering low-frequency genes, we shed light on understudied pathways associated with breast cancer. Additionally, we present a novel Monte Carlo-based algorithm to identify driver modules in breast cancer. Our findings highlight the significance and role of these modules in critical signaling pathways in breast cancer, providing a comprehensive understanding of breast cancer development. Materials and implementations are available at: [https://github.com/MahnazHabibi/BreastCancer].

Keywords
Breast cancer, Driver genes, Machine learning
National Category
Medical Biotechnology (with a focus on Cell Biology (including Stem Cell Biology), Molecular Biology, Microbiology, Biochemistry or Biopharmacy)
Identifiers
urn:nbn:se:su:diva-235864 (URN)10.1016/j.compbiomed.2024.108234 (DOI)38430742 (PubMedID)2-s2.0-85186265125 (Scopus ID)
Available from: 2024-12-02 Created: 2024-12-02 Last updated: 2024-12-09Bibliographically approved
Taheri, G. & Habibi, M. (2023). Identification of essential genes associated with SARS-CoV-2 infection as potential drug target candidates with machine learning algorithms. Scientific Reports, 13(1), Article ID 15141.
Open this publication in new window or tab >>Identification of essential genes associated with SARS-CoV-2 infection as potential drug target candidates with machine learning algorithms
2023 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 13, no 1, article id 15141Article in journal (Refereed) Published
Abstract [en]

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) requires the fast discovery of effective treatments to fight this worldwide concern. Several genes associated with the SARS-CoV-2, which are essential for its functionality, pathogenesis, and survival, have been identified. These genes, which play crucial roles in SARS-CoV-2 infection, are considered potential therapeutic targets. Developing drugs against these essential genes to inhibit their regular functions could be a good approach for COVID-19 treatment. Artificial intelligence and machine learning methods provide powerful infrastructures for interpreting and understanding the available data and can assist in finding fast explanations and cures. We propose a method to highlight the essential genes that play crucial roles in SARS-CoV-2 pathogenesis. For this purpose, we define eleven informative topological and biological features for the biological and PPI networks constructed on gene sets that correspond to COVID-19. Then, we use three different unsupervised learning algorithms with different approaches to rank the important genes with respect to our defined informative features. Finally, we present a set of 18 important genes related to COVID-19. Materials and implementations are available at: https://github.com/MahnazHabibi/Gene_analysis.

National Category
Bioinformatics and Computational Biology Infectious Medicine
Identifiers
urn:nbn:se:su:diva-223226 (URN)10.1038/s41598-023-42127-9 (DOI)001067753600005 ()37704748 (PubMedID)2-s2.0-85171162453 (Scopus ID)
Available from: 2023-11-06 Created: 2023-11-06 Last updated: 2025-02-05Bibliographically approved
Habibi, M. & Taheri, G. (2022). A new machine learning method for cancer mutation analysis. PloS Computational Biology, 18(10), Article ID e1010332.
Open this publication in new window or tab >>A new machine learning method for cancer mutation analysis
2022 (English)In: PloS Computational Biology, ISSN 1553-734X, E-ISSN 1553-7358, Vol. 18, no 10, article id e1010332Article in journal (Refereed) Published
Abstract [en]

It is complicated to identify cancer-causing mutations. The recurrence of a mutation in patients remains one of the most reliable features of mutation driver status. However, some mutations are more likely to happen than others for various reasons. Different sequencing analysis has revealed that cancer driver genes operate across complex pathways and networks, with mutations often arising in a mutually exclusive pattern. Genes with low-frequency mutations are understudied as cancer-related genes, especially in the context of networks. Here we propose a machine learning method to study the functionality of mutually exclusive genes in the networks derived from mutation associations, gene-gene interactions, and graph clustering. These networks have indicated critical biological components in the essential pathways, especially those mutated at low frequency. Studying the network and not just the impact of a single gene significantly increases the statistical power of clinical analysis. The proposed method identified important driver genes with different frequencies. We studied the function and the associated pathways in which the candidate driver genes participate. By introducing lower-frequency genes, we recognized less studied cancer-related pathways. We also proposed a novel clustering method to specify driver modules. We evaluated each driver module with different criteria, including the terms of biological processes and the number of simultaneous mutations in each cancer. Materials and implementations are available at: https://github.com/MahnazHabibi/MutationAnalysis.

National Category
Medical Genetics and Genomics
Identifiers
urn:nbn:se:su:diva-215897 (URN)10.1371/journal.pcbi.1010332 (DOI)000924885500001 ()36251702 (PubMedID)2-s2.0-85140933352 (Scopus ID)
Available from: 2023-03-29 Created: 2023-03-29 Last updated: 2025-02-10Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-2741-0355

Search in DiVA

Show all publications