Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
From networks to pathway analysis
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics.ORCID iD: 0000-0002-4665-6537
2021 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Biological mechanisms stem from complex intracellular interactions spanning across different levels of regulation. Mapping these interactions is fundamental for the understanding of all types of biological conditions, including complex diseases. Each experimental approach carries its own bias and noise. Combining heterogeneous data sources reduces noise and gives a broader sense of the interactions between genes known as functional association, where both direct and indirect interactions are captured.

FunCoup is one of the most comprehensive functional association databases, providing networks for 22 organisms in all domains of life. FunCoup uses a naïve Bayesian integration approach to combine 11 different data types and increases the coverage by transferring associations between species via orthologs. Additional insights into the mechanisms of a gene network are provided through tissue specificity filtering and directed regulatory links.

Even though FunCoup provides a comprehensive map of the intracellular machinery, gaining insights into conditions such as diseases requires a functional level analysis rather than a gene level analysis. Thus, studying genes that are involved in a condition from a functional perspective requires the usage of pathway enrichment analysis. Several approaches exist, from basic gene overlap to more elaborate analyses that use functional association networks. ANUBIX is a novel network-based analysis (NBA) method that overcomes the high false positive rate issue that previous state-of-the-art NBA approaches have. Additionally, even with accurate methods, a commonly ignored problem is that gene sets derived from experiments are often noisy or contain multiple mechanisms, mixing different pathways which weakens their association to the condition under study. To increase the sensitivity of pathway analysis, we developed a pipeline to cluster gene sets into more homogeneous parts with the aim of unraveling all the mechanisms activated in the studied condition. To facilitate the usage of these tools, we built a web server called PathBIX, a user-friendly platform that allows interactive analysis of all species in FunCoup against multiple pathway databases.

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University , 2021. , p. 77
National Category
Bioinformatics and Computational Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
URN: urn:nbn:se:su:diva-195048ISBN: 978-91-7911-564-7 (print)ISBN: 978-91-7911-565-4 (electronic)OAI: oai:DiVA.org:su-195048DiVA, id: diva2:1582631
Public defence
2021-09-17, Air and Fire, SciLifeLab, Tomtebodavägen 23, Solna, 14:00 (English)
Opponent
Supervisors
Available from: 2021-08-25 Created: 2021-08-03 Last updated: 2025-02-07Bibliographically approved
List of papers
1. FunCoup 5: Functional Association Networks in All Domains of Life, Supporting Directed Links and Tissue-Specificity
Open this publication in new window or tab >>FunCoup 5: Functional Association Networks in All Domains of Life, Supporting Directed Links and Tissue-Specificity
Show others...
2021 (English)In: Journal of Molecular Biology, ISSN 0022-2836, E-ISSN 1089-8638, Vol. 433, article id 166835Article in journal (Refereed) Published
Abstract [en]

FunCoup (https://funcoup.sbc.su.se) is one of the most comprehensive functional association networks of genes/proteins available. Functional associations are inferred by integrating different types of evidence using a redundancy-weighted naïve Bayesian approach, combined with orthology transfer. FunCoup's high coverage comes from using eleven different types of evidence, and extensive transfer of information between species. Since the latest update of the database, the availability of source data has improved drastically, and user expectations on a tool for functional associations have grown. To meet these requirements, we have made a new release of FunCoup with updated source data and improved functionality. FunCoup 5 now includes 22 species from all domains of life, and the source data for evidences, gold standards, and genomes have been updated to the latest available versions. In this new release, directed regulatory links inferred from transcription factor binding can be visualized in the network viewer for the human interactome. Another new feature is the possibility to filter by genes expressed in a certain tissue in the network viewer. FunCoup 5 further includes the SARS-CoV-2 proteome, allowing users to visualize and analyze interactions between SARS-CoV-2 and human proteins in order to better understand COVID-19. This new release of FunCoup constitutes a major advance for the users, with updated sources, new species and improved functionality for analysis of the networks.

Keywords
Bayesian integration; SARS-CoV-2; functional association network; gene regulatory network; protein network; tissue-specific network.
National Category
Biological Sciences
Identifiers
urn:nbn:se:su:diva-195046 (URN)10.1016/j.jmb.2021.166835 (DOI)000648520800016 ()
Available from: 2021-08-02 Created: 2021-08-02 Last updated: 2024-11-04Bibliographically approved
2. Pathway-specific model estimation for improved pathway annotation by network crosstalk
Open this publication in new window or tab >>Pathway-specific model estimation for improved pathway annotation by network crosstalk
2020 (English)In: Scientific Reports, E-ISSN 2045-2322, Vol. 10, no 1, article id 13585Article in journal (Refereed) Published
Abstract [en]

Pathway enrichment analysis is the most common approach for understanding which biological processes are affected by altered gene activities under specific conditions. However, it has been challenging to find a method that efficiently avoids false positives while keeping a high sensitivity. We here present a new network-based method ANUBIX based on sampling random gene sets against intact pathway. Benchmarking shows that ANUBIX is considerably more accurate than previous network crosstalk based methods, which have the drawback of modelling pathways as random gene sets. We demonstrate that ANUBIX does not have a bias for finding certain pathways, which previous methods do, and show that ANUBIX finds biologically relevant pathways that are missed by other methods.

National Category
Biological Sciences
Identifiers
urn:nbn:se:su:diva-185368 (URN)10.1038/s41598-020-70239-z (DOI)000563534600007 ()32788619 (PubMedID)
Available from: 2020-10-16 Created: 2020-10-16 Last updated: 2022-09-15Bibliographically approved
3. Clustered Pathway Analysis
Open this publication in new window or tab >>Clustered Pathway Analysis
(English)Manuscript (preprint) (Other academic)
Abstract [en]

Motivation: Functional analysis of gene sets derived from experiments is typically done by pathway annotation. Although many algorithms exist for analyzing the association between a gene set and a pathway, an issue which is generally ignored is that gene sets often represent multiple pathways. In such cases an association to a pathway is weakened by the presence of genes associated with other pathways. A way to counteract this is to cluster the gene set into more homogenous parts before performing pathway analysis on each cluster.

Results: We explored whether network-based pre-clustering of a query gene set can improve pathway analysis. The methods MCL, Infomap, and MGclus were used to cluster the gene set projected onto the FunCoup network. We characterized how well these methods are able to detect individual pathways in multi-pathway gene sets, and applied each of the clustering methods in combination with four pathway analysis methods: Gene Enrichment Analysis, BinoX, NEAT, and ANUBIX. Using benchmarks constructed from the KEGG pathway database we found that clustering substantially increased the sensitivity of pathway analysis methods. For ANUBIX this came with almost no loss of specificity, while for BinoX and NEAT the specificity decreased roughly as much as the sensitivity increased. GEA had very low sensitivity both before and after clustering. The choice of clustering method only had a minor effect on the results. We conclude that clustering can improve overall pathway annotation performance, but only if the used enrichment method has a low false positive rate. 

Availability and Implementation: https://bitbucket.org/sonnhammergroup/clustering-and-pathway-enrichment/

National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:su:diva-195034 (URN)
Available from: 2021-08-02 Created: 2021-08-02 Last updated: 2025-02-07Bibliographically approved
4. PathBIX—a web server for network-based pathway annotation with adaptive null models
Open this publication in new window or tab >>PathBIX—a web server for network-based pathway annotation with adaptive null models
2021 (English)In: Bioinformatics Advances, E-ISSN 2635-0041, Vol. 1, no 1, article id vbab010Article in journal (Refereed) Published
Abstract [en]

Motivation: Pathway annotation is a vital tool for interpreting and giving meaning to experimental data in life sciences. Numerous tools exist for this task, where the most recent generation of pathway enrichment analysis tools, network-based methods, utilize biological networks to gain a richer source of information as a basis of the analysis than merely the gene content. Network-based methods use the network crosstalk between the query gene set and the genes in known pathways, and compare this to a null model of random expectation.

Results: We developed PathBIX, a novel web application for network-based pathway analysis, based on the recently published ANUBIX algorithm which has been shown to be more accurate than previous network-based methods. The PathBIX website performs pathway annotation for 21 species, and utilizes prefetched and preprocessed network data from FunCoup 5.0 networks and pathway data from three databases: KEGG, Reactome, and WikiPathways.

National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:su:diva-195030 (URN)10.1093/bioadv/vbab010 (DOI)
Available from: 2021-08-02 Created: 2021-08-02 Last updated: 2025-02-07Bibliographically approved

Open Access in DiVA

From networks to pathway analysis(5568 kB)1948 downloads
File information
File name FULLTEXT01.pdfFile size 5568 kBChecksum SHA-512
a5b5bb34e758e1d7957b540a1910432ae46308d36049182d6a24be9dd733106a27bfb350c36a209d8e1b7d6245a40744221b68e2b7225fe4f89cafc62a7e4449
Type fulltextMimetype application/pdf

Authority records

Castresana Aguirre, Miguel

Search in DiVA

By author/editor
Castresana Aguirre, Miguel
By organisation
Department of Biochemistry and Biophysics
Bioinformatics and Computational Biology

Search outside of DiVA

GoogleGoogle Scholar
Total: 1950 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 1252 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf