Planned maintenance
A system upgrade is planned for 10/12-2024, at 12:00-13:00. During this time DiVA will be unavailable.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
InParanoiDB 9: Ortholog Groups for Protein Domains and Full-Length Proteins
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0003-0532-8251
Stockholm University, Faculty of Science, Department of Biochemistry and Biophysics. Stockholm University, Science for Life Laboratory (SciLifeLab).ORCID iD: 0000-0002-9015-5588
Number of Authors: 22023 (English)In: Journal of Molecular Biology, ISSN 0022-2836, E-ISSN 1089-8638, Vol. 435, no 14, article id 168001Article in journal (Refereed) Published
Abstract [en]

Prediction of orthologs is an important bioinformatics pursuit that is frequently used for inferring protein function and evolutionary analyses. The InParanoid database is a well known resource of ortholog predictions between a wide variety of organisms. Although orthologs have historically been inferred at the level of full-length protein sequences, many proteins consist of several independent protein domains that may be orthologous to domains in other proteins in a way that differs from the full-length protein case. To be able to capture all types of orthologous relations, conventional full-length protein orthologs can be complemented with orthologs inferred at the domain level. We here present InParanoiDB 9, covering 640 species and providing orthologs for both protein domains and full-length proteins. InParanoiDB 9 was built using the faster InParanoid-DIAMOND algorithm for orthology analysis, as well as Domainoid and Pfam to infer orthologous domains. InParanoiDB 9 is based on proteomes from 447 eukaryotes, 158 bacteria and 35 archaea, and includes over one billion predicted ortholog groups. A new website has been built for the database, providing multiple search options as well as visualization of groups of orthologs and orthologous domains. This release constitutes a major upgrade of the InParanoid database in terms of the number of species as well as the new capability to operate on the domain level. InParanoiDB 9 is available at https://inparanoidb.sbc.su.se/.

Place, publisher, year, edition, pages
2023. Vol. 435, no 14, article id 168001
Keywords [en]
ortholog, InParanoid, orthologous domain, protein domain, ortholog database
National Category
Bioinformatics and Systems Biology
Identifiers
URN: urn:nbn:se:su:diva-220951DOI: 10.1016/j.jmb.2023.168001ISI: 001054111000001PubMedID: 36764355Scopus ID: 2-s2.0-85148362111OAI: oai:DiVA.org:su-220951DiVA, id: diva2:1797677
Available from: 2023-09-15 Created: 2023-09-15 Last updated: 2023-10-16Bibliographically approved
In thesis
1. Big data networks and orthology analysis
Open this publication in new window or tab >>Big data networks and orthology analysis
2023 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Understanding biological systems in complex organisms is important in life science in order to comprehend the interplay of genes, proteins, and compounds causing complex diseases. As biological systems are intricate, bioinformatics tools, models, and algorithms are of the utmost importance to understand the bigger picture and decipher biological meaning from the vast amounts of information available from biological experiments and predictions. Bioinformatics programs and algorithms do not only depend on information from experiments, but also on information generated from other tools in order to draw accurate conclusions and make predictions. 

Prediction of orthologs, genes having a common ancestry, separated by a speciation event, are important building blocks for a wide variety of tools and analysis pipelines, as they can be used to transfer gene function between species. Orthologs can for example be used to map genes of model organisms to genes in humans in studies of drug targets. They are extensively used in functional association networks in order to transfer information between species. Functional association networks are models of associations between genes or proteins, where associations can be derived from experimental evidence of different types, from the species itself, or transferred from other species using orthologs. The networks can be used to explore the context and neighbors of a gene, but also for a variety of higher-level analyses, e.g. network-based pathway enrichment analysis. In pathway enrichment analysis the networks can be utilized to contextualize experimental gene sets and annotate them with biological functions. As these tools depend on each other, it is of great importance that the networks used in pathway enrichment analysis are comprehensive and accurate, and that the orthologs used in the networks are relevant and significant. 

In this thesis, the development and improvement of five bioinformatics tools within three areas of bioinformatics are presented. Despite the tools residing within slightly different areas, they all rely on each other, and can all on different levels improve our understanding of biological functions and biological meaning, from the level of orthology analysis to functional association networks to pathway enrichment analysis.

Place, publisher, year, edition, pages
Stockholm: Department of Biochemistry and Biophysics, Stockholm University, 2023. p. 67
Keywords
Ortholog, protein domain, functional association network, pathway enrichment analysis
National Category
Bioinformatics and Systems Biology
Research subject
Biochemistry towards Bioinformatics
Identifiers
urn:nbn:se:su:diva-222146 (URN)978-91-8014-548-0 (ISBN)978-91-8014-549-7 (ISBN)
Public defence
2023-12-01, Air & Fire, SciLifeLab, Tomtebodavägen 23A, and online via Zoom, public link is available at the department website, Solna, 15:00 (English)
Opponent
Supervisors
Available from: 2023-11-08 Created: 2023-10-16 Last updated: 2023-10-27Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textPubMedScopus

Authority records

Persson, EmmaSonnhammer, Erik L. L.

Search in DiVA

By author/editor
Persson, EmmaSonnhammer, Erik L. L.
By organisation
Department of Biochemistry and BiophysicsScience for Life Laboratory (SciLifeLab)
In the same journal
Journal of Molecular Biology
Bioinformatics and Systems Biology

Search outside of DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetric score

doi
pubmed
urn-nbn
Total: 45 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf