Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Computational pan-genomics: status, promises and challenges
Visa övriga samt affilieringar
Antal upphovsmän: 592018 (Engelska)Ingår i: Briefings in Bioinformatics, ISSN 1467-5463, E-ISSN 1477-4054, Vol. 19, nr 1, s. 118-135Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

Many disciplines, from human genetics and oncology to plant breeding, microbiology and virology, commonly face the challenge of analyzing rapidly increasing numbers of genomes. In case of Homo sapiens, the number of sequenced genomes will approach hundreds of thousands in the next few years. Simply scaling up established bioinformatics pipelines will not be sufficient for leveraging the full potential of such rich genomic data sets. Instead, novel, qualitatively different computational methods and paradigms are needed. We will witness the rapid extension of computational pan-genomics, a new sub-area of research in computational biology. In this article, we generalize existing definitions and understand a pan-genome as any collection of genomic sequences to be analyzed jointly or to be used as a reference. We examine already available approaches to construct and use pan-genomes, discuss the potential benefits of future technologies and methodologies and review open challenges from the vantage point of the above-mentioned biological disciplines. As a prominent example for a computational paradigm shift, we particularly highlight the transition from the representation of reference genomes as strings to representations as graphs. We outline how this and other challenges from different application domains translate into common computational problems, point out relevant bioinformatics techniques and identify open problems in computer science. With this review, we aim to increase awareness that a joint approach to computational pan-genomics can help address many of the problems currently faced in various domains.

Ort, förlag, år, upplaga, sidor
2018. Vol. 19, nr 1, s. 118-135
Nyckelord [en]
pan-genome, sequence graph, read mapping, haplotypes, data structures
Nationell ämneskategori
Bioinformatik (beräkningsbiologi) Biologiska vetenskaper
Identifikatorer
URN: urn:nbn:se:su:diva-153888DOI: 10.1093/bib/bbw089ISI: 000423311000011PubMedID: 27769991OAI: oai:DiVA.org:su-153888DiVA, id: diva2:1188381
Tillgänglig från: 2018-03-07 Skapad: 2018-03-07 Senast uppdaterad: 2022-03-23Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Övriga länkar

Förlagets fulltextPubMed

Person

Dutilh, Bas E.Makinen, VeliAlkan, CanMartin, Marcel

Sök vidare i DiVA

Av författaren/redaktören
Dutilh, Bas E.Makinen, VeliAlkan, CanMartin, Marcel
Av organisationen
Institutionen för biokemi och biofysikScience for Life Laboratory (SciLifeLab)
I samma tidskrift
Briefings in Bioinformatics
Bioinformatik (beräkningsbiologi)Biologiska vetenskaper

Sök vidare utanför DiVA

GoogleGoogle Scholar

doi
pubmed
urn-nbn

Altmetricpoäng

doi
pubmed
urn-nbn
Totalt: 61 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf