Change search
ReferencesLink to record
Permanent link

Direct link
GAM-NGS: genomic assemblies merger for next generation sequencing
Stockholm University, Faculty of Science, Numerical Analysis and Computer Science (NADA).
Show others and affiliations
2013 (English)In: BMC Bioinformatics, ISSN 1471-2105, Vol. 14, no Suppl.7, S6- p.Article in journal (Refereed) Published
Abstract [en]

Background: In recent years more than 20 assemblers have been proposed to tackle the hard task of assembling NGS data. A common heuristic when assembling a genome is to use several assemblers and then select the best assembly according to some criteria. However, recent results clearly show that some assemblers lead to better statistics than others on specific regions but are outperformed on other regions or on different evaluation measures. To limit these problems we developed GAM-NGS (Genomic Assemblies Merger for Next Generation Sequencing), whose primary goal is to merge two or more assemblies in order to enhance contiguity and correctness of both. GAM-NGS does not rely on global alignment: regions of the two assemblies representing the same genomic locus (called blocks) are identified through reads' alignments and stored in a weighted graph. The merging phase is carried out with the help of this weighted graph that allows an optimal resolution of local problematic regions. Results: GAM-NGS has been tested on six different datasets and compared to other assembly reconciliation tools. The availability of a reference sequence for three of them allowed us to show how GAM-NGS is a tool able to output an improved reliable set of sequences. GAM-NGS is also a very efficient tool able to merge assemblies using substantially less computational resources than comparable tools. In order to achieve such goals, GAM-NGS avoids global alignment between contigs, making its strategy unique among other assembly reconciliation tools. Conclusions: The difficulty to obtain correct and reliable assemblies using a single assembler is forcing the introduction of new algorithms able to enhance de novo assemblies. GAM-NGS is a tool able to merge two or more assemblies in order to improve contiguity and correctness. It can be used on all NGS-based assembly projects and it shows its full potential with multi-library Illumina-based projects. With more than 20 available assemblers it is hard to select the best tool. In this context we propose a tool that improves assemblies (and, as a by-product, perhaps even assemblers) by merging them and selecting the generating that is most likely to be correct.

Place, publisher, year, edition, pages
BioMed Central, 2013. Vol. 14, no Suppl.7, S6- p.
National Category
Biochemistry and Molecular Biology Microbiology Mathematical Analysis
URN: urn:nbn:se:su:diva-91547DOI: 10.1186/1471-2105-14-S7-S6ISI: 000318869400006OAI: diva2:634144
Knut and Alice Wallenberg Foundation


Available from: 2013-06-28 Created: 2013-06-28 Last updated: 2013-06-28Bibliographically approved

Open Access in DiVA

No full text

Other links

Publisher's full text

Search in DiVA

By author/editor
Arvestad, Lars
By organisation
Numerical Analysis and Computer Science (NADA)
In the same journal
BMC Bioinformatics
Biochemistry and Molecular BiologyMicrobiologyMathematical Analysis

Search outside of DiVA

GoogleGoogle Scholar
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

Altmetric score

Total: 20 hits
ReferencesLink to record
Permanent link

Direct link