Change search
Link to record
Permanent link

Direct link
Publications (10 of 35) Show all publications
Hellmuth, M. & Lindeberg, A. (2026). Characterizing and transforming DAGs within the ℐ-lca framework. Discrete Applied Mathematics, 378, 584-593
Open this publication in new window or tab >>Characterizing and transforming DAGs within the ℐ-lca framework
2026 (English)In: Discrete Applied Mathematics, ISSN 0166-218X, E-ISSN 1872-6771, Vol. 378, p. 584-593Article in journal (Refereed) Published
Abstract [en]

We explore the connections between clusters and least common ancestors (LCAs) in directed acyclic graphs (DAGs), focusing on the interplay between so-called ℐ-lca-relevant DAGs and DAGs with the ℐ-lca-property. Here, ℐ denotes a set of integers. In ℐ-lca-relevant DAGs, each vertex is the unique LCA for some subset A of leaves of size |A|∈ℐ, whereas in a DAG with the ℐ-lca-property there exists a unique LCA for every subset A of leaves satisfying |A|∈ℐ. We elaborate on the difference between these two properties and establish their close relationship to pre-ℐ-ary and ℐ-ary set systems. This, in turn, generalizes results established for (pre-) binary and k-ary set systems. Moreover, we build upon recently established results that use a simple operator ⊖, enabling the transformation of arbitrary DAGs into ℐ-lca-relevant DAGs. This process reduces unnecessary complexity while preserving key structural properties of the original DAG. The set ℭG consists of all clusters in a DAG G, where clusters correspond to the descendant leaves of vertices. While in some cases ℭH=ℭG when transforming G into an ℐ-lca-relevant DAG H, it often happens that certain clusters in ℭG do not appear as clusters in H. To understand this phenomenon in detail, we characterize the subset of clusters in ℭG that remain in H for DAGs G with the ℐ-lca-property. Furthermore, we show that the set W of vertices required to transform G into H=GW is uniquely determined for such DAGs. This, in turn, allows us to show that the “shortcut-free” version of the transformed DAG H is always a tree or a galled-tree whenever ℭG represents the clustering system of a tree or galled-tree and G has the ℐ-lca-property. In the latter case ℭH=ℭG always holds.

Keywords
Cluster, Galled-tree, Hasse diagram, Hierarchy, I-ary set systems, Phylogenetic network, Regular DAGs
National Category
Discrete Mathematics
Identifiers
urn:nbn:se:su:diva-246610 (URN)10.1016/j.dam.2025.08.037 (DOI)001565640600001 ()2-s2.0-105014613207 (Scopus ID)
Available from: 2025-09-15 Created: 2025-09-15 Last updated: 2025-09-15Bibliographically approved
Lindeberg, A., Scholz, G. E. & Hellmuth, M. (2025). Network Representation and Modular Decomposition of Combinatorial Structures: A Galled-Tree Perspective. Vietnam Journal of Mathematics
Open this publication in new window or tab >>Network Representation and Modular Decomposition of Combinatorial Structures: A Galled-Tree Perspective
2025 (English)In: Vietnam Journal of Mathematics, ISSN 2305-221X, E-ISSN 2305-2228Article in journal (Refereed) Epub ahead of print
Abstract [en]

In phylogenetics, reconstructing rooted trees from distances between taxa is a common task. Böcker and Dress generalized this concept by introducing symbolic dating maps δ, where distances are replaced by symbols, and showed that there is a one-to-one correspondence between symbolic ultrametrics and labeled rooted phylogenetic trees. Many combinatorial structures fall under the umbrella of symbolic dating maps, such as 2-dissimilarities, symmetric labeled 2-structures, or edge-colored complete graphs, and are here referred to as strudigrams. Strudigrams have a unique decomposition into non-overlapping modules, which can be represented by a modular decomposition tree (MDT). In the absence of prime modules, strudigrams are equivalent to symbolic ultrametrics, and the MDT fully captures the relationships between pairs of vertices through the label of their least common ancestor in the MDT. However, in the presence of prime modules, appearing as prime vertices in the MDT, this information is generally hidden. To provide this missing structural information, we aim to locally replace the prime vertices in the MDT to obtain networks that capture full information about the strudigrams. While starting with the general framework of prime-vertex replacement networks, we then focus on a specific type of such networks obtained by replacing prime vertices with so-called galls, resulting in labeled galled-trees. We introduce the concept of galled-tree explainable (GaTEx) strudigrams, provide their characterization, and demonstrate that recognizing these structures and reconstructing the labeled networks that explain them can be achieved in polynomial time.

Keywords
Dissimilarities, Symbolic ultrametrics, 2-structures, Edge-colored graphs, Strudigram, Prime-vertex replacement, Phylogenetic network, Clustering systems
National Category
Computational Mathematics
Identifiers
urn:nbn:se:su:diva-247033 (URN)10.1007/s10013-025-00747-w (DOI)001572007400001 ()2-s2.0-105016666231 (Scopus ID)
Funder
Stockholm University
Available from: 2025-09-17 Created: 2025-09-17 Last updated: 2025-10-28
Hellmuth, M. & Thekkumpadan Puthiyaveedu, S. (2025). On a generalization of median graphs: k-median graphs. Ars Mathematica Contemporanea, 25(3), Article ID P3.06.
Open this publication in new window or tab >>On a generalization of median graphs: k-median graphs
2025 (English)In: Ars Mathematica Contemporanea, ISSN 1855-3966, E-ISSN 1855-3974, Vol. 25, no 3, article id P3.06Article in journal (Refereed) Published
Abstract [en]

Median graphs are connected graphs in which for all three vertices there is a unique vertex that belongs to shortest paths between each pair of these three vertices. To be more formal, a graph G is a median graph if, for all μ, u, v ∈ V(G), it holds that |I(μ, u) ∩ I(μ, v) ∩ I(u, v)| = 1 where I(x, y) denotes the set of all vertices that lie on shortest paths connecting x and y.

In this paper we are interested in a natural generalization of median graphs, called k-median graphs. A graph G is a k-median graph, if there are k vertices μ1, …, μk ∈ V(G) such that, for all u, v ∈ V(G), it holds that |I(μi, u) ∩ I(μi, v) ∩ I(u, v)| = 1, 1 ≤ i ≤ k. By definition, every median graph with n vertices is an n-median graph. We provide several characterizations of k-median graphs that, in turn, are used to provide many novel characterizations of median graphs.

Keywords
Median graph, convexity, meshed and quadrangle property, modular, interval
National Category
Discrete Mathematics
Identifiers
urn:nbn:se:su:diva-248823 (URN)10.26493/1855-3974.3134.87b (DOI)001506824200001 ()2-s2.0-105007171405 (Scopus ID)
Available from: 2025-11-03 Created: 2025-11-03 Last updated: 2025-11-03Bibliographically approved
Lindeberg, A., Scholz, G. E., Wieseke, N. & Hellmuth, M. (2025). Orthology and near-cographs in the context of phylogenetic networks. Algorithms for Molecular Biology, 20, Article ID 19.
Open this publication in new window or tab >>Orthology and near-cographs in the context of phylogenetic networks
2025 (English)In: Algorithms for Molecular Biology, E-ISSN 1748-7188, Vol. 20, article id 19Article in journal (Refereed) Published
Abstract [en]

Orthologous genes, which arise through speciation, play a key role in comparative genomics and functional inference. In particular, graph-based methods allow for the inference of orthology estimates without prior knowledge of the underlying gene or species trees. This results in orthology graphs, where each vertex represents a gene, and an edge exists between two vertices if the corresponding genes are estimated to be orthologs. Orthology graphs inferred under a tree-like evolutionary model must be cographs. However, real-world data often deviate from this property, either due to noise in the data, errors in inference methods or, simply, because evolution follows a network-like rather than a tree-like process. The latter, in particular, raises the question of whether and how orthology graphs can be derived from or, equivalently, are explained by phylogenetic networks. In this work, we study the constraints imposed on orthology graphs when the underlying evolutionary history follows a phylogenetic network instead of a tree. We show that any orthology graph can be represented by a sufficiently complex level-k network. However, such networks lack biologically meaningful constraints. In contrast, level-1 networks provide a simpler explanation, and we establish characterizations for level-1 explainable orthology graphs, i.e., those derived from level-1 evolutionary histories. To this end, we employ modular decomposition, a classical technique for studying graph structures. Specifically, an arbitrary graph is level-1 explainable if and only if each primitive subgraph is a near-cograph (a graph in which the removal of a single vertex results in a cograph). Additionally, we present a linear-time algorithm to recognize level-1 explainable orthology graphs and to construct a level-1 network that explains them, if such a network exists. Finally, we demonstrate the close relationship of level-1 explainable orthology graphs to the substitution operation, weakly chordal and perfect graphs, as well as graphs with twin-width at most 2.

Keywords
Homology, Level-1 network, Linear-time algorithm, Modular decomposition, Near cograph, Perfect graph, Twin-width
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:su:diva-248256 (URN)10.1186/s13015-025-00285-7 (DOI)001585586100001 ()2-s2.0-105017895424 (Scopus ID)
Available from: 2025-10-22 Created: 2025-10-22 Last updated: 2025-10-22Bibliographically approved
Lindeberg, A. & Hellmuth, M. (2025). Simplifying and Characterizing DAGs and Phylogenetic Networks via Least Common Ancestor Constraints. Bulletin of Mathematical Biology, 87(3), Article ID 44.
Open this publication in new window or tab >>Simplifying and Characterizing DAGs and Phylogenetic Networks via Least Common Ancestor Constraints
2025 (English)In: Bulletin of Mathematical Biology, ISSN 0092-8240, E-ISSN 1522-9602, Vol. 87, no 3, article id 44Article in journal (Refereed) Published
Abstract [en]

Rooted phylogenetic networks, or more generally, directed acyclic graphs (DAGs), are widely used to model species or gene relationships that traditional rooted trees cannot fully capture, especially in the presence of reticulate processes or horizontal gene transfers. Such networks or DAGs are typically inferred from observable data (e.g., genomic sequences of extant species), providing only an estimate of the true evolutionary history. However, these inferred DAGs are often complex and difficult to interpret. In particular, many contain vertices that do not serve as least common ancestors (LCAs) for any subset of the underlying genes or species, thus may lack direct support from the observable data. In contrast, LCA vertices are witnessed by historical traces justifying their existence and thus represent ancestral states substantiated by the data. To reduce unnecessary complexity and eliminate unsupported vertices, we aim to simplify a DAG to retain only LCA vertices while preserving essential evolutionary information. In this paper, we characterize LCA -relevant and lca -relevant DAGs, defined as those in which every vertex serves as an LCA (or unique LCA) for some subset of taxa. We introduce methods to identify LCAs in DAGs and efficiently transform any DAG into an LCA -relevant or lca -relevant one while preserving key structural properties of the original DAG or network. This transformation is achieved using a simple operator "⊖" that mimics vertex suppression. 

Keywords
Algorithms, Cluster, Hasse diagram, NP-completeness, Phylogenetic networks, Regular DAGs, Reticulate evolution, Transformation
National Category
Computational Mathematics
Research subject
Mathematics
Identifiers
urn:nbn:se:su:diva-241740 (URN)10.1007/s11538-025-01419-z (DOI)001420922600001 ()39937386 (PubMedID)2-s2.0-85218478616 (Scopus ID)
Available from: 2025-04-05 Created: 2025-04-05 Last updated: 2025-04-08Bibliographically approved
Hellmuth, M. & Scholz, G. E. (2025). Solving NP-hard problems on GATEX graphs: Linear-time algorithms for perfect orderings, cliques, colorings, and independent sets. Theoretical Computer Science, 1037, Article ID 115157.
Open this publication in new window or tab >>Solving NP-hard problems on GATEX graphs: Linear-time algorithms for perfect orderings, cliques, colorings, and independent sets
2025 (English)In: Theoretical Computer Science, ISSN 0304-3975, E-ISSN 1879-2294, Vol. 1037, article id 115157Article in journal (Refereed) Published
Abstract [en]

The class of GAlled-Tree EXplainable (GATEX) graphs has recently been discovered as a natural generalization of cographs. Cographs are precisely those graphs that can be uniquely represented by a rooted tree where the leaves correspond to the vertices of the graph. As a generalization, GATEX graphs are precisely those that can be uniquely represented by a particular rooted acyclic network, called a galled-tree. This paper explores the use of galled-trees to solve combinatorial problems on GATEX graphs that are, in general, NP-hard. We demonstrate that finding a maximum clique, an optimal vertex coloring, a perfect order, as well as a maximum independent set in GATEX graphs can be efficiently done in linear time. The key idea behind the linear-time algorithms is to utilize the galled-trees that explain the GATEX graphs as a guide for computing the respective cliques, colorings, perfect orders, or independent sets.

Keywords
Cograph, Galled-tree, Linear-time algorithms, Modular decomposition, NP-hard problems
National Category
Discrete Mathematics
Identifiers
urn:nbn:se:su:diva-241848 (URN)10.1016/j.tcs.2025.115157 (DOI)001444805000001 ()2-s2.0-86000368984 (Scopus ID)
Available from: 2025-04-09 Created: 2025-04-09 Last updated: 2025-04-09Bibliographically approved
Hellmuth, M., Schmidt, B. J., Scholz, G. E. & Thekkumpadan Puthiyaveedu, S. (2025). The complement of the Djoković-Winkler relation. Discrete Mathematics, 348(3), Article ID 114328.
Open this publication in new window or tab >>The complement of the Djoković-Winkler relation
2025 (English)In: Discrete Mathematics, ISSN 0012-365X, E-ISSN 1872-681X, Vol. 348, no 3, article id 114328Article in journal (Refereed) Published
Abstract [en]

The Djoković-Winkler relation Θ is a binary relation defined on the edge set of a given graph that is based on the distances of certain vertices and which plays a prominent role in graph theory. In this paper, we explore the relatively uncharted “reflexive complement” Θ‾ of Θ, where (e,f)∈Θ‾ if and only if e=f or (e,f)∉Θ for edges e and f. We establish the relationship between Θ‾ and the set Δef, comprising the distances between the vertices of e and f and shed some light on the intricacies of its transitive closure ⁎Θ‾⁎. Notably, we demonstrate that ⁎Θ‾⁎ exhibits multiple equivalence classes only within a restricted subclass of complete multipartite graphs. In addition, we characterize non-trivial relations R that coincide with Θ‾ as those where the graph representation is disconnected, with each connected component being the (join of) Cartesian product of complete graphs. The latter results imply, somewhat surprisingly, that knowledge about the distances between vertices is not required to determine ⁎Θ‾⁎. Moreover, ⁎Θ‾⁎ has either exactly one or three equivalence classes.

Keywords
Block graph, Cartesian product, Complete multipartite graph, Diameter, Distances, Equivalence relation
National Category
Discrete Mathematics Computer Sciences
Identifiers
urn:nbn:se:su:diva-241540 (URN)10.1016/j.disc.2024.114328 (DOI)001361459700001 ()2-s2.0-85209256182 (Scopus ID)
Available from: 2025-04-01 Created: 2025-04-01 Last updated: 2025-04-01Bibliographically approved
Hellmuth, M. & Scholz, G. E. (2024). Resolving prime modules: The structure of pseudo-cographs and galled-tree explainable graphs. Discrete Applied Mathematics, 343, 25-43
Open this publication in new window or tab >>Resolving prime modules: The structure of pseudo-cographs and galled-tree explainable graphs
2024 (English)In: Discrete Applied Mathematics, ISSN 0166-218X, E-ISSN 1872-6771, Vol. 343, p. 25-43Article in journal (Refereed) Published
Abstract [en]

The modular decomposition of a graph G is a natural construction to capture key features of G in terms of a labeled tree (T,t) whose vertices are labeled as “series” (1), “parallel” (0) or “prime”. However, full information of G is provided by its modular decomposition tree (T,t) only, if G is a cograph, i.e., G does not contain prime modules. In this case, (T,t) explains G, i.e., {x,y}∈E(G) if and only if the lowest common ancestor lcaT(x,y) of x and y has label “1”. Pseudo-cographs, or, more generally, GaTEx graphs G are graphs that can be explained by labeled galled-trees, i.e., labeled networks (N,t) that are obtained from the modular decomposition tree (T,t) of G by replacing the prime vertices in T by simple labeled cycles. GaTEx graphs can be recognized and labeled galled-trees that explain these graphs can be constructed in linear time.

In this contribution, we provide a novel characterization of GaTEx graphs in terms of a set FGT of 25 forbidden induced subgraphs. This characterization, in turn, allows us to show that GATEX graphs are closely related to many other well-known graph classes such as P4-sparse and P4-reducible graphs, weakly-chordal graphs, perfect graphs with perfect order, comparability and permutation graphs, murky graphs as well as interval graphs, Meyniel graphs or very strongly-perfect and brittle graphs. Moreover, we show that every GATEX graph as twin-width at most 1.

Keywords
GATEx graph, Pseudo-cograph, Galled-tree, Graph classes, Forbidden subgraphs, Twin-width
National Category
Discrete Mathematics
Identifiers
urn:nbn:se:su:diva-223979 (URN)10.1016/j.dam.2023.09.034 (DOI)001097302500001 ()2-s2.0-85174059460 (Scopus ID)
Available from: 2023-11-24 Created: 2023-11-24 Last updated: 2023-11-24Bibliographically approved
Ramírez-Rafael, J. A., Korchmaros, A., Aviña-Padilla, K., López Sánchez, A., España-Tinajero, A. A., Hellmuth, M., . . . Hernández-Rosales, M. (2024). REvolutionH-tl: Reconstruction of Evolutionary Histories tool. In: Celine Scornavacca, Maribel Hernández-Rosales (Ed.), Comparative Genomics: Proceedings. Paper presented at 21st International Conference, RECOMB-CG 2024, Boston, MA, USA, April 27–28, 2024 (pp. 89-109). Springer Nature
Open this publication in new window or tab >>REvolutionH-tl: Reconstruction of Evolutionary Histories tool
Show others...
2024 (English)In: Comparative Genomics: Proceedings / [ed] Celine Scornavacca, Maribel Hernández-Rosales, Springer Nature, 2024, p. 89-109Conference paper, Published paper (Refereed)
Abstract [en]

Orthology detection from sequence similarity remains a difficult and computationally expensive problem for gene families with large numbers of gene duplications and losses. REvolutionH-tl implements a new graph-based approach to identify orthogroups, orthology, and paralogy relationships first, and it uses this information in a second step to infer event-labeled gene trees and their reconciliation with an inferred species tree. It avoids using gene trees and species trees upon input and settles for a maximal subtree reconciliation in cases where noise or horizontal gene transfer precludes a global reconciliation. The accuracy of the tool is comparable to competing tools at substantially reduced computational cost. 

Place, publisher, year, edition, pages
Springer Nature, 2024
Series
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, E-ISSN 1611-3349 ; 14616 LNBI
Keywords
best matches, evolutionary scenarios, orthology
National Category
Bioinformatics (Computational Biology)
Identifiers
urn:nbn:se:su:diva-236130 (URN)10.1007/978-3-031-58072-7_5 (DOI)2-s2.0-85192245669 (Scopus ID)978-3-031-58071-0 (ISBN)
Conference
21st International Conference, RECOMB-CG 2024, Boston, MA, USA, April 27–28, 2024
Available from: 2024-12-02 Created: 2024-12-02 Last updated: 2024-12-02Bibliographically approved
Hellmuth, M. & Stadler, P. F. (2024). The Theory of Gene Family Histories. In: João Carlos Setubal, Peter F. Stadler, Jens Stoye (Ed.), Comparative Genomics: Methods and Protocols (pp. 1-32). Humana Press
Open this publication in new window or tab >>The Theory of Gene Family Histories
2024 (English)In: Comparative Genomics: Methods and Protocols / [ed] João Carlos Setubal, Peter F. Stadler, Jens Stoye, Humana Press, 2024, p. 1-32Chapter in book (Refereed)
Abstract [en]

Most genes are part of larger families of evolutionary-related genes. The history of gene families typically involves duplications and losses of genes as well as horizontal transfers into other organisms. The reconstruction of detailed gene family histories, i.e., the precise dating of evolutionary events relative to phylogenetic tree of the underlying species has remained a challenging topic despite their importance as a basis for detailed investigations into adaptation and functional evolution of individual members of the gene family. The identification of orthologs, moreover, is a particularly important subproblem of the more general setting considered here. In the last few years, an extensive body of mathematical results has appeared that tightly links orthology, a formal notion of best matches among genes, and horizontal gene transfer. The purpose of this chapter is to broadly outline some of the key mathematical insights and to discuss their implication for practical applications. In particular, we focus on tree-free methods, i.e., methods to infer orthology or horizontal gene transfer as well as gene trees, species trees, and reconciliations between them without using a priori knowledge of the underlying trees or statistical models for the inference of phylogenetic trees. Instead, the initial step aims to extract binary relations among genes.

Place, publisher, year, edition, pages
Humana Press, 2024
Series
Methods in Molecular Biology, ISSN 1064-3745, E-ISSN 1940-6029 ; 2802
Keywords
Best matches, Gene family, Horizontal gene transfer, Orthologs, Paralogs, Phylogeny, Protein family, Tree-free methods
National Category
Bioinformatics and Computational Biology
Identifiers
urn:nbn:se:su:diva-236098 (URN)10.1007/978-1-0716-3838-5_1 (DOI)38819554 (PubMedID)2-s2.0-85195003884 (Scopus ID)9781071638378 (ISBN)
Available from: 2024-12-02 Created: 2024-12-02 Last updated: 2025-02-07Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-1620-5508

Search in DiVA

Show all publications

Profile pages