Graph transformations and mappings - functor operations on graphs.
This module provides operations that transform graphs while preserving their structure. These are useful for adapting graph data types, creating derived graphs, and preparing graphs for specific algorithms.
Available Transformations
| Transformation | Function | Complexity | Use Case |
|---|---|---|---|
| Transpose | transpose/1 | O(1) | Reverse edge directions |
| Map Nodes | map_nodes/2 | O(V) | Transform node data |
| Map Nodes Async | map_nodes_async/2 | O(V/cores) | Parallel node transforms |
| Map Edges | map_edges/2 | O(E) | Transform edge weights |
| Map Edges Async | map_edges_async/2 | O(E/cores) | Parallel edge transforms |
| Filter Nodes | filter_nodes/2 | O(V) | Subgraph extraction |
| Filter Edges | filter_edges/2 | O(E) | Remove unwanted edges |
Parallel Transformations
The _async variants use Task.async_stream/3 for parallel processing on multi-core
systems. They're beneficial for:
- Large graphs (10K+ nodes, 100K+ edges)
- Expensive transformation functions (I/O, complex computations)
- Multi-core environments where parallelism outweighs overhead
For small graphs or trivial transforms, use the sequential versions to avoid task overhead.
The O(1) Transpose Operation
Due to yog's dual-map representation (storing both outgoing and incoming edges), transposing a graph is a single pointer swap - dramatically faster than O(E) implementations in traditional adjacency list libraries.
Functor Laws
The mapping operations satisfy functor laws:
- Identity:
map_nodes(g, fn x -> x end) == g - Composition:
map_nodes(map_nodes(g, f), h) == map_nodes(g, fn x -> h.(f.(x)) end)
Use Cases
- Kosaraju's Algorithm: Requires transposed graph for SCC finding
- Type Conversion: Changing node/edge data types for algorithm requirements
- Subgraph Extraction: Working with portions of large graphs
- Weight Normalization: Preprocessing edge weights
- Parallel Processing: Large-scale data transformations on multi-core systems
Migration Note: This module was ported from Gleam to pure Elixir in v0.53.0. The API remains unchanged. Async variants added in v0.60.1.
Summary
Functions
Creates the complement of a graph.
Contracts an edge by merging node b into node a.
Filters edges by a predicate, preserving all nodes.
Filters nodes by a predicate, automatically pruning connected edges.
Transforms edge weights using a function, preserving graph structure.
Transforms edge weights using a function in parallel, preserving graph structure.
Transforms edge weights using a function that also takes the source and destination IDs.
Transforms node data using a function, preserving graph structure.
Transforms node data using a function in parallel, preserving graph structure.
Combines two graphs, with the second graph's data taking precedence on conflicts.
Extracts a subgraph containing only the specified nodes and their connecting edges.
Converts an undirected graph to a directed graph.
Converts a directed graph to an undirected graph.
Reverses the direction of every edge in the graph (graph transpose).
Functions
Creates the complement of a graph.
The complement contains the same nodes but connects all pairs of nodes
that are not connected in the original graph, and removes all edges
that are present. Each new edge gets the supplied default_weight.
Self-loops are never added in the complement.
Time Complexity: O(V² + E)
Example
iex> graph =
...> Yog.undirected()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edge!(from: 1, to: 2, with: 1)
iex> comp = Yog.Transform.complement(graph, 100)
iex> # Original: 1-2 connected, 1-3 and 2-3 not
iex> # Complement: 1-3 and 2-3 connected, 1-2 not
iex> {1, 100} in Yog.successors(comp, 3)
trueUse Cases
- Finding independent sets (cliques in the complement)
- Graph coloring via complement analysis
- Testing graph density (sparse ↔ dense complement)
@spec contract( Yog.graph(), Yog.node_id(), Yog.node_id(), (term(), term() -> term()) ) :: Yog.graph()
Contracts an edge by merging node b into node a.
Node b is removed from the graph, and all edges connected to b are
redirected to a. If both a and b had edges to the same neighbor,
their weights are combined using with_combine.
Self-loops (edges from a node to itself) are removed during contraction.
Important for undirected graphs: Since undirected edges are stored bidirectionally, each logical edge is processed twice during contraction, causing weights to be combined twice. For example, if edge weights represent capacities, this effectively doubles them. Consider dividing weights by 2 or using a custom combine function if this behavior is undesired.
Time Complexity: O(deg(a) + deg(b)) - proportional to the combined degree of both nodes.
Example
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edge!(from: 1, to: 2, with: 10)
...> |> Yog.add_edge!(from: 2, to: 3, with: 20)
iex> contracted = Yog.Transform.contract(graph, 1, 2, fn w1, w2 -> w1 + w2 end)
iex> # Result: nodes [1, 3], edge 1->3 with weight 20
iex> # Node 2 is merged into node 1
iex> Yog.successors(contracted, 1)
[{3, 20}]Combining Weights
When both a and b have edges to the same neighbor c:
iex> # Before: a->c (5), b->c (10)
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edge!(from: 1, to: 3, with: 5)
...> |> Yog.add_edge!(from: 2, to: 3, with: 10)
iex> contracted = Yog.Transform.contract(graph, 1, 2, fn w1, w2 -> w1 + w2 end)
iex> # After: a->c (15) (5 + 10)
iex> Yog.successors(contracted, 1)
[{3, 15}]Use Cases
- Stoer-Wagner algorithm for minimum cut
- Graph simplification by merging strongly connected nodes
- Community detection by contracting nodes in the same community
- Karger's algorithm for minimum cut (randomized)
@spec filter_edges(Yog.graph(), (Yog.node_id(), Yog.node_id(), term() -> boolean())) :: Yog.graph()
Filters edges by a predicate, preserving all nodes.
Returns a new graph with the same nodes but only the edges where the
predicate returns true. The predicate receives (src, dst, weight).
Time Complexity: O(E) where E is the number of edges
Example
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edges([{1, 2, 5}, {1, 3, 15}, {2, 3, 3}])
iex> # Keep only edges with weight >= 10
iex> heavy = Yog.Transform.filter_edges(graph, fn _src, _dst, w -> w >= 10 end)
iex> # Result: edges [1->3 (15)], edges 1->2 and 2->3 removed
iex> Yog.successors(heavy, 1)
[{3, 15}]Use Cases
- Pruning low-weight edges in weighted networks
- Removing self-loops:
filter_edges(g, fn(s, d, _) -> s != d end) - Threshold-based graph sparsification
Filters nodes by a predicate, automatically pruning connected edges.
Returns a new graph containing only nodes whose data satisfies the predicate. All edges connected to removed nodes (both incoming and outgoing) are automatically removed to maintain graph consistency.
Time Complexity: O(V + E) where V is nodes and E is edges
Example
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "apple")
...> |> Yog.add_node(2, "banana")
...> |> Yog.add_node(3, "apricot")
...> |> Yog.add_edge!(from: 1, to: 2, with: 1)
...> |> Yog.add_edge!(from: 2, to: 3, with: 2)
iex> # Keep only nodes starting with 'a'
iex> filtered = Yog.Transform.filter_nodes(graph, fn s ->
...> String.starts_with?(s, "a")
...> end)
iex> # Result has nodes 1 and 3, edge 1->2 is removed (node 2 gone)
iex> map_size(filtered.nodes)
2Use Cases
- Extract subgraphs based on node properties
- Remove inactive/disabled nodes from a network
- Filter by node importance/centrality
Transforms edge weights using a function, preserving graph structure.
This is a functor operation - it applies a function to every edge's weight/data while keeping all nodes and the graph topology unchanged.
Time Complexity: O(E) where E is the number of edges
Functor Law: map_edges(map_edges(g, f), h) == map_edges(g, fn x -> h.(f.(x)) end)
Example
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edges([{1, 2, 10}, {2, 3, 20}])
iex> # Double all weights
iex> doubled = Yog.Transform.map_edges(graph, fn w -> w * 2 end)
iex> # Edges now have weights 20 and 40
iex> Yog.successors(doubled, 1)
[{2, 20}]Type Changes
Can change the edge weight type:
iex> # Convert integer weights to floats
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge(1, 2, 10)
iex> float_graph = Yog.Transform.map_edges(graph, fn w -> w * 1.0 end)
iex> Yog.successors(float_graph, 1)
[{2, 10.0}]
iex> # Convert weights to labels
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge(1, 2, 5)
iex> labeled = Yog.Transform.map_edges(graph, fn w ->
...> if w < 10, do: "short", else: "long"
...> end)
iex> Yog.successors(labeled, 1)
[{2, "short"}]
Transforms edge weights using a function in parallel, preserving graph structure.
This is the async version of map_edges/2 that uses Task.async_stream/3 to
process edge transformations concurrently. For large graphs with expensive
transformation functions, this can provide significant speedups on multi-core systems.
The parallelization strategy processes nodes (and their outgoing edges) in parallel, transforming all edges from each node concurrently.
Time Complexity: O(E/cores) amortized, where E is the number of edges
When to use:
- Large graphs (100,000+ edges)
- Expensive transformation functions (I/O, database lookups, complex calculations)
- Multi-core systems where parallelism benefits outweigh overhead
When NOT to use:
- Small graphs (< 10,000 edges) - overhead dominates
- Trivial transformations (arithmetic) - spawning tasks costs more
- Already in a parallel context - avoid nested parallelism
Options
:max_concurrency- Maximum concurrent tasks (default:System.schedulers_online()):timeout- Task timeout in milliseconds (default: 5000):ordered- Preserve order (default: false, faster unordered)
Example
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge(1, 2, 10)
iex> doubled = Yog.Transform.map_edges_async(graph, fn w -> w * 2 end)
iex> Yog.successors(doubled, 1)
[{2, 20}]Custom Options
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge(1, 2, 5)
iex> opts = [max_concurrency: 8, timeout: 10_000]
iex> result = Yog.Transform.map_edges_async(graph, fn w -> w * 3 end, opts)
iex> Yog.successors(result, 1)
[{2, 15}]Performance Example
# Sequential map_edges on 1M edges with moderate computation: ~6 seconds
graph |> Yog.Transform.map_edges(fn w -> expensive_transform(w) end)
# Parallel map_edges_async on 8-core system: ~1 second
graph |> Yog.Transform.map_edges_async(fn w -> expensive_transform(w) end)
@spec map_edges_indexed(Yog.graph(), (Yog.node_id(), Yog.node_id(), term() -> term())) :: Yog.graph()
Transforms edge weights using a function that also takes the source and destination IDs.
Example
iex> {:ok, graph} = Yog.directed()
...> |> Yog.add_node(1, "A") |> Yog.add_node(2, "B")
...> |> Yog.add_edge(1, 2, 10)
iex> result = Yog.Transform.map_edges_indexed(graph, fn u, v, w -> u + v + w end)
iex> Yog.successors(result, 1)
[{2, 13}] # 1 + 2 + 10
Transforms node data using a function, preserving graph structure.
This is a functor operation - it applies a function to every node's data while keeping all edges and the graph structure unchanged.
Time Complexity: O(V) where V is the number of nodes
Functor Law: map_nodes(map_nodes(g, f), h) == map_nodes(g, fn x -> h.(f.(x)) end)
Example
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "alice")
...> |> Yog.add_node(2, "bob")
iex> uppercased = Yog.Transform.map_nodes(graph, &String.upcase/1)
iex> # Nodes now contain "ALICE" and "BOB"
iex> uppercased.nodes[1]
"ALICE"Type Changes
Can change the node data type:
iex> # Convert string node data to integers
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "5")
...> |> Yog.add_node(2, "10")
iex> int_graph = Yog.Transform.map_nodes(graph, fn s -> String.to_integer(s) end)
iex> int_graph.nodes[1]
5
Transforms node data using a function in parallel, preserving graph structure.
This is the async version of map_nodes/2 that uses Task.async_stream/3 to
process node transformations concurrently. For large graphs with expensive
transformation functions, this can provide significant speedups on multi-core systems.
Time Complexity: O(V/cores) amortized, where V is the number of nodes
When to use:
- Large graphs (10,000+ nodes)
- Expensive transformation functions (I/O, complex computations)
- Multi-core systems where parallelism benefits outweigh overhead
When NOT to use:
- Small graphs (< 1,000 nodes) - overhead dominates
- Trivial transformations - spawning tasks costs more than the work
- Already in a parallel context - avoid nested parallelism
Options
:max_concurrency- Maximum concurrent tasks (default:System.schedulers_online()):timeout- Task timeout in milliseconds (default: 5000):ordered- Preserve order (default: false, faster unordered)
Example
iex> graph =
...> Yog.directed()
...> |> Yog.add_node(1, "alice")
...> |> Yog.add_node(2, "bob")
iex> uppercased = Yog.Transform.map_nodes_async(graph, &String.upcase/1)
iex> uppercased.nodes[1]
"ALICE"Custom Options
iex> graph = Yog.directed() |> Yog.add_node(1, "test")
iex> opts = [max_concurrency: 4, timeout: 10_000]
iex> result = Yog.Transform.map_nodes_async(graph, &String.upcase/1, opts)
iex> result.nodes[1]
"TEST"Performance Example
# Sequential map_nodes on 1M nodes with moderate computation: ~6 seconds
graph |> Yog.Transform.map_nodes(fn x -> expensive_function(x) end)
# Parallel map_nodes_async on 8-core system: ~1 second
graph |> Yog.Transform.map_nodes_async(fn x -> expensive_function(x) end)
Combines two graphs, with the second graph's data taking precedence on conflicts.
Merges nodes, out_edges, and in_edges from both graphs. When a node exists in
both graphs, the node data from other overwrites base. When the same edge
exists in both graphs, the edge weight from other overwrites base.
Importantly, edges from different nodes are combined - if base has edges
1->2 and 1->3, and other has edges 1->4 and 1->5, the result will have
all four edges from node 1.
The resulting graph uses the kind (Directed/Undirected) from the base graph.
Time Complexity: O(V + E) for both graphs combined
Example
iex> base =
...> Yog.directed()
...> |> Yog.add_node(1, "Original")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge!(from: 1, to: 2, with: 10)
iex> other =
...> Yog.directed()
...> |> Yog.add_node(1, "Updated")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edge!(from: 1, to: 3, with: 20)
iex> merged = Yog.Transform.merge(base, other)
iex> # Node 1 has "Updated" (from other)
iex> # Node 1 has edges to: 2 and 3 (all edges combined)
iex> length(Yog.successors(merged, 1))
2Use Cases
- Combining disjoint subgraphs
- Applying updates/patches to a graph
- Building graphs incrementally from multiple sources
@spec subgraph(Yog.graph(), [Yog.node_id()]) :: Yog.graph()
Extracts a subgraph containing only the specified nodes and their connecting edges.
Returns a new graph with only the nodes whose IDs are in the provided list, along with any edges that connect nodes within this subset. Nodes not in the list are removed, and all edges touching removed nodes are pruned.
Time Complexity: O(V + E) where V is nodes and E is edges
Example
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_node(4, "D")
...> |> Yog.add_edges([{1, 2, 10}, {2, 3, 20}, {3, 4, 30}])
iex> # Extract only nodes 2 and 3
iex> sub = Yog.Transform.subgraph(graph, [2, 3])
iex> # Result has nodes 2, 3 and edge 2->3
iex> # Edges 1->2 and 3->4 are removed (endpoints outside subgraph)
iex> Yog.successors(sub, 2)
[{3, 20}]Use Cases
- Extracting connected components found by algorithms
- Analyzing k-hop neighborhoods around specific nodes
- Working with strongly connected components (extract each SCC)
- Removing nodes found by some criteria (keep the inverse set)
- Visualizing specific portions of large graphs
Comparison with filter_nodes/2
filter_nodes/2- Filters by predicate on node data (e.g., "keep active users")subgraph/2- Filters by explicit node IDs (e.g., "keep nodes [1, 5, 7]")
Converts an undirected graph to a directed graph.
Since yog internally stores undirected edges as bidirectional directed edges,
this is essentially free — it just changes the kind flag. The resulting
directed graph has two directed edges (A→B and B→A) for each original
undirected edge.
If the graph is already directed, it is returned unchanged.
Time Complexity: O(1)
Example
iex> undirected =
...> Yog.undirected()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge!(from: 1, to: 2, with: 10)
iex> directed = Yog.Transform.to_directed(undirected)
iex> # Has edges: 1->2 and 2->1 (both with weight 10)
iex> directed.kind
:directed
Converts a directed graph to an undirected graph.
For each directed edge A→B, ensures B→A also exists. If both A→B and B→A
already exist with different weights, the resolve function decides which
weight to keep.
If the graph is already undirected, it is returned unchanged.
Time Complexity: O(E) where E is the number of edges
Examples
When both directions exist, keep the smaller weight
iex> directed =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edges!([{1, 2, 10}, {2, 1, 20}])
iex> undirected = Yog.Transform.to_undirected(directed, &min/2)
iex> # Edge 1-2 has weight 10 (min of 10 and 20)
iex> Yog.successors(undirected, 1)
[{2, 10}]One-directional edges get mirrored automatically
iex> directed =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_edge!(from: 1, to: 2, with: 5)
iex> undirected = Yog.Transform.to_undirected(directed, &min/2)
iex> # Edge exists in both directions with weight 5
iex> Enum.sort(Yog.successors(undirected, 1))
[{2, 5}]
Reverses the direction of every edge in the graph (graph transpose).
Due to the dual-map representation (storing both out_edges and in_edges), this is an O(1) operation - just a pointer swap! This is dramatically faster than most graph libraries where transpose is O(E).
Time Complexity: O(1)
Property: transpose(transpose(G)) = G
Example
iex> {:ok, graph} =
...> Yog.directed()
...> |> Yog.add_node(1, "A")
...> |> Yog.add_node(2, "B")
...> |> Yog.add_node(3, "C")
...> |> Yog.add_edges([{1, 2, 10}, {2, 3, 20}])
iex> reversed = Yog.Transform.transpose(graph)
iex> # Now has edges: 2->1 and 3->2
iex> Yog.successors(reversed, 2)
[{1, 10}]Use Cases
- Computing strongly connected components (Kosaraju's algorithm)
- Finding all nodes that can reach a target node
- Reversing dependencies in a DAG