View Source ProcessHub (ProcessHub v0.2.0-alpha)

This is the main public API module for the ProcessHub library and it is recommended to use only the functions defined in this module to interact with the ProcessHub library.

ProcessHub is a library that distributes processes within the BEAM cluster. It is designed to be used as a building block for distributed applications that require process distribution and synchronization.

Summary

Types

child_id()

The child_id defines the name of the child. It is used to identify the child. Each child must have a unique child_id() in the cluster. A single child may have multiple pid()s across the cluster.

child_spec()

The child_spec defines the specification of a child process.

hub_id()

The hub_id defines the name of the hub. It is used to identify the hub.

init_opts()

The init_opts() defines the options that can be passed to the start_children/3, start_child/3, stop_children/3, and stop_child/3 functions.

reply_to()

The reply_to defines the pid()s that will receive the response from the hub when a child is started or stopped.

stop_opts()

The stop_opts() defines the options that can be passed to the stop_children/3 and stop_child/3 functions.

t()

This is the base configuration structure for the hub and has to be passed to the start_link/1 function.

Functions

await(init_func)

This function can be used to wait for the ProcessHub child start or stop functions to complete.

child_lookup(hub_id, child_id)

Returns information about processes that are registered with the given child_id/0.

child_spec(opts)

Returns the child specification for the ProcessHub.Initializer supervisor.

is_alive?(hub_id)

Checks if the ProcessHub with the given hub_id/0 is alive.

is_locked?(hub_id)

Checks if the ProcessHub with the given hub_id/0 is locked.

is_partitioned?(hub_id)

Checks if the ProcessHub with the given hub_id/0 is in a network-partitioned state.

nodes(hub_id, opts \\ [])

Returns a list of nodes where the ProcessHub with the given hub_id/0 is running.

process_list(hub_id, scope)

Returns a list of processes that are registered.

process_registry(hub_id)

Returns all information in the registry.

start_child(hub_id, child_spec, opts \\ [])

Starts a child process that will be distributed across the cluster. The t:child_spec() :id must be unique.

start_children(hub_id, child_specs, opts \\ [])

Starts multiple child processes that will be distributed across the cluster.

start_link(hub_settings)

Starts the ProcessHub with the given hub_id/0 and settings.

stop(hub_id)

Stops the ProcessHub with the given hub_id/0.

stop_child(hub_id, child_id, opts \\ [])

Stops a child process in the cluster.

stop_children(hub_id, child_ids, opts \\ [])

Stops multiple child processes in the cluster.

which_children(hub_id, opts \\ [])

Works similarly to Supervisor.which_children/1, but wraps the result in a tuple containing the node name and the children.

Types

child_id()

@type child_id() :: atom() | binary()

child_spec()

@type child_spec() :: %{id: child_id(), start: {module(), atom(), [any()]}}

The child_spec defines the specification of a child process.

hub_id()

@type hub_id() :: atom()

The hub_id defines the name of the hub. It is used to identify the hub.

init_opts()

@type init_opts() :: [
  async_wait: boolean(),
  timeout: non_neg_integer(),
  check_mailbox: boolean(),
  check_existing: boolean()
]

The init_opts() defines the options that can be passed to the start_children/3, start_child/3, stop_children/3, and stop_child/3 functions.

:async_wait - is optional and is used to define whether the function should return another function that can be used to wait for the children to start or stop. The default is false.
:timeout is optional and is used to define the timeout for the function. The timeout option should be used with async_wait: true. The default is 5000 (5 seconds).
:check_mailbox - is optional and is used to define whether the function should clear the mailbox of any existing messages that may overlap. It is recommended to keep this option true to avoid any unexpected behavior where start_child/3 or start_children/3 call timeout but eventually the calling process receives the start responses later. These messages will stay in that process's mailbox, and when the same process calls start child functions again with the same child_id()s, it will receive the old responses.This option should be used with async_wait: true. The default is true.
:check_existing - is optional and is used to define whether the function should check if the children are already started. The default is true.

reply_to()

@type reply_to() :: [pid()]

The reply_to defines the pid()s that will receive the response from the hub when a child is started or stopped.

stop_opts()

@type stop_opts() :: [async_wait: boolean(), timeout: non_neg_integer()]

The stop_opts() defines the options that can be passed to the stop_children/3 and stop_child/3 functions.

:async_wait - is optional and is used to define whether the function should return another function that can be used to wait for the children to stop. The default is false.
:timeout is optional and is used to define the timeout for the function. The timeout option should be used with async_wait: true. The default is 5000 (5 seconds).

t()

@type t() :: %ProcessHub{
  distribution_strategy:
    ProcessHub.Strategy.Distribution.ConsistentHashing.t()
    | ProcessHub.Strategy.Distribution.Guided.t(),
  hooks: ProcessHub.Service.HookManager.hook_handlers(),
  hub_id: hub_id(),
  migration_strategy:
    ProcessHub.Strategy.Migration.ColdSwap.t()
    | ProcessHub.Strategy.Migration.HotSwap.t(),
  partition_tolerance_strategy:
    ProcessHub.Strategy.PartitionTolerance.Divergence.t()
    | ProcessHub.Strategy.PartitionTolerance.StaticQuorum.t()
    | ProcessHub.Strategy.PartitionTolerance.DynamicQuorum.t(),
  redundancy_strategy:
    ProcessHub.Strategy.Redundancy.Singularity.t()
    | ProcessHub.Strategy.Redundancy.Replication.t(),
  synchronization_strategy:
    ProcessHub.Strategy.Synchronization.PubSub.t()
    | ProcessHub.Strategy.Synchronization.Gossip.t()
}

This is the base configuration structure for the hub and has to be passed to the start_link/1 function.

:hub_id is the name of the hub and is required.
:hooks are optional and are used to define the hooks that can be triggered on specific events.
:redundancy_strategy is optional and is used to define the strategy for redundancy. The default is ProcessHub.Strategy.Redundancy.Singularity.
:migration_strategy is optional and is used to define the strategy for migration. The default is ProcessHub.Strategy.Migration.ColdSwap.
:synchronization_strategy is optional and is used to define the strategy for synchronization. The default is ProcessHub.Strategy.Synchronization.PubSub.
:partition_tolerance_strategy is optional and is used to define the strategy for partition tolerance. The default is ProcessHub.Strategy.PartitionTolerance.Divergence.
:distribution_strategy is optional and is used to define the strategy for process distribution. The default is ProcessHub.Strategy.Distribution.ConsistentHashing.

Functions

await(init_func)

@spec await(function()) :: term()

This function can be used to wait for the ProcessHub child start or stop functions to complete.

The await/1 function should be used with the async_wait: true option.

Keep in mind that the await/1 function will block the calling process until the response is received. If the response is not received within the timeout period, the function will return {:error, term()}.

Example

iex> ref = ProcessHub.start_child(:my_hub, child_spec, [async_wait: true])
iex> ProcessHub.await(ref)
{:ok, {:my_child, [{:mynode, #PID<0.123.0>}]}}

child_lookup(hub_id, child_id)

@spec child_lookup(hub_id(), child_id()) :: {child_spec(), [{node(), pid()}]} | nil

Returns information about processes that are registered with the given child_id/0.

This function queries results from the local ets table and does not make any network calls.

The return results contain the child_spec/0 and a list of tuples where the first element is the node where the child is started, and the second element is the pid() of the started child.

Example

iex> {} = {_child_spec, _node_pid_tuples} = ProcessHub.child_lookup(:my_hub, :my_child)
{%{id: :my_child, start: {MyProcess, :start_link, []}}, [{:mynode, #PID<0.123.0>}]}

child_spec(opts)

@spec child_spec(t()) :: %{
  id: ProcessHub,
  start: {ProcessHub.Initializer, :start_link, [...]},
  type: :supervisor
}

Returns the child specification for the ProcessHub.Initializer supervisor.

is_alive?(hub_id)

@spec is_alive?(hub_id()) :: boolean()

Checks if the ProcessHub with the given hub_id/0 is alive.

A hub is considered alive if the ProcessHub.Initializer supervisor process is running along with the required child processes for the hub to function.

Example

iex> ProcessHub.is_alive?(:not_existing)
false

is_locked?(hub_id)

@spec is_locked?(hub_id()) :: boolean()

Checks if the ProcessHub with the given hub_id/0 is locked.

A hub is considered locked if the ProcessHub local event queue has a priority level greater than or equal to 10. This is used to throttle the hub from processing any new events and conserve data integrity.

Example

iex> ProcessHub.is_locked?(:my_hub)
false

is_partitioned?(hub_id)

@spec is_partitioned?(hub_id()) :: boolean()

Checks if the ProcessHub with the given hub_id/0 is in a network-partitioned state.

A hub is considered partitioned if the ProcessHub.Strategy.PartitionTolerance strategy has detected a network partition. When a network partition is detected, the hub will terminate the ProcessHub.DistributedSupervisor process along with its children.

Example

iex> ProcessHub.is_partitioned?(:my_hub)
false

nodes(hub_id, opts \\ [])

@spec nodes(hub_id(), [:include_local] | nil) :: [node()]

Returns a list of nodes where the ProcessHub with the given hub_id/0 is running.

Nodes where the ProcessHub is running with the same hub_id/0 are considered to be part of the same cluster.

Example

iex> ProcessHub.nodes(:my_hub, [:include_local])
[:remote_node]

process_list(hub_id, scope)

@spec process_list(hub_id(), :global | :local) :: [{child_id(), [{node(), pid()}]}]

Returns a list of processes that are registered.

The process list contains the child_id/0 and depending on the scope option, it may contain the node and pid() of the child.

This function queries results from the local ets table and does not make any network calls.

Available options:

:global - returns a list of all child processes on all nodes in the cluster. The return result will be in the format of [{child_id, [{:node, pid}]}].
:local - returns a list of child processes that belong to the local node. The return result will be in the format of [{child_id, [pid]}] but only the processes that belong to the local node will be returned.

Example

iex> ProcessHub.process_list(:my_hub, :global)
[
  {:my_child1, [{:node1, #PID<0.123.0>}, {:node2, #PID<2.123.0>}]},
  {:my_child2, [{:node2, #PID<5.124.0>}]}
]
iex> ProcessHub.process_list(:my_hub, :local)
[{:my_child1, [#PID<0.123.0>]}]

process_registry(hub_id)

@spec process_registry(hub_id()) :: ProcessHub.Service.ProcessRegistry.registry()

Returns all information in the registry.

This function queries results from the local ets table and does not make any network calls.

start_child(hub_id, child_spec, opts \\ [])

@spec start_child(hub_id(), child_spec(), init_opts()) ::
  (-> {:ok, list()})
  | {:error, :no_children | {:already_started, [atom() | binary(), ...]}}
  | {:ok, :start_initiated}

Starts a child process that will be distributed across the cluster. The t:child_spec() :id must be unique.

Example

iex> child_spec = %{id: :my_child, start: {MyProcess, :start_link, []}}
iex> ProcessHub.start_child(:my_hub, child_spec)
{:ok, :start_initiated}

By default, the start_child/3 function is asynchronous and returns immediately. To wait for the child to start, you can pass async_wait: true to the opts argument. When async_wait: true, you must await the response from the function.

See init_opts/0 for more options.

Example with synchronous wait

The synchronous response includes the status code :ok or :error, a tuple containing the child_id/0 and a list of tuples where the first key is the node where the child is started, and the second key is the pid() of the started child. By default, the list should contain only one tuple, but if the redundancy strategy is configured for replicas, it may contain more than one tuple.

iex> child_spec = %{id: :my_child, start: {MyProcess, :start_link, []}}
iex> ProcessHub.start_child(:my_hub, child_spec, [async_wait: true]) |> ProcessHub.await()
{:ok, {:my_child, [{:mynode, #PID<0.123.0>}]}}

start_children(hub_id, child_specs, opts \\ [])

@spec start_children(hub_id(), [child_spec()], init_opts()) ::
  (-> {:ok, list()})
  | {:ok, :start_initiated}
  | {:error,
     :no_children
     | {:error, :children_not_list}
     | {:already_started, [atom() | binary(), ...]}}

Starts multiple child processes that will be distributed across the cluster.

Same as start_child/3, except it starts multiple children at once and is more efficient than calling start_child/3 multiple times.

Warning
Using start_children/3 with async_wait: true can lead to timeout errors, especially when the number of children is large. When async_wait: true, you must await the response from the function.

start_link(hub_settings)

@spec start_link(t()) :: {:ok, pid()} | {:error, term()}

Starts the ProcessHub with the given hub_id/0 and settings.

It is recommended to start the ProcessHub under a supervision tree.

stop(hub_id)

@spec stop(atom()) :: :ok | {:error, :not_alive}

Stops the ProcessHub with the given hub_id/0.

stop_child(hub_id, child_id, opts \\ [])

@spec stop_child(hub_id(), child_id(), stop_opts()) ::
  (-> {:ok, list()}) | {:ok, :stop_initiated}

Stops a child process in the cluster.

By default, this function is asynchronous and returns immediately. You can wait for the child to stop by passing async_wait: true in the opts argument. When async_wait: true, you must await the response from the function.

Example

iex> ProcessHub.stop_child(:my_hub, :my_child)
{:ok, :stop_initiated}

See stop_opts/0 for more options.

Example with synchronous wait

iex> ProcessHub.stop_child(:my_hub, :my_child, [async_wait: true]) |> ProcessHub.await()
{:ok, {:my_child, [:mynode]}}

stop_children(hub_id, child_ids, opts \\ [])

@spec stop_children(hub_id(), [child_id()], stop_opts()) ::
  (-> {:ok, list()}) | {:ok, :stop_initiated} | {:error, list()}

Stops multiple child processes in the cluster.

This function is similar to stop_child/3, but it stops multiple children at once, making it more efficient than calling stop_child/3 multiple times.

Warning
Using stop_children/3 with async_wait: true can lead to timeout errors, especially when stopping a large number of child processes.

which_children(hub_id, opts \\ [])

@spec which_children(hub_id(), [:global | :local] | nil) ::
  list()
  | {node(),
     [
       {any(), :restarting | :undefined | pid(), :supervisor | :worker,
        :dynamic | list()}
     ]}

Works similarly to Supervisor.which_children/1, but wraps the result in a tuple containing the node name and the children.

Info
The Supervisor.which_children/1 function is known to consume a lot of memory and this can affect performance. The problem is even more relevant when using the :global option as it will make a network call to all nodes in the cluster.
It is highly recommended to use ProcessHub.process_list/2 instead.

Available options:

:global - returns a list of all child processes started by all nodes in the cluster. The return result will be in the format of [{:node, children}].
:local - returns a list of all child processes started by the local node. The return result will be in the format of {:node, children}.

Settings View Source ProcessHub (ProcessHub v0.2.0-alpha)

Summary

Types

Functions

Types

child_id()

child_spec()

hub_id()

init_opts()

reply_to()

stop_opts()

t()

Functions

await(init_func)

Example

child_lookup(hub_id, child_id)

Example

child_spec(opts)

is_alive?(hub_id)

Example

is_locked?(hub_id)

Example

is_partitioned?(hub_id)

Example

nodes(hub_id, opts \\ [])

Example

process_list(hub_id, scope)

Example

process_registry(hub_id)

start_child(hub_id, child_spec, opts \\ [])

Example

Example with synchronous wait

start_children(hub_id, child_specs, opts \\ [])

Warning

start_link(hub_settings)

stop(hub_id)

stop_child(hub_id, child_id, opts \\ [])

Example

Example with synchronous wait

stop_children(hub_id, child_ids, opts \\ [])

Warning

which_children(hub_id, opts \\ [])

Info

View Source ProcessHub (ProcessHub v0.2.0-alpha)