Orchestrators use an LLM to dynamically compose available tools (actions and agents) at runtime via a ReAct-style loop.
## Upgrading from 0.2 to 0.3
`query_sync` error reasons are now structured data instead of `inspect()`-ed
strings. If you parse error strings, switch to pattern matching:

```elixir
# Before (0.2)
{:error, "some stringified error"} = MyOrch.query_sync(agent, query)

# After (0.3)
{:error, reason} = MyOrch.query_sync(agent, query)
# reason is the original struct/tuple/atom — not a string
```

## ReAct Loop
```mermaid
flowchart TD
    A[Query] --> B[Call LLM with tools]
    B --> C{Response type?}
    C -->|Tool calls| D[Execute tools]
    D --> E[Feed results to LLM]
    E --> B
    C -->|Final answer| F[Return result]
    C -->|Max iterations| G[Return error]
```

The orchestrator sends the user's query to the LLM along with available tool definitions. The LLM either calls tools or provides a final answer. Tool results are fed back to the LLM for further reasoning. This repeats until the LLM responds with a final answer or the iteration limit is reached.
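The loop above can be sketched in plain Elixir. This is an illustrative skeleton only, not the library's implementation: `call_llm` and `run_tool` are hypothetical stand-ins for the real LLM call and tool-execution plumbing.

```elixir
defmodule ReactSketch do
  # Loop until the LLM gives a final answer or we hit the iteration limit.
  def loop(messages, call_llm, run_tool, max_iterations) do
    do_loop(messages, call_llm, run_tool, max_iterations, 0)
  end

  defp do_loop(_messages, _call_llm, _run_tool, max, iter) when iter >= max,
    do: {:error, :max_iterations}

  defp do_loop(messages, call_llm, run_tool, max, iter) do
    case call_llm.(messages) do
      {:final, answer} ->
        {:ok, answer}

      {:tool_calls, calls} ->
        # Execute each requested tool and feed its result back to the LLM.
        results = Enum.map(calls, fn call -> {:tool_result, call.name, run_tool.(call)} end)
        do_loop(messages ++ results, call_llm, run_tool, max, iter + 1)
    end
  end
end
```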
## DSL Options
| Option | Type | Required | Default | Description |
|---|---|---|---|---|
| name | string | yes | — | Unique orchestrator identifier |
| description | string | no | "Orchestrator: #{name}" | Documentation text |
| schema | keyword | no | [] | Input validation schema |
| nodes | list | yes | — | List of action/agent modules or {module, opts} tuples |
| model | string | no | nil | LLM model identifier (e.g., "anthropic:claude-sonnet-4-20250514") |
| system_prompt | string | no | nil | System message for the LLM |
| max_iterations | integer | no | 10 | Maximum ReAct loop iterations |
| temperature | float | no | nil | LLM temperature parameter |
| max_tokens | integer | no | nil | Token budget for LLM responses |
| stream | boolean | no | false | Whether to use streaming generation |
| termination_tool | module | no | nil | A Jido.Action module for structured termination |
| llm_opts | keyword | no | [] | Additional options passed to ReqLLM |
| req_options | keyword | no | [] | HTTP options for Req (useful for testing) |
| rejection_policy | atom | no | :continue_siblings | Behavior when a gated tool is rejected |
| ambient | [atom] | no | [] | Read-only context keys |
| fork_fns | map | no | %{} | Context transformation at child boundaries |
| max_tool_concurrency | integer | no | unlimited | Backpressure limit for concurrent tool execution |
## Model Format
Models use the `"provider:model_name"` format supported by `req_llm`:

```elixir
model: "anthropic:claude-sonnet-4-20250514"
model: "openai:gpt-4o"
model: "google:gemini-2.0-flash"
```

## Tools
Actions and agents listed in `nodes` are automatically converted to LLM tool
definitions. The tool name comes from the action's `name/0` callback, the
description from `description/0`, and parameters from `schema/0`.

Tip: Write clear, specific `description` strings for your tools — the LLM uses them to decide which tool to call. A vague description like "process data" leads to poor tool selection. Prefer "Search the product catalog by name or SKU and return matching items with prices."
```elixir
use Jido.Composer.Orchestrator,
  nodes: [
    SearchAction,                      # plain action
    {WriteAction, some_option: true},  # action with options
    ResearchAgent                      # agent as tool
  ]
```

When the LLM calls a tool, the orchestrator:
- Converts the tool call arguments to action parameters
- Executes the action (or spawns the agent)
- Converts the result to a tool result message
- Adds it to the conversation for the next LLM call
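The round trip above can be sketched in plain Elixir. The helper names and message shapes here are hypothetical; the orchestrator's real conversion logic lives internally:

```elixir
defmodule ToolCallSketch do
  # Convert string-keyed LLM arguments into atom-keyed action params,
  # keeping only the keys the action's schema declares.
  def to_params(args, allowed_keys) do
    for key <- allowed_keys, Map.has_key?(args, Atom.to_string(key)), into: %{} do
      {key, Map.fetch!(args, Atom.to_string(key))}
    end
  end

  # Wrap an action result as a tool-result message for the next LLM call.
  def to_tool_result(tool_call_id, {:ok, result}),
    do: %{role: :tool, tool_call_id: tool_call_id, content: inspect(result)}

  def to_tool_result(tool_call_id, {:error, reason}),
    do: %{role: :tool, tool_call_id: tool_call_id, content: "error: " <> inspect(reason)}
end
```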
## Streaming
When `stream: true`, LLMAction uses streaming generation internally
(collect-then-return). The strategy sees no difference from non-streaming mode.

Note: Streaming uses Finch directly, bypassing Req plugs. When using cassette/stub testing, set
`stream: false` (the default).
## Termination Tool (Structured Output)
For structured output, define a `Jido.Action` module whose schema describes the
output shape, and pass it as `termination_tool:`. The LLM sees it as a regular
tool and calls it when ready to produce the final answer.
```elixir
defmodule FinalReportAction do
  use Jido.Action,
    name: "final_report",
    description: "Produce the final analysis report. Call when you have the answer.",
    schema: [
      summary: [type: :string, required: true, doc: "Summary of findings"],
      confidence: [type: :float, required: true, doc: "Confidence score 0.0-1.0"]
    ]

  def run(%{summary: summary, confidence: confidence}, _ctx) do
    {:ok, %{summary: summary, confidence: confidence}}
  end
end

defmodule Analyzer do
  use Jido.Composer.Orchestrator,
    name: "analyzer",
    model: "anthropic:claude-sonnet-4-20250514",
    nodes: [SearchAction, CalculateAction],
    termination_tool: FinalReportAction,
    system_prompt: "Analyze the query. Call final_report when you have the answer."
end

agent = Analyzer.new()
{:ok, _agent, %{summary: _, confidence: _}} = Analyzer.query_sync(agent, "Analyze X")
```

The termination tool action's `run/2` executes with the LLM's arguments,
allowing validation and transformation. If the action returns an error, the
error is fed back to the LLM so it can retry with corrected arguments.
When the LLM returns both regular tools and the termination tool in the same batch, termination wins and sibling calls are dropped.
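The error-feedback behavior can be sketched as follows. The module and tuple shapes are illustrative only, not the library's internals: a failed termination call becomes an error tool result so the LLM can retry, while a successful one ends the loop.

```elixir
defmodule TerminationSketch do
  # Success: the action's output becomes the structured final answer.
  def handle_termination(_tool_call_id, {:ok, output}), do: {:done, output}

  # Failure: feed the error back to the conversation; the ReAct loop continues.
  def handle_termination(tool_call_id, {:error, reason}) do
    {:retry, %{role: :tool, tool_call_id: tool_call_id, content: "error: " <> inspect(reason)}}
  end
end
```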
## Running Orchestrators

### Async (query/3)

Returns the agent and directives for external runtime execution:

```elixir
agent = MyOrchestrator.new()
{agent, directives} = MyOrchestrator.query(agent, "What is 5 + 3?")
```

### Blocking (query_sync/3)
Executes the full ReAct loop internally and returns the post-execution agent:

```elixir
agent = MyOrchestrator.new()
{:ok, agent, answer} = MyOrchestrator.query_sync(agent, "What is 5 + 3?")
```

`query_sync` returns three variants:
| Return | Meaning |
|---|---|
| `{:ok, agent, result}` | Completed — result is the LLM's final answer (string or structured map) |
| `{:suspended, agent, suspension}` | Paused — a tool requires human approval or an action suspended |
| `{:error, reason}` | Failed — reason is structured (e.g., error struct, not a stringified inspect) |
The returned agent carries the full post-execution strategy state:
conversation history, token usage, iteration count, and tool call results. This
enables multi-turn persistence and
suspension handling.
Both accept an optional context map as a third argument:

```elixir
{:ok, agent, answer} = MyOrchestrator.query_sync(agent, "Analyze this", %{data: dataset})
```

## Tool Approval Gates
Mark individual tools as requiring human approval before execution:
```elixir
use Jido.Composer.Orchestrator,
  nodes: [
    SearchAction,
    {DeployAction, requires_approval: true},
    {DeleteAction, requires_approval: true}
  ]
```

When the LLM calls a gated tool, the orchestrator:
- Partitions tool calls into gated and ungated
- Executes ungated tools immediately
- Suspends with an `ApprovalRequest` for each gated tool
- Waits for human approval before executing
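The partitioning step can be sketched with `Enum.split_with/2`. This is an illustration under assumed data shapes (each call carrying a `requires_approval` flag), not the orchestrator's actual code:

```elixir
tool_calls = [
  %{name: "search", requires_approval: false},
  %{name: "deploy", requires_approval: true}
]

# Partition the batch: ungated calls execute now, gated calls suspend.
{gated, ungated} = Enum.split_with(tool_calls, & &1.requires_approval)

# Each gated call becomes a pending approval request for a human reviewer.
approval_requests = Enum.map(gated, &%{tool: &1.name, status: :pending_approval})
```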
### Handling Suspension
When the LLM calls a gated tool, query_sync returns
{:suspended, agent, suspension} instead of an error. The returned agent
contains the full conversation history — including the LLM's tool_use message
— which is essential for correct resumption:
```elixir
case MyOrchestrator.query_sync(agent, "Deploy the new version") do
  {:ok, agent, answer} ->
    # Normal completion
    answer

  {:suspended, agent, suspension} ->
    # Persist the agent's conversation so we can resume later
    strat = agent.state.__strategy__
    MyDB.save_conversation(session_id, strat.conversation)

    # Present the approval request to a human
    %{reason: :human_input, approval_request: request} = suspension
    send_to_reviewer(request)

  {:error, reason} ->
    handle_error(reason)
end
```

Without the returned agent, the conversation would be lost on suspension. When
the user later approves and the agent resumes, it needs the tool_use message
in history — otherwise the API rejects the tool_result as orphaned.
See the HITL guide for resume flows and persistence tiers.
### Rejection Policy
Controls behavior when a gated tool call is rejected:
- `:continue_siblings` (default) — Continue executing other (ungated) tool calls; skip the rejected one
## Backpressure
Limit concurrent tool execution to prevent overwhelming external services:

```elixir
use Jido.Composer.Orchestrator,
  max_tool_concurrency: 3  # max 3 tools executing at once
```

When the LLM requests more tool calls than the concurrency limit, excess calls are queued and executed as slots become available.
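This queue-and-execute behavior is what Elixir's `Task.async_stream/3` provides out of the box, sketched below for illustration (the orchestrator's internal mechanism may differ):

```elixir
# Ten hypothetical tool calls, but at most 3 run at any moment;
# the rest wait until a slot frees up.
tool_calls = for n <- 1..10, do: %{name: "tool_#{n}"}

results =
  tool_calls
  |> Task.async_stream(fn call -> {call.name, :ok} end, max_concurrency: 3)
  |> Enum.map(fn {:ok, result} -> result end)
```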
Orchestrators sit at the adaptive end of the control spectrum — the LLM decides which tools to call and in what order. For fully deterministic pipelines, see Workflows. For mixing both patterns, see Composition & Nesting.
## Runtime Configuration
The DSL sets defaults at compile time, but `configure/2` lets you override
fields at runtime — after `new/0` but before `query_sync/3`:
```elixir
agent = MathAssistant.new()

agent = MathAssistant.configure(agent,
  system_prompt: "You are a math tutor helping #{user.name}.",
  model: "anthropic:claude-sonnet-4-20250514",
  temperature: 0.3,
  max_tokens: 4096,
  req_options: [plug: cassette_plug]
)

{:ok, _agent, answer} = MathAssistant.query_sync(agent, "What is 5 + 3?")
```

### Overridable Fields
| Key | Type | Description |
|---|---|---|
| `:system_prompt` | String.t() | Replace the system prompt |
| `:nodes` | [module()] | Replace available tools (rebuilds internally) |
| `:model` | String.t() | Replace the model identifier |
| `:temperature` | float() | Replace sampling temperature |
| `:max_tokens` | integer() | Replace token budget |
| `:req_options` | keyword() | Replace HTTP options (test plugs, etc.) |
| `:conversation` | ReqLLM.Context.t() | Pre-load conversation history for multi-turn |
### Filtering Tools (RBAC)
Use `get_action_modules/1` to read the DSL-declared tools, filter them, then set
them back:
```elixir
agent = MyOrchestrator.new()

# Read what the DSL declared
all_modules = MyOrchestrator.get_action_modules(agent)

# Filter by user role
visible = Enum.filter(all_modules, fn mod ->
  mod in allowed_tools_for(current_user.role)
end)

# Write back — handles node/tool rebuild + termination tool dedup
agent = MyOrchestrator.configure(agent, nodes: visible)
```

When `:nodes` is overridden, `configure/2` rebuilds ActionNode/AgentNode
structs, ReqLLM.Tool descriptions, and internal lookup maps. If a
`termination_tool` was declared in the DSL, it is automatically deduplicated —
you don't need to exclude it from the node list.
### Pre-loading Conversation History
For multi-turn agents that persist conversations to a database:
```elixir
# Load prior messages from your database
messages = MyDB.load_messages(conversation_id)
context = ReqLLM.Context.new(messages)

agent = MyOrchestrator.new()
agent = MyOrchestrator.configure(agent, conversation: context)

{:ok, agent, answer} = MyOrchestrator.query_sync(agent, new_user_message)
```

### Multi-Turn Conversation Persistence
`query_sync` returns the updated agent, which carries the full conversation
history in its strategy state. Use this to persist conversations between turns:
```elixir
# First turn — no prior history
agent = MyOrchestrator.new()
{:ok, agent, answer} = MyOrchestrator.query_sync(agent, "What is 5 + 3?")

# Extract and persist the conversation
conversation = agent.state.__strategy__.conversation
MyDB.save_conversation(session_id, ReqLLM.Context.to_list(conversation))

# Later turn — reload history and continue
messages = MyDB.load_messages(session_id)
agent = MyOrchestrator.new()
agent = MyOrchestrator.configure(agent, conversation: ReqLLM.Context.new(messages))
{:ok, agent, answer} = MyOrchestrator.query_sync(agent, "And what is that times 2?")
# The LLM sees the full prior conversation and can reference earlier results
```

The strategy state also exposes post-execution metadata:

```elixir
strat = agent.state.__strategy__
strat.conversation  # ReqLLM.Context — full message history
strat.iteration     # number of ReAct loop iterations used
strat.status        # :completed, :error, etc.
```

See `Jido.Composer.Orchestrator.Configure` for the full API reference.
## Context Accumulation
Tool results are scoped under the tool name in the working context, just like workflow states:
```elixir
# After LLM calls "search" and "calculate" tools:
# context.working[:search]    => %{results: [...]}
# context.working[:calculate] => %{result: 42}
```

## Complete Example
```elixir
defmodule AddAction do
  use Jido.Action,
    name: "add",
    description: "Add two numbers",
    schema: [value: [type: :float, required: true], amount: [type: :float, required: true]]

  @impl true
  def run(%{value: v, amount: a}, _ctx), do: {:ok, %{result: v + a}}
end

defmodule MultiplyAction do
  use Jido.Action,
    name: "multiply",
    description: "Multiply two numbers",
    schema: [value: [type: :float, required: true], amount: [type: :float, required: true]]

  @impl true
  def run(%{value: v, amount: a}, _ctx), do: {:ok, %{result: v * a}}
end

defmodule MathAssistant do
  use Jido.Composer.Orchestrator,
    name: "math_assistant",
    model: "anthropic:claude-sonnet-4-20250514",
    nodes: [AddAction, MultiplyAction],
    system_prompt: "You are a math assistant. Use the available tools.",
    max_iterations: 5
end

agent = MathAssistant.new()
{:ok, _agent, answer} = MathAssistant.query_sync(agent, "What is (5 + 3) * 2?")
```