Tutorial
View SourceMix.install([
  {:openai_responses, path: "~/src/responses"},
  {:kino, "~> 0.11.0"}
])Introduction
OpenAI.Responses is a Small Development Kit for the OpenAI Responses API. It can get you started in no time, and automatically supports conversation state, structured responses, streaming, function calls, and cost calculations.
If you want an industrial-grade library that supports multiple providers, you should instead consider one of LangChain, OpenaiEx, or Instructor. If, however, you want to start as quickly as possible without learning a new framework first, and are ready to trade the flexibility of choosing an LLM provider for the conveniences of OpenAI's latest API, you've come to the right place!
Basic usage
The basic usage is simple: set the OPENAI_API_KEY environment variable and add openai_responses to your mix.exs
def deps do
  [
    # ...
    {:openai_responses, "~> 0.8.1"}
  ]
endis enough to get you started:
alias OpenAI.Responses
# Explicit input and model are required
{:ok, response} = Responses.create(input: "Write me a haiku about Elixir", model: "gpt-4.1-mini")
# -> {:ok, %OpenAI.Responses.Response{text: ..., ...}}
IO.puts(response.text)In general, create/1 takes a keyword list or map with anything that Create a model response supports. Always pass input: and model: explicitly. create!/1 is a version of create/1 that raises on errors:
response =
  Responses.create!(
    input: [
      %{role: :developer, content: "Talk like a pirate."},
      %{role: :user, content: "Write me a haiku about Elixir"}
    ],
    model: "gpt-5-mini",
    reasoning: %{effort: :low}
  )
IO.puts("#{response.text}\n\nCost: $#{response.cost.total_cost}")There is also create/2 and create!/2 that take %Responses.Response{} as a first argument and keyword list as a second to automatically handle the conversation state. You can also build prompts with OpenAI.Responses.Prompt helpers:
alias OpenAI.Responses.Prompt
%{}
|> Prompt.add_developer("Talk like a pirate.")
|> Prompt.add_user("Write me a haiku about Elixir")
|> Map.put(:model, "gpt-4.1-mini")
|> Responses.create!()
|> Responses.create!(input: "Which programming language is this haiku about?")
|> Map.get(:text)
|> IO.puts()Notice that the follow-up response also talks like a pirate due to conversation state kept by OpenAI. In terms of generation settings, only the model, reasoning effort, and text verbosity are preserved across follow-ups; schema is not preserved and must be specified again if needed.
Structured Output
One of the coolest features of the latest generation of OpenAI models is their ability to output Structured Output that precisely matches the supplied JSON schema. create/1 accepts a schema: option that allows easy creation of such schema in the required format.
The return result will automatically parse the response into %Response{parsed: object}.
alias OpenAI.Responses
Responses.create!(
  input: "List facts about first 3 U.S. Presidents",
  schema: %{
    presidents:
      {:array,
       %{
         name: :string,
         birth_year: :integer,
         little_known_facts: {:array, {:string, max_items: 2}}
       }}
  },
  model: "gpt-4.1-mini"
)
|> Map.get(:parsed)
|> Map.get("presidents")The above example should give you enough to get started, and you can read the documentation about supported schemas.
Streaming
Streaming model responses is a great way to increase interactivity of your application. There are two ways to support streaming with OpenAI.Responses:
- By adding a - callback: callback_fnoption to- create/1. Here- callback_fn/1takes a map- %{event: type, data: data}and should return either- :okto continue streaming or- {:error, reason}to stop.- The call will be blocked until the streaming ends but will otherwise return the same - %Response{}structure.
- By calling - stream/1, which returns an Elixir Stream of- %{event: type, data: data}objects.
For the supported event types and data format, refer to the Streaming Responses API docs.
Here is an example of how this works. We use the Responses.Stream.text_deltas helper to transform the stream of events into a stream of text chunks, and use Kino.Frame to demonstrate interactive output updates in a Livebook.
alias OpenAI.Responses
frame = Kino.Frame.new()
Kino.render(frame)
Responses.stream(
  input: """
  Write a short fairy tale about an Elixir developer who tried to use Java,
  and about the horrors that have ensued.
  """,
  temperature: 0.7,
  model: "gpt-4.1-mini"
)
|> Responses.Stream.text_deltas()
|> Stream.each(fn delta ->
  Kino.Frame.append(frame, Kino.Markdown.new(delta, chunk: true))
end)
|> Stream.run()
:doneThere is also a Responses.Stream.json_events/1 helper, which uses the Jaxon library to stream JSON events:
Responses.stream(
  input: "Tell me about first 2 U.S. Presidents",
  schema: %{presidents: {:array, %{name: :string, birth_year: :integer}}},
  model: "gpt-4.1-mini"
)
|> Responses.Stream.json_events()
|> Stream.each(&IO.inspect/1)
|> Stream.run()Tools
Using OpenAI built-in tools, for example Web Search, is simple: just add a tools: parameter to create/1:
alias OpenAI.Responses
Responses.create!(
  input: "Summarize in 3 paragraphs a positive news story from today",
  tools: [%{type: "web_search_preview"}]
)
|> Map.get(:text)
|> IO.putsFor Function calling, you can provide a function description. The Responses.Schema.build_function/3 helper makes this easier:
# Using the build_function helper
weather_tool = Responses.Schema.build_function(
  "get_weather",
  "Get current temperature for a given location",
  %{location: {:string, description: "City and country e.g. Bogotá, Colombia"}}
)
response = Responses.create!(
  input: "What is the weather like in Paris today?",
  tools: [weather_tool],
  model: "gpt-4.1-mini"
)
response.function_callsThe resulting %Response{} has a :function_calls field populated with the function calls requested by the model. Our app can now call each function, and we can provide results back to the model.
Manually building a function output item:
response
|> Responses.create!(
  input: [%{
    type: "function_call_output",
    call_id: response.function_calls |> List.first() |> Map.get(:call_id),
    output: "15C"
  }],
  model: "gpt-4.1-mini"
)
|> Map.get(:text)Alternatively, use the helper to execute and append function outputs:
alias OpenAI.Responses.Prompt
functions = %{
  "get_weather" => fn %{"location" => location} -> "15C in #{location}" end
}
opts = Prompt.add_function_outputs(%{input: []}, response.function_calls, functions)
response
|> Responses.create!(opts)
|> Map.get(:text)Automating Function Calls with run/2
The Responses.run/2 function automates the process of handling function calls. It will repeatedly call your functions and feed the results back to the model until a final response is achieved:
# Define available functions
functions = %{
  "get_weather" => fn %{"location" => location} ->
    # In a real app, this would call a weather API
    case location do
      "Paris" -> "15°C, partly cloudy"
      "London" -> "12°C, rainy"
      "New York" -> "8°C, sunny"
      _ -> "Weather data not available"
    end
  end,
  "get_time" => fn %{"timezone" => timezone} ->
    # In a real app, this would get actual time for timezone
    case timezone do
      "Europe/Paris" -> "14:30"
      "Europe/London" -> "13:30"
      "America/New_York" -> "08:30"
      _ -> "Unknown timezone"
    end
  end
}
# Define function tools
weather_tool = Responses.Schema.build_function(
  "get_weather",
  "Get current weather for a location",
  %{location: {:string, description: "City name"}}
)
time_tool = Responses.Schema.build_function(
  "get_time",
  "Get current time in a timezone",
  %{timezone: {:string, description: "Timezone like Europe/Paris"}}
)
# Run the conversation with automatic function calling
responses = Responses.run(
  [
    input: "What's the weather and time in Paris?",
    tools: [weather_tool, time_tool],
    model: "gpt-4.1-mini"
  ],
  functions
)
# The last response contains the final answer
responses |> List.last() |> Map.get(:text) |> IO.puts()
# You can also inspect all intermediate responses
IO.puts("\nTotal responses: #{length(responses)}")The run/2 function returns a list of all responses generated during the conversation. This allows you to:
- Track the conversation flow
- See what functions were called
- Calculate total costs across all API calls
- Debug issues in function calling
There's also a run!/2 variant that raises on errors instead of returning error tuples.