Generated Wrappers

SnakeBridge generates type-safe Elixir wrappers from Python introspection at compile time. This provides compile-time guarantees, IDE autocompletion, and documentation integration.

Overview and Benefits

Generated wrappers offer advantages over the Universal FFI:

Type Safety: Generated @spec annotations catch type mismatches at compile time
IDE Support: Autocomplete and inline documentation in your editor
Discoverable APIs: __functions__/0, __classes__/0, and __search__/1 for exploration
Consistent Arities: Proper handling of optional args, keyword-only params, and variadics

The trade-off is a compilation step requiring Python at build time (unless using strict mode).

Configuring python_deps

Python dependencies are declared in your mix.exs:

def project do
  [
    app: :my_app,
    version: "1.0.0",
    deps: deps(),
    python_deps: python_deps()
  ]
end

defp python_deps do
  [
    {:numpy, "1.26.0"},
    {:pandas, "2.0.0", include: ["DataFrame", "read_csv"]},
    {:mylib, "1.0.0", generate: :all, module_mode: :public},
    {:math, :stdlib}
  ]
end

Dependency Syntax

{:numpy, "1.26.0"}                                    # Version-pinned PyPI package
{:math, :stdlib}                                      # Python standard library
{:pandas, "2.0.0", include: ["DataFrame"]}            # With options

Configuration Options

Option	Type	Default	Description
`include`	list	`[]`	Symbols to always generate
`exclude`	list	`[]`	Symbols to never generate
`generate`	`:used` or `:all`	`:used`	Generation mode
`module_mode`	atom	`nil`	Module discovery mode (`:root/:light`, `:exports/:api`, `:public/:standard`, `:explicit`, `:docs`, `:all/:nuclear`)
`docs_manifest`	string	`nil`	Docs manifest JSON path (required for `module_mode: :docs`)
`docs_profile`	atom or string	`nil`	Profile key inside `docs_manifest` (`:summary`, `:full`, etc.)
`module_include`	list	`[]`	Force-include submodules
`module_exclude`	list	`[]`	Exclude submodules
`module_depth`	integer	`nil`	Limit submodule discovery depth
`submodules`	boolean or list	`false`	Legacy submodule selection (use `module_mode`)
`public_api`	boolean	`false`	Legacy public filter (use `module_mode`)
`module_name`	atom	derived	Override Elixir module name
`python_name`	string	derived	Override Python module name
`streaming`	list	`[]`	Functions that return generators
`class_method_scope`	`:all` or `:defined`	`:all`	How to enumerate class methods during introspection
`max_class_methods`	non-negative integer	`1000`	Guardrail for inheritance-heavy classes (0 disables)
`on_not_found`	`:error` or `:stub`	depends	Missing-symbol behavior (`:error` for `:used`, `:stub` for `:all`)
`signature_sources`	list	config default	Ordered list of allowed signature sources
`strict_signatures`	boolean	config default	Enforce minimum signature tier for this library
`min_signature_tier`	atom	config default	Minimum tier when strict signatures are enabled
`stub_search_paths`	list	config default	Additional paths to search for `.pyi` stubs
`use_typeshed`	boolean	config default	Enable typeshed lookup for missing stubs
`typeshed_path`	string	config default	Typeshed root path
`stubgen`	keyword	config default	Stubgen options (enabled, cache_dir)

Generation Modes

:used (default) - Only generates wrappers for symbols detected in your codebase. The compiler scans your configured scan_paths (default ["lib"]) for calls like Numpy.mean/1 and generates only those. Use scan_extensions to include .exs scripts/examples when needed.

:all - Generates wrappers for all public symbols in the Python module:

{:mylib, "1.0.0", generate: :all, module_mode: :public}

Missing Symbols (`on_not_found`)

If SnakeBridge requests a symbol that does not exist in the current Python environment:

generate: :used defaults to on_not_found: :error (fail fast, since your Elixir code is calling it)
generate: :all defaults to on_not_found: :stub (generate deterministic stubs so the surface can still be generated)

Override per library:

{:mylib, "1.0.0", on_not_found: :stub}

Module Discovery Modes

Module discovery determines which submodules are introspected when generate: :all is set. SnakeBridge provides multiple modes:

:light / :root - only the root module
:exports / :api - root module plus submodules explicitly exported by root __all__ (no package walk)
:public / :standard - discover submodules and keep public API modules
:explicit - discover submodules but keep only modules/packages that define __all__
:docs - generate a docs-defined public surface from a committed manifest file
:all / :nuclear - discover everything (including private)

Docs Manifest Mode

Some libraries define their perceived "public API" via published docs rather than __all__. For these, module_mode: :docs uses a docs-derived allowlist so you can generate a large, documented surface without walking the full importable module tree.

Generate a manifest from Sphinx docs:

mix snakebridge.docs.manifest --library <pkg> \
  --inventory <objects.inv> \
  --nav <api index url or path> \
  --nav-depth 1 \
  --summary <api index url or path> \
  --out priv/snakebridge/<pkg>.docs.json

Configure python_deps to use it:

{:mylib, "1.0.0",
  generate: :all,
  module_mode: :docs,
  docs_manifest: "priv/snakebridge/mylib.docs.json",
  docs_profile: :summary}

Class Method Guardrails

Some Python classes inherit extremely large APIs (thousands of methods). Generating wrappers for these can produce very large .ex files and slow compilation/tooling.

Use max_class_methods to cap inherited method enumeration, and optionally switch to declared-only enumeration with class_method_scope: :defined:

{:mylib, "1.0.0",
  generate: :all,
  max_class_methods: 500}

{:mylib, "1.0.0",
  generate: :all,
  class_method_scope: :defined}

Preview expected size:

mix snakebridge.plan

Lazy Import Handling

Many Python libraries use lazy imports via __getattr__ patterns. SnakeBridge handles these by iterating over __all__ when present to discover classes and functions that aren't visible to inspect.getmembers():

# Python library with lazy imports
# mylib/__init__.py
__all__ = ["LazyClass", "lazy_function"]

def __getattr__(name):
    if name == "LazyClass":
        from .internal import LazyClass
        return LazyClass
    raise AttributeError(name)

When introspecting, SnakeBridge:

Checks if __all__ is defined on the module
Iterates over __all__ entries to trigger lazy loading
Discovers the actual class/function after it's loaded
Records any import errors for later reference

This ensures libraries using lazy loading patterns generate complete wrappers.

{:mylib, "1.0.0", generate: :all, module_mode: :light}
{:mylib, "1.0.0", generate: :all, module_mode: :exports}
{:mylib, "1.0.0", generate: :all, module_mode: :public}
{:mylib, "1.0.0", generate: :all, module_mode: :explicit}
{:mylib, "1.0.0", generate: :all, module_mode: :all}

You can further refine discovery:

{:mylib, "1.0.0",
  generate: :all,
  module_mode: :public,
  module_depth: 1,
  module_include: ["linalg"],
  module_exclude: ["internal.*"]}

Include and Exclude

{:pandas, "2.0.0",
  include: ["DataFrame", "Series", "read_csv"],
  exclude: ["eval", "exec"]}

Compilation Pipeline

The pipeline runs during mix compile:

1. Configuration

SnakeBridge.Config.load/0 reads python_deps from mix.exs.

2. Scanning

SnakeBridge.Scanner parses Elixir files to detect Python library calls:

Numpy.mean(data)           # Function call
Pandas.DataFrame.new(...)  # Class instantiation

3. Introspection

For detected symbols, Python introspection gathers signatures, types, and docstrings using Python's inspect module.

Docstrings are converted to ExDoc-friendly Markdown and sanitized to repair common upstream issues (for example: unclosed fenced code blocks or manpage-style quotes like `sys.byteorder' → `sys.byteorder`) so generated docs render cleanly.

Escape Hatch: Dynamic Calls

Generated wrappers are the preferred interface, but SnakeBridge also supports dynamic invocation when you need to reach a function or module attribute that wasn't generated.

SnakeBridge.Runtime.call("math", "sqrt", [16])
SnakeBridge.Runtime.get_module_attr("math", "pi")

4. Manifest

Results cache in .snakebridge/manifest.json:

{
  "version": "0.15.0",
  "symbols": {
    "Numpy.mean/1": {
      "module": "Numpy",
      "function": "mean",
      "parameters": [...],
      "docstring": "Compute the arithmetic mean...",
      "signature_source": "runtime",
      "doc_source": "runtime"
    }
  },
  "classes": {
    "Numpy.Ndarray": { "class": "ndarray", "methods": [...] }
  }
}

5. Generation

SnakeBridge.Generator produces Elixir files in lib/snakebridge_generated/.

6. Lock Update

SnakeBridge.Lock updates snakebridge.lock with environment info.

Regeneration

To force regeneration even when SnakeBridge considers the project up to date:

mix snakebridge.regen

Use --clean to remove generated artifacts and metadata first (useful after large surface changes):

mix snakebridge.regen --clean

Generated File Layout

SnakeBridge generates Elixir files that mirror the Python module structure. This produces navigable, IDE-friendly bindings that match Python's package organization.

Directory Structure

Generated files are organized to match Python module paths:

lib/snakebridge_generated/
├── mylib/
│   ├── __init__.ex          # Mylib module (root functions)
│   ├── models/
│   │   ├── __init__.ex       # Mylib.Models module
│   │   ├── classifier.ex     # Mylib.Models.Classifier (class)
│   │   └── regressor.ex      # Mylib.Models.Regressor (class)
│   └── utils/
│       └── __init__.ex       # Mylib.Utils module
├── numpy/
│   ├── __init__.ex           # Numpy module
│   └── linalg/
│       └── __init__.ex       # Numpy.Linalg module
└── pandas/
    ├── __init__.ex           # Pandas module
    └── data_frame.ex         # Pandas.DataFrame (class)

The `init.ex` Convention

Following Python's __init__.py pattern, package modules use __init__.ex:

Python Module	Generated File	Elixir Module
`mylib`	`mylib/__init__.ex`	`Mylib`
`mylib.models`	`mylib/models/__init__.ex`	`Mylib.Models`
`numpy.linalg`	`numpy/linalg/__init__.ex`	`Numpy.Linalg`

Class Files

Classes are generated as separate files named after the class:

Python Class	Generated File	Elixir Module
`mylib.models.Classifier`	`mylib/models/classifier.ex`	`Mylib.Models.Classifier`
`pandas.DataFrame`	`pandas/data_frame.ex`	`Pandas.DataFrame`
`numpy.ndarray`	`numpy/ndarray.ex`	`Numpy.Ndarray`

Documentation Placement

Documentation follows the same structure users see in HexDocs:

Module files (__init__.ex) define the module and carry module docs plus module-level function docs.
Class files define the class module and carry class docs plus method docs.
Submodule docs live in submodule __init__.ex; class docs live in class files.
Module docs prepend the Python module docstring (when available), then include runtime options, plus Python docs/version metadata when configured.

File names do not appear in HexDocs. Only module names and @moduledoc/@doc content are shown. When Python docstrings are missing, SnakeBridge emits a concise fallback description so HexDocs remains consistent.

For third-party libraries, you can include an explicit docs URL to surface a "Python Docs" section in the generated @moduledoc:

defp python_deps do
  [
    {:pillow, "10.2.0", docs_url: "https://pillow.readthedocs.io/"}
  ]
end

Benefits

IDE Navigation: Jump to definitions matches Python's module structure
Smaller Files: Each module is independently viewable and diffable
Git-Friendly: Changes to one submodule don't affect others
Familiar Layout: Developers familiar with Python recognize the structure

HexDocs Grouping (Optional)

You can group generated modules by Python package path for a clean HexDocs navigation tree. SnakeBridge provides a helper that reads the manifest and builds a groups_for_modules keyword list:

def project do
  [
    # ...
    docs: [
      groups_for_modules: SnakeBridge.Docs.groups_for_modules(),
      nest_modules_by_prefix: SnakeBridge.Docs.nest_modules_by_prefix()
    ]
  ]
end

By default, grouping uses one submodule level beyond the library root. To group by full Python paths (more granular), set depth: :full:

docs: [
  groups_for_modules: SnakeBridge.Docs.groups_for_modules(depth: :full)
]

The helper reads .snakebridge/manifest.json, so run mix compile (or mix snakebridge.setup) before generating docs to ensure the manifest is up to date.

Configuration

The split layout is the default. To use the legacy single-file layout:

# config/config.exs
config :snakebridge, generated_layout: :single

With :single, all modules for a library are nested in one file:

lib/snakebridge_generated/
├── mylib.ex     # All Mylib.* modules nested inside
├── numpy.ex     # All Numpy.* modules nested inside
└── pandas.ex    # All Pandas.* modules nested inside

Note: The single-file layout is preserved for backward compatibility but may be deprecated in future versions.

Max Coverage and Signature Tiers

When generate: :all is enabled, SnakeBridge attempts to wrap every public symbol and records the source tier for signatures and docs in the manifest.

Signature tiers (highest to lowest):

runtime - inspect.signature or __signature__
text_signature - __text_signature__
runtime_hints - runtime type hints
stub - .pyi stubs (local, types- packages, typeshed)
stubgen - generated stubs fallback
variadic - fallback wrapper when no signature is available

Doc tiers:

runtime - runtime docstrings
stub - stub docstrings
module - module docstring fallback
empty - no docstring available

Each symbol records signature_source, signature_detail, signature_missing_reason, doc_source, and doc_missing_reason in the manifest.

Stub Discovery and Configuration

Stub discovery checks, in order:

Local .pyi next to the module or package
types-<pkg> stub packages when installed
Typeshed when use_typeshed: true
Stubgen fallback when stubs are missing (cached)

Configure stub sources and search paths:

config :snakebridge,
  signature_sources: [:runtime, :text_signature, :runtime_hints, :stub, :stubgen, :variadic],
  stub_search_paths: ["priv/python/stubs"],
  use_typeshed: true,
  typeshed_path: "/path/to/typeshed",
  stubgen: [enabled: true, cache_dir: ".snakebridge/stubgen_cache"]

Strict Signature Thresholds

Use strict_signatures and min_signature_tier (global or per-library) to fail builds when any symbol falls below the minimum tier.

config :snakebridge,
  strict_signatures: true,
  min_signature_tier: :stub

defp python_deps do
  [
    {:pandas, "2.2.0",
     generate: :all,
     strict_signatures: true,
     min_signature_tier: :stub}
  ]
end

Coverage Reports

Enable coverage reports to capture tier counts and issues without warnings:

config :snakebridge,
  coverage_report: [output_dir: ".snakebridge/coverage"]

Reports are written as *.coverage.json and *.coverage.md.

Generated Module Structure

defmodule Numpy do
  @moduledoc "SnakeBridge bindings for `numpy`."

  def __snakebridge_python_name__, do: "numpy"
  def __snakebridge_library__, do: "numpy"

  @spec mean(list(), keyword()) :: {:ok, term()} | {:error, Snakepit.Error.t()}
  def mean(a, opts \\ []) do
    SnakeBridge.Runtime.call(__MODULE__, :mean, [a], opts)
  end

  # Discovery
  def __functions__, do: [{:mean, 1, __MODULE__, "Compute the arithmetic mean..."}]
  def __classes__, do: [{Numpy.Ndarray, "ndarray object..."}]
  def __search__(query), do: SnakeBridge.Docs.search(__MODULE__, query)
end

Discovery Functions

__functions__/0 - Returns [{name, arity, module, summary}]
__classes__/0 - Returns [{module, docstring}]
__search__/1 - Fuzzy search across names and docs

iex> Numpy.__functions__() |> Enum.take(3)
[{:mean, 1, Numpy, "Compute the arithmetic mean..."}, ...]

Arity Handling

Python's calling conventions map to Elixir function variants.

Parameter Classification

Python Kind	Behavior
`POSITIONAL_ONLY`	Counts toward required arity
`POSITIONAL_OR_KEYWORD`	Counts toward required arity
`VAR_POSITIONAL` (*args)	Variadic list parameter
`KEYWORD_ONLY`	Passed via opts keyword list
`VAR_KEYWORD` (**kwargs)	Passed via opts keyword list

Optional Positional Arguments

def func(a, b=10, c=20): ...

Generates multiple arities:

def func(a)
def func(a, b)
def func(a, b, c)
def func(a, b, c, opts)

Keyword-Only Arguments

def func(a, *, required_kw, optional_kw=None): ...

def func(a, opts \\ []) do
  missing = ["required_kw"] -- Keyword.keys(opts)
  if missing != [], do: raise ArgumentError, "Missing required keyword-only arguments..."
  SnakeBridge.Runtime.call(__MODULE__, :func, [a], opts)
end

Variadic Fallback (C Extensions)

Functions without introspectable signatures get multiple arities (up to 8 args):

def func()
def func(a)
def func(a, b)
# ... up to 8 args (configurable via :variadic_max_arity)

Class Generation

Python classes become nested Elixir modules:

defmodule Numpy.Ndarray do
  @opaque t :: SnakeBridge.Ref.t()

  # Constructor: __init__ -> new
  def new(shape, opts \\ []) do
    SnakeBridge.Runtime.call_class(__MODULE__, :__init__, [shape], opts)
  end

  # Method: ref as first argument
  def reshape(ref, shape, opts \\ []) do
    SnakeBridge.Runtime.call_method(ref, :reshape, [shape], opts)
  end

  # Attribute accessor
  def shape(ref), do: SnakeBridge.Runtime.get_attr(ref, :shape)
end

Method Name Mapping

Python	Elixir
`__init__`	`new`
`__str__`	`to_string`
`__repr__`	`inspect`
`__len__`	`length`
`__getitem__`	`get`
`__setitem__`	`put`
`__contains__`	`member?`

Other dunder methods are skipped.

Reserved Word Handling

Python names conflicting with Elixir reserved words are prefixed with py_:

def py_class(ref, opts \\ [])  # Python method named "class"

Method Name Collision Handling

When a Python class has both __init__ (mapped to new) and a method literally named new, SnakeBridge renames the method to python_new to avoid arity conflicts:

# Python class with collision
class MyClass:
    def __init__(self, value):
        self.value = value

    def new(self, other_value):  # Collides with __init__ -> new
        return MyClass(other_value)

# Generated Elixir module
defmodule MyLib.MyClass do
  def new(value, opts \\ [])           # From __init__
  def python_new(ref, other_value)     # Renamed from 'new' method
end

This prevents "function new/N defined multiple times" compilation errors.

Strict Mode

Enable strict mode for CI without Python:

SNAKEBRIDGE_STRICT=1 mix compile

Or in config:

config :snakebridge, strict: true

This strict mode verifies manifest coverage and generated files. For signature tier enforcement, see the "Strict Signature Thresholds" section above.

Strict Mode Behavior

Load existing manifest (no new introspection)
Scan project for Python calls
Verify all calls exist in manifest
Verify generated files exist
Verify all symbols are defined

Failure Example

Strict mode: 3 symbol(s) not in manifest.

Missing:
  - Numpy.new_function/2
  - Pandas.DataFrame.new_method/1

To fix:
  1. Run `mix snakebridge.setup` locally
  2. Run `mix compile` to generate bindings
  3. Commit the updated manifest and generated files
  4. Re-run CI

CI Workflow

- name: Build
  env:
    SNAKEBRIDGE_STRICT: 1
  run: mix compile --warnings-as-errors

Lockfile

The snakebridge.lock captures environment state:

{
  "version": "0.15.0",
  "environment": {
    "snakebridge_version": "0.15.0",
    "generator_hash": "a1b2c3...",
    "python_version": "3.12.3",
    "elixir_version": "1.18.4",
    "hardware": {
      "accelerator": "cuda",
      "cuda_version": "12.1"
    },
    "platform": { "os": "linux", "arch": "x86_64" }
  },
  "libraries": {
    "numpy": { "requested": "1.26.0", "resolved": "1.26.0" }
  }
}

The generator hash triggers regeneration when core logic changes. Hardware info detects compatibility issues across environments.

Commit Strategy

Commit both snakebridge.lock and .snakebridge/manifest.json to ensure:

Reproducible builds across environments
Strict mode verification in CI
Hardware compatibility checking