Dala.ML.Burn (dala v0.6.1)

Dala integration for the Burn deep learning framework.

ExBurn provides a Nx.Backend implementation that delegates tensor operations to Burn via Rust NIFs, enabling GPU-accelerated ML/DL on mobile and desktop.

Architecture

Axon model
   ↓
Nx.Defn graph
   ↓
ExBurn.Defn.Compiler (Nx.Defn.Compiler behaviour)
   ↓
ExBurn.Backend (Nx.Backend behaviour)
   ↓
ExBurn.Nif (Rustler NIF) ←→ ExCubecl (GPU buffers, kernels, pipelines)
   ↓
Burn Autodiff<CubeCL> (Rust)
   ↓
CubeCL kernels
   ↓
Metal (iOS) / Vulkan (Android) / CUDA → GPU

Quick Start

# Set ExBurn as the default Nx backend
Dala.ML.Burn.configure!()

# Create and manipulate tensors
t = Nx.tensor([1.0, 2.0, 3.0])
Nx.add(t, t) |> Nx.to_list()

# Define a model with Axon
model =
  Axon.input("input", shape: {nil, 784})
  |> Axon.dense(256, activation: :relu)
  |> Axon.dropout(rate: 0.2)
  |> Axon.dense(10)

# Compile for training
compiled = Dala.ML.Burn.compile(model,
  loss: :cross_entropy,
  optimizer: :adam,
  learning_rate: 0.001
)

# Train
Dala.ML.Burn.fit(compiled, {train_x, train_y},
  epochs: 10,
  batch_size: 32
)

Platform GPU Backends

Platform	Backend	Status
iOS	Metal	✅
Android	Vulkan	✅
macOS	Metal	✅
Linux	Vulkan	✅
NVIDIA	CUDA	✅

Integration with Dala.ML

This module complements the existing Dala ML backends:

Dala.ML.EMLX — MLX backend for Apple Silicon (iOS recommended)
Dala.ML.CoreML — iOS-native CoreML (Neural Engine)
Dala.ML.ONNX — Cross-platform ONNX Runtime
Dala.ML.Burn — Burn framework via ExBurn (this module)

Use Dala.ML.available_backends/0 to see all available backends, and Dala.ML.Burn.available?/0 specifically for Burn support.

Training on Mobile — Caveats

Burn's Autodiff backend is memory-intensive. On iOS/Android with limited RAM:

Fine-tuning small models (< 10M parameters) is feasible on modern devices
Full training of large models is not recommended on mobile
Inference is the primary use case for mobile deployment
Minimum recommended: 4GB RAM, A12+ chip (iOS) / Snapdragon 700+ (Android)

Summary

Functions

abs(a)

Absolute value of a Burn tensor.

add(a, b)

Adds two Burn tensors.

available?()

Checks whether ExBurn is available (loaded and functional).

available_backends()

Returns a list of available GPU backends on this system.

benchmark(model, input, opts \\ [])

Benchmarks the model's forward pass on the given input.

buffer(data, shape, type \\ :f32)

Creates a GPU buffer via ExCubecl from a list of values.

buffer!(data, shape, type \\ :f32)

Creates a GPU buffer via ExCubecl, raising on error.

buffer_shape(buf)

Returns the shape of an ExCubecl buffer.

buffer_size(buf)

Returns the byte size of an ExCubecl buffer.

clone(model)

Creates a deep copy of the model with identical parameters and configuration.

compile(model, opts \\ [])

Compiles an Axon model for training with the ExBurn backend.

compute_gradients(model, batch, opts \\ [])

Computes gradients for a given mini-batch.

compute_loss(model, pred, target)

Computes the loss between predictions and targets.

configure!(opts \\ [])

Configures ExBurn for the current platform with Dala-specific defaults.

cross_entropy(pred, target)

Cross-entropy loss between predictions and targets.

cuda_available?()

Checks whether an NVIDIA CUDA GPU is available.

data_loader(arg, opts \\ [])

Creates a data loader that yields mini-batches from a dataset.

default_device()

Returns the default device for tensor operations.

deserialize_params(binary)

Deserializes model parameters from a binary.

device_info()

Returns a map with device information including GPU availability, backend name, and available backends.

device_name()

Returns the name of the active compute device (e.g., "CUDA (NVIDIA GPU)").

device_summary()

Returns a human-readable summary of the GPU device.

div(a, b)

Divides two Burn tensors element-wise.

dropout(a, prob \\ 0.5, training \\ true)

Dropout (identity during inference).

enable_defn_compiler!()

Enables the ExBurn defn compiler for GPU-accelerated Nx.Defn expressions.

error(opts \\ [])

Creates an ExBurn.Error struct (non-raising).

error_from_tuple(arg, opts)

Wraps an error tuple in an ExBurn.Error.

error_to_log_string(error)

Converts an ExBurn.Error to a log string.

evaluate(model, arg, opts \\ [])

Evaluates a model on a dataset.

exp(a)

Exponential of a Burn tensor.

export(model, path, opts \\ [])

Exports model parameters to a file.

fit(model, arg, opts \\ [])

Trains a model on the given dataset.

format_error(error)

Formats an ExBurn.Error for logging or display.

forward(model, input)

Runs a forward pass through the model using the ExBurn GPU defn compiler.

forward_pattern(model)

Returns the output shape and type information from the Axon model.

free(bt)

Frees a Burn tensor's underlying GPU/CPU memory.

freeze(model, layer_names)

Freezes the specified layers so their parameters are not updated during training.

from_binary(data, shape, type)

Creates a Burn tensor from raw binary data.

from_nx(tensor)

Converts an Nx tensor to a Burn tensor.

from_nx_batch(tensors)

Batch converts a list of Nx tensors to Burn tensors.

frozen?(model, layer_name)

Checks whether a layer is frozen.

frozen_layers(model)

Returns the set of frozen layer names.

gpu?()

Checks whether a GPU device is available for Burn operations.

gpu_available?()

Checks whether a GPU is available via Burn bridge.

gpu_memory_info()

Returns GPU memory info as %{total: bytes, used: bytes, free: bytes}.

import_params(model, path, opts \\ [])

Imports model parameters from a file saved with export/3.

info(model)

Returns a map with model information (param count, layer count, device, memory estimate).

layer_norm(a, dim \\ -1, eps \\ 1.0e-5)

Layer normalization.

load(model, path)

Loads model parameters from a file.

log(a)

Natural logarithm of a Burn tensor.

matmul(a, b)

Matrix multiplication of two Burn tensors.

mean(a)

Mean of all elements in a Burn tensor.

mse(pred, target)

Mean squared error between predictions and targets.

mul(a, b)

Multiplies two Burn tensors element-wise.

neg(a)

Negates a Burn tensor.

new_model(opts \\ [])

Creates a new empty model struct. Useful for incremental model building.

nif_function_count()

Returns the number of NIF functions registered by the Rust library. Useful for debugging NIF loading issues.

nif_loaded?()

Checks whether the NIF library is loaded and responds to calls.

ones(shape, type \\ :f32)

Creates a tensor filled with ones via Burn.

parameters(model)

Returns the current model parameters.

predict(model, input)

Runs a forward pass through the model using Axon's default backend.

profile_step(model, batch, opts \\ [])

Profiles a single training step, returning detailed timing for each phase.

quantize(model, dtype)

Quantizes model parameters to a lower precision type (:f16 or :bf16).

rand(shape, type \\ :f32, low \\ 0.0, high \\ 1.0)

Creates a random tensor with uniform distribution via Burn.

read_buffer(buf)

Reads data from an ExCubecl buffer.

relu(a)

ReLU activation of a Burn tensor.

reshape(a, shape)

Reshapes a Burn tensor.

save(model, path)

Saves the model parameters to a file using compressed Erlang term format.

serialize_params(model)

Serializes model parameters to a binary for network transfer or storage.

sigmoid(a)

Sigmoid activation of a Burn tensor.

smoke_test()

Performs a quick smoke test of the ExBurn pipeline.

softmax(a, dim \\ -1)

Softmax along a dimension.

sqrt(a)

Square root of a Burn tensor.

sub(a, b)

Subtracts two Burn tensors.

sum(a)

Sum of all elements in a Burn tensor.

summary()

Returns a summary of the ExBurn environment.

summary(model)

Returns a summary of the model architecture including parameter count.

tensor_numel(bt)

Returns the total number of elements in a Burn tensor.

tensor_rank(bt)

Returns the rank (number of dimensions) of a Burn tensor.

tensor_shape(bt)

Returns the shape of a Burn tensor.

tensor_type(bt)

Returns the element type of a Burn tensor.

to_cpu(bt)

Moves a Burn tensor to the CPU.

to_device(model, device)

Moves all model parameters to the specified device (:gpu or :cpu).

to_gpu(bt)

Moves a Burn tensor to the GPU.

to_nx(bt)

Converts a Burn tensor to an Nx tensor.

to_nx_batch(tensors)

Batch converts a list of Burn tensors to Nx tensors.

train_step(model, batch, opts \\ [])

Performs a single training step: forward + backward + optimizer update.

transpose(a, dim0 \\ 0, dim1 \\ 1)

Transposes a Burn tensor.

unfreeze(model, layer_names)

Unfreezes the specified layers so their parameters are updated during training.

update_params(model, new_params)

Returns a new model with updated parameters.

version()

Returns the current ExBurn version.

zeros(shape, type \\ :f32)

Creates a tensor filled with zeros via Burn.

Functions

abs(a)

@spec abs(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Absolute value of a Burn tensor.

add(a, b)

@spec add(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Adds two Burn tensors.

available?()

@spec available?() :: boolean()

Checks whether ExBurn is available (loaded and functional).

available_backends()

@spec available_backends() :: [atom()]

Returns a list of available GPU backends on this system.

benchmark(model, input, opts \\ [])

@spec benchmark(ExBurn.Model.t(), Nx.Tensor.t(), keyword()) :: map()

Benchmarks the model's forward pass on the given input.

Options

:warmup — Number of warmup runs (default: 3)
:runs — Number of benchmarked runs (default: 10)

Returns a map with :avg_ms, :min_ms, :max_ms, :median_ms, :std_ms.

buffer(data, shape, type \\ :f32)

@spec buffer(list(), [non_neg_integer()], atom()) :: ExCubecl.buffer_ref()

Creates a GPU buffer via ExCubecl from a list of values.

buffer!(data, shape, type \\ :f32)

@spec buffer!(list(), [non_neg_integer()], atom()) :: ExCubecl.buffer_ref()

Creates a GPU buffer via ExCubecl, raising on error.

buffer_shape(buf)

@spec buffer_shape(ExCubecl.buffer_ref()) :: [non_neg_integer()]

Returns the shape of an ExCubecl buffer.

buffer_size(buf)

@spec buffer_size(ExCubecl.buffer_ref()) :: non_neg_integer()

Returns the byte size of an ExCubecl buffer.

clone(model)

@spec clone(ExBurn.Model.t()) :: ExBurn.Model.t()

Creates a deep copy of the model with identical parameters and configuration.

compile(model, opts \\ [])

@spec compile(
  Axon.ModelState.t(),
  keyword()
) :: ExBurn.Model.t()

Compiles an Axon model for training with the ExBurn backend.

Options

:loss — Loss function: :cross_entropy, :mse, :binary_cross_entropy (default: :cross_entropy)
:optimizer — Optimizer: :adam, :sgd, :rmsprop (default: :adam)
:learning_rate — Learning rate (default: 0.001)
:device — Device: :cpu or :gpu (default: auto-detected)
:weight_decay — L2 regularization coefficient (default: 0.0)

compute_gradients(model, batch, opts \\ [])

@spec compute_gradients(ExBurn.Model.t(), {Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) ::
  map()

Computes gradients for a given mini-batch.

Options

:grad_method — :numerical (central differences) or :numerical_batch (one-sided, faster)
:epsilon — Finite difference step size (default: 1.0e-5)

compute_loss(model, pred, target)

@spec compute_loss(ExBurn.Model.t(), Nx.Tensor.t(), Nx.Tensor.t()) ::
  {:ok, Nx.Tensor.t()} | {:error, term()}

Computes the loss between predictions and targets.

configure!(opts \\ [])

@spec configure!(keyword()) :: :ok

Configures ExBurn for the current platform with Dala-specific defaults.

Options

:device — Override device (:cpu or :gpu). Auto-detected by default.
:backend — Override GPU backend (:metal, :vulkan, :cuda). Auto-detected.

cross_entropy(pred, target)

@spec cross_entropy(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Cross-entropy loss between predictions and targets.

cuda_available?()

@spec cuda_available?() :: boolean()

Checks whether an NVIDIA CUDA GPU is available.

data_loader(arg, opts \\ [])

@spec data_loader({Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) :: Enumerable.t()

Creates a data loader that yields mini-batches from a dataset.

Options

:batch_size — Mini-batch size (default: 32)
:shuffle — Shuffle data each iteration (default: true)

default_device()

@spec default_device() :: :cpu | :gpu

Returns the default device for tensor operations.

deserialize_params(binary)

@spec deserialize_params(binary()) :: {:ok, map()} | {:error, String.t()}

Deserializes model parameters from a binary.

device_info()

@spec device_info() :: map()

Returns a map with device information including GPU availability, backend name, and available backends.

device_name()

@spec device_name() :: String.t()

Returns the name of the active compute device (e.g., "CUDA (NVIDIA GPU)").

device_summary()

@spec device_summary() :: String.t()

Returns a human-readable summary of the GPU device.

div(a, b)

@spec div(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Divides two Burn tensors element-wise.

dropout(a, prob \\ 0.5, training \\ true)

@spec dropout(ExBurn.Tensor.t(), float(), boolean()) :: ExBurn.Tensor.t()

Dropout (identity during inference).

enable_defn_compiler!()

@spec enable_defn_compiler!() :: :ok

Enables the ExBurn defn compiler for GPU-accelerated Nx.Defn expressions.

After calling this, all defn functions will be compiled through ExBurn.Defn.Compiler and executed on the GPU via Burn.

Example

Dala.ML.Burn.enable_defn_compiler!

defmodule MyMath do
  import Nx.Defn

  defn add_and_scale(x, y, scale) do
    x |> Nx.add(y) |> Nx.multiply(scale)
  end
end

# Runs on GPU via Burn
MyMath.add_and_scale(Nx.tensor([1.0]), Nx.tensor([2.0]), Nx.tensor(3.0))

error(opts \\ [])

@spec error(keyword()) :: ExBurn.Error.t()

Creates an ExBurn.Error struct (non-raising).

Example

Dala.ML.Burn.error(op: :forward, reason: "shape mismatch")

error_from_tuple(arg, opts)

@spec error_from_tuple({:error, String.t()}, keyword()) :: ExBurn.Error.t()

Wraps an error tuple in an ExBurn.Error.

Example

Dala.ML.Burn.error_from_tuple({:error, "failed"}, op: :predict)

error_to_log_string(error)

@spec error_to_log_string(ExBurn.Error.t()) :: String.t()

Converts an ExBurn.Error to a log string.

evaluate(model, arg, opts \\ [])

@spec evaluate(ExBurn.Model.t(), {Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) ::
  float() | {float(), float() | nil}

Evaluates a model on a dataset.

Returns the average loss, or {loss, accuracy} when track_accuracy: true.

exp(a)

@spec exp(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Exponential of a Burn tensor.

export(model, path, opts \\ [])

@spec export(ExBurn.Model.t(), Path.t(), keyword()) :: :ok | {:error, String.t()}

Exports model parameters to a file.

Options

:format — :elixir_terms (default, compressed) or :json (human-readable)

fit(model, arg, opts \\ [])

@spec fit(ExBurn.Model.t(), {Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) ::
  ExBurn.Model.t()

Trains a model on the given dataset.

Options

:epochs — Number of training epochs (default: 10)
:batch_size — Mini-batch size (default: 32)
:shuffle — Shuffle training data each epoch (default: true)
:validation_data — Validation dataset as {inputs, targets} tuple
:callbacks — List of callback functions called after each epoch
:verbose — Print training progress (default: true)
:lr_schedule — Learning rate schedule (default: nil)
:clip_norm — Max gradient norm for clipping (default: nil)
:clip_value — Max absolute gradient value for clipping (default: nil)
:weight_decay — L2 regularization coefficient (default: nil)
:accumulate_gradients — Gradient accumulation steps (default: 1)
:accuracy — Track and report classification accuracy (default: false)
:nesterov — Use Nesterov momentum for SGD (default: false)

format_error(error)

@spec format_error(ExBurn.Error.t()) :: String.t()

Formats an ExBurn.Error for logging or display.

forward(model, input)

@spec forward(ExBurn.Model.t(), Nx.Tensor.t()) ::
  {:ok, Nx.Tensor.t()} | {:error, term()}

Runs a forward pass through the model using the ExBurn GPU defn compiler.

This is the GPU-accelerated path. Requires the model to be compiled.

Returns {:ok, output_tensor} or {:error, reason}.

forward_pattern(model)

@spec forward_pattern(ExBurn.Model.t()) :: %{
  output_shape: tuple() | nil,
  output_type: atom()
}

Returns the output shape and type information from the Axon model.

free(bt)

@spec free(ExBurn.Tensor.t()) :: :ok

Frees a Burn tensor's underlying GPU/CPU memory.

freeze(model, layer_names)

@spec freeze(ExBurn.Model.t(), [atom() | String.t()]) :: ExBurn.Model.t()

Freezes the specified layers so their parameters are not updated during training.

from_binary(data, shape, type)

@spec from_binary(binary(), [non_neg_integer()], atom()) ::
  {:ok, ExBurn.Tensor.t()} | {:error, term()}

Creates a Burn tensor from raw binary data.

from_nx(tensor)

@spec from_nx(Nx.Tensor.t()) :: {:ok, ExBurn.Tensor.t()} | {:error, term()}

Converts an Nx tensor to a Burn tensor.

from_nx_batch(tensors)

@spec from_nx_batch([Nx.Tensor.t()]) :: {:ok, [ExBurn.Tensor.t()]} | {:error, term()}

Batch converts a list of Nx tensors to Burn tensors.

frozen?(model, layer_name)

@spec frozen?(ExBurn.Model.t(), atom() | String.t()) :: boolean()

Checks whether a layer is frozen.

frozen_layers(model)

@spec frozen_layers(ExBurn.Model.t()) :: MapSet.t()

Returns the set of frozen layer names.

gpu?()

@spec gpu?() :: boolean()

Checks whether a GPU device is available for Burn operations.

gpu_available?()

@spec gpu_available?() :: boolean()

Checks whether a GPU is available via Burn bridge.

gpu_memory_info()

@spec gpu_memory_info() :: {:ok, map()} | {:error, term()}

Returns GPU memory info as %{total: bytes, used: bytes, free: bytes}.

import_params(model, path, opts \\ [])

@spec import_params(ExBurn.Model.t(), Path.t(), keyword()) ::
  {:ok, ExBurn.Model.t()} | {:error, String.t()}

Imports model parameters from a file saved with export/3.

Options

:format — :elixir_terms (default) or :json

info(model)

@spec info(ExBurn.Model.t()) :: map()

Returns a map with model information (param count, layer count, device, memory estimate).

layer_norm(a, dim \\ -1, eps \\ 1.0e-5)

@spec layer_norm(ExBurn.Tensor.t(), non_neg_integer(), float()) :: ExBurn.Tensor.t()

Layer normalization.

load(model, path)

@spec load(ExBurn.Model.t(), Path.t()) :: {:ok, ExBurn.Model.t()} | {:error, term()}

Loads model parameters from a file.

log(a)

@spec log(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Natural logarithm of a Burn tensor.

matmul(a, b)

@spec matmul(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Matrix multiplication of two Burn tensors.

mean(a)

@spec mean(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Mean of all elements in a Burn tensor.

mse(pred, target)

@spec mse(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Mean squared error between predictions and targets.

mul(a, b)

@spec mul(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Multiplies two Burn tensors element-wise.

neg(a)

@spec neg(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Negates a Burn tensor.

new_model(opts \\ [])

@spec new_model(keyword()) :: ExBurn.Model.t()

Creates a new empty model struct. Useful for incremental model building.

nif_function_count()

@spec nif_function_count() :: non_neg_integer()

Returns the number of NIF functions registered by the Rust library. Useful for debugging NIF loading issues.

nif_loaded?()

@spec nif_loaded?() :: boolean()

Checks whether the NIF library is loaded and responds to calls.

ones(shape, type \\ :f32)

@spec ones([non_neg_integer()], atom()) :: ExBurn.Tensor.t()

Creates a tensor filled with ones via Burn.

parameters(model)

@spec parameters(ExBurn.Model.t()) :: map()

Returns the current model parameters.

predict(model, input)

@spec predict(ExBurn.Model.t(), Nx.Tensor.t()) ::
  {:ok, Nx.Tensor.t()} | {:error, term()}

Runs a forward pass through the model using Axon's default backend.

For GPU execution, use forward/2 which uses the ExBurn defn compiler.

Returns {:ok, output_tensor} or {:error, reason}.

profile_step(model, batch, opts \\ [])

@spec profile_step(ExBurn.Model.t(), {Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) ::
  map()

Profiles a single training step, returning detailed timing for each phase.

Returns a map with :forward_ms, :backward_ms, :optimizer_ms, :total_ms.

quantize(model, dtype)

@spec quantize(ExBurn.Model.t(), :f16 | :bf16) :: ExBurn.Model.t()

Quantizes model parameters to a lower precision type (:f16 or :bf16).

Useful for reducing model size and speeding up inference on devices with limited compute.

rand(shape, type \\ :f32, low \\ 0.0, high \\ 1.0)

@spec rand([non_neg_integer()], atom(), float(), float()) :: ExBurn.Tensor.t()

Creates a random tensor with uniform distribution via Burn.

read_buffer(buf)

@spec read_buffer(ExCubecl.buffer_ref()) :: binary()

Reads data from an ExCubecl buffer.

relu(a)

@spec relu(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

ReLU activation of a Burn tensor.

reshape(a, shape)

@spec reshape(ExBurn.Tensor.t(), [non_neg_integer()]) :: ExBurn.Tensor.t()

Reshapes a Burn tensor.

save(model, path)

@spec save(ExBurn.Model.t(), Path.t()) :: :ok | {:error, term()}

Saves the model parameters to a file using compressed Erlang term format.

serialize_params(model)

@spec serialize_params(ExBurn.Model.t()) :: binary()

Serializes model parameters to a binary for network transfer or storage.

sigmoid(a)

@spec sigmoid(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Sigmoid activation of a Burn tensor.

smoke_test()

@spec smoke_test() :: :ok | {:error, String.t()}

Performs a quick smoke test of the ExBurn pipeline.

Returns :ok on success or {:error, reason} on failure.

softmax(a, dim \\ -1)

@spec softmax(ExBurn.Tensor.t(), non_neg_integer()) :: ExBurn.Tensor.t()

Softmax along a dimension.

sqrt(a)

@spec sqrt(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Square root of a Burn tensor.

sub(a, b)

@spec sub(ExBurn.Tensor.t(), ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Subtracts two Burn tensors.

sum(a)

@spec sum(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Sum of all elements in a Burn tensor.

summary()

@spec summary() :: String.t()

Returns a summary of the ExBurn environment.

summary(model)

@spec summary(ExBurn.Model.t()) :: String.t()

Returns a summary of the model architecture including parameter count.

tensor_numel(bt)

@spec tensor_numel(ExBurn.Tensor.t()) :: non_neg_integer()

Returns the total number of elements in a Burn tensor.

tensor_rank(bt)

@spec tensor_rank(ExBurn.Tensor.t()) :: non_neg_integer()

Returns the rank (number of dimensions) of a Burn tensor.

tensor_shape(bt)

@spec tensor_shape(ExBurn.Tensor.t()) :: [non_neg_integer()]

Returns the shape of a Burn tensor.

tensor_type(bt)

@spec tensor_type(ExBurn.Tensor.t()) :: atom()

Returns the element type of a Burn tensor.

to_cpu(bt)

@spec to_cpu(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Moves a Burn tensor to the CPU.

to_device(model, device)

@spec to_device(ExBurn.Model.t(), :cpu | :gpu) :: ExBurn.Model.t()

Moves all model parameters to the specified device (:gpu or :cpu).

to_gpu(bt)

@spec to_gpu(ExBurn.Tensor.t()) :: ExBurn.Tensor.t()

Moves a Burn tensor to the GPU.

to_nx(bt)

@spec to_nx(ExBurn.Tensor.t()) :: {:ok, Nx.Tensor.t()} | {:error, term()}

Converts a Burn tensor to an Nx tensor.

to_nx_batch(tensors)

@spec to_nx_batch([ExBurn.Tensor.t()]) :: {:ok, [Nx.Tensor.t()]} | {:error, term()}

Batch converts a list of Burn tensors to Nx tensors.

train_step(model, batch, opts \\ [])

@spec train_step(ExBurn.Model.t(), {Nx.Tensor.t(), Nx.Tensor.t()}, keyword()) ::
  {float(), ExBurn.Model.t()}

Performs a single training step: forward + backward + optimizer update.

Useful for custom training loops. Returns {loss, updated_model}.

transpose(a, dim0 \\ 0, dim1 \\ 1)

@spec transpose(ExBurn.Tensor.t(), non_neg_integer(), non_neg_integer()) ::
  ExBurn.Tensor.t()

Transposes a Burn tensor.

unfreeze(model, layer_names)

@spec unfreeze(ExBurn.Model.t(), [atom() | String.t()]) :: ExBurn.Model.t()

Unfreezes the specified layers so their parameters are updated during training.

update_params(model, new_params)

@spec update_params(ExBurn.Model.t(), map()) :: ExBurn.Model.t()

Returns a new model with updated parameters.

version()

@spec version() :: String.t()

Returns the current ExBurn version.

zeros(shape, type \\ :f32)

@spec zeros([non_neg_integer()], atom()) :: ExBurn.Tensor.t()

Creates a tensor filled with zeros via Burn.