Getting Started

Follow this guide to install Tinkex, configure credentials, and run your first requests through the CLI or SDK. Examples assume Elixir 1.18+ and a valid TINKER_API_KEY.

Install the SDK

Add the dependency and fetch packages:

# mix.exs
def deps do
  [
    {:tinkex, "~> 0.4.0"}
  ]
end

mix deps.get

To build the CLI locally:

MIX_ENV=prod mix escript.build   # emits ./tinkex
# optional: install to PATH
mix escript.install ./tinkex

Configure credentials

Tinkex reads configuration from Tinkex.Config.new/1, pulling fallbacks from the application env and environment variables:

TINKER_API_KEY (required)
TINKER_BASE_URL (defaults to the production base URL)
Optional per-request overrides: :timeout, :max_retries, :http_pool, :project_id, :user_metadata

Advanced env/app-config knobs such as TINKEX_POLL_BACKOFF, TINKEX_PARITY, TINKEX_POOL_SIZE, TINKEX_POOL_COUNT, and TINKEX_OTEL_PROPAGATE are covered in docs/guides/environment_configuration.md and docs/guides/advanced_configuration.md.

config =
  Tinkex.Config.new(
    api_key: System.fetch_env!("TINKER_API_KEY"),
    base_url: System.get_env("TINKER_BASE_URL", "https://tinker.thinkingmachines.dev/services/tinker-prod")
  )

First sampling request (SDK)

{:ok, _} = Application.ensure_all_started(:tinkex)

{:ok, service} = Tinkex.ServiceClient.start_link(config: config)
{:ok, sampler} =
  Tinkex.ServiceClient.create_sampling_client(service,
    base_model: "meta-llama/Llama-3.1-8B"
  )

{:ok, prompt} =
  Tinkex.Types.ModelInput.from_text("Hello there", model_name: "meta-llama/Llama-3.1-8B")

params = %Tinkex.Types.SamplingParams{max_tokens: 64, temperature: 0.7, top_p: 0.9}

{:ok, task} = Tinkex.SamplingClient.sample(sampler, prompt, params, num_samples: 1)
{:ok, response} = Task.await(task, 10_000)
IO.inspect(response.sequences, label: "sequences")

SamplingClient.sample/4 returns a Task.t() so you can await, stream, or supervise requests alongside other work.

Session & checkpoint management (REST)

Use the synchronous RestClient to inspect sessions and checkpoints, then pull artifacts with CheckpointDownload:

{:ok, service} = Tinkex.ServiceClient.start_link(config: config)
{:ok, rest} = Tinkex.ServiceClient.create_rest_client(service)

{:ok, sessions} = Tinkex.RestClient.list_sessions(rest, limit: 10)
IO.inspect(sessions.sessions, label: "sessions")

{:ok, checkpoints} = Tinkex.RestClient.list_user_checkpoints(rest, limit: 20)
IO.inspect(Enum.map(checkpoints.checkpoints, & &1.tinker_path), label: "checkpoints")

{:ok, download} =
  Tinkex.CheckpointDownload.download(rest, "tinker://run-123/weights/0001",
    output_dir: "./models",
    force: true
  )

IO.puts("Extracted to #{download.destination}")

CLI checkpoints and runs

After MIX_ENV=prod mix escript.build, invoke the CLI:

# Save weights metadata for a base model
./tinkex checkpoint \
  --base-model meta-llama/Llama-3.1-8B \
  --rank 32 \
  --output ./checkpoint.json \
  --api-key "$TINKER_API_KEY"

# Set a 24h checkpoint TTL
./tinkex checkpoint set-ttl tinker://run-123/weights/0001 \
  --ttl 86400 \
  --api-key "$TINKER_API_KEY"

# Generate text with configurable sampling params
./tinkex run \
  --base-model meta-llama/Llama-3.1-8B \
  --prompt "Hello there" \
  --max-tokens 64 \
  --temperature 0.7 \
  --num-samples 2 \
  --api-key "$TINKER_API_KEY"

Flags to know:

--prompt-file <path> reads a prompt from disk (plain text or JSON array of token IDs).
--json prints the full response payload instead of decoded text.
--output <path> writes generation output to a file.
--access-scope accessible widens run-management reads to resources shared with the caller.
checkpoint push-hf exports a checkpoint adapter directly to Hugging Face Hub.
For parallelism and quicker startup, prefer the async client factories (Tinkex.ServiceClient.create_sampling_client_async/2 and Tinkex.SamplingClient.create_async/2) inside your own scripts; the CLI remains a thin wrapper over the same APIs.

checkpoint push-hf resolves Hugging Face auth in this order: --hf-token, HF_TOKEN, HUGGING_FACE_HUB_TOKEN, then HF_TOKEN_PATH (default: $HF_HOME/token).

See ./tinkex checkpoint --help and ./tinkex run --help for the full option set, plus the troubleshooting guide for timeout/backoff tips.

Tokenization helpers

Tinkex wraps the tokenizers NIF and caches handles in ETS:

{:ok, ids} = Tinkex.Tokenizer.encode_text("Hello", "gpt2")
{:ok, text} = Tinkex.Tokenizer.decode(ids, "gpt2")
{:ok, model_input} = Tinkex.Types.ModelInput.from_text("Chat prompt", model_name: "gpt2")

Tokenizer resolution honors metadata from TrainingClient.get_info/1 if provided and applies the Llama-3 gating workaround automatically.