Vllm.Reasoning.ReasoningParser (VLLM v0.3.0)

Abstract reasoning parser class that should not be used directly.

Provided and methods should be used in derived classes.

It is used to extract reasoning content from the model output.

Summary

Functions

extract_content_ids(ref, input_ids, opts \\ [])

Extract content token ids from the input_ids.

extract_reasoning(ref, model_output, request, opts \\ [])

Extract reasoning content from a complete model-generated string.

extract_reasoning_streaming(ref, previous_text, current_text, delta_text, previous_token_ids, current_token_ids, delta_token_ids, opts \\ [])

Instance method that should be implemented for extracting reasoning

is_reasoning_end(ref, input_ids, opts \\ [])

Check if the reasoning content ends in the input_ids.

is_reasoning_end_streaming(ref, input_ids, delta_ids, opts \\ [])

Check if the reasoning content ends in the input_ids on a

new(tokenizer, opts \\ [])

Initialize self. See help(type(self)) for accurate signature.

prepare_structured_tag(ref, original_tag, tool_server, opts \\ [])

Instance method that is implemented for preparing the structured tag

vocab(ref)

Types

t()

@opaque t()

Functions

extract_content_ids(ref, input_ids, opts \\ [])

@spec extract_content_ids(SnakeBridge.Ref.t(), [integer()], keyword()) ::
  {:ok, [integer()]} | {:error, Snakepit.Error.t()}

Extract content token ids from the input_ids.

Parameters: input_ids: list[int]

The input_ids of the model output.

Parameters

input_ids (list(integer()))

Returns

list(integer())

extract_reasoning(ref, model_output, request, opts \\ [])

@spec extract_reasoning(SnakeBridge.Ref.t(), String.t(), term(), keyword()) ::
  {:ok, {term(), term()}} | {:error, Snakepit.Error.t()}

Extract reasoning content from a complete model-generated string.

Used for non-streaming responses where we have the entire model response available before sending to the client.

Parameters: model_output: str

The model-generated string to extract reasoning content from.

request: ChatCompletionRequest

The request object that was used to generate the model_output.

Parameters

model_output (String.t())
request (term())

Returns

{term(), term()}

extract_reasoning_streaming(ref, previous_text, current_text, delta_text, previous_token_ids, current_token_ids, delta_token_ids, opts \\ [])

@spec extract_reasoning_streaming(
  SnakeBridge.Ref.t(),
  String.t(),
  String.t(),
  String.t(),
  term(),
  term(),
  term(),
  keyword()
) :: {:ok, term()} | {:error, Snakepit.Error.t()}

Instance method that should be implemented for extracting reasoning

from an incomplete response; for use when handling reasoning calls and streaming. Has to be an instance method because it requires state - the current tokens/diffs, but also the information about what has previously been parsed and extracted (see constructor)

Parameters

previous_text (String.t())
current_text (String.t())
delta_text (String.t())
previous_token_ids (term())
current_token_ids (term())
delta_token_ids (term())

Returns

term()

is_reasoning_end(ref, input_ids, opts \\ [])

@spec is_reasoning_end(SnakeBridge.Ref.t(), [integer()], keyword()) ::
  {:ok, boolean()} | {:error, Snakepit.Error.t()}

Check if the reasoning content ends in the input_ids.

It is used in structured engines like xgrammar to check if the reasoning content ends in the model output.

Parameters: input_ids: list[int]

The input_ids of the model output.

Parameters

input_ids (list(integer()))

Returns

boolean()

is_reasoning_end_streaming(ref, input_ids, delta_ids, opts \\ [])

@spec is_reasoning_end_streaming(
  SnakeBridge.Ref.t(),
  [integer()],
  [integer()],
  keyword()
) ::
  {:ok, boolean()} | {:error, Snakepit.Error.t()}

Check if the reasoning content ends in the input_ids on a

decode step.

It is used in structured engines like xgrammar to check if the reasoning content ends in the model output during a decode step. input_ids the entire model output and delta_ids are the last few computed tokens of the model output (like during a decode step).

Parameters: input_ids: list[int]

The entire model output.

delta_ids: list[int]

The last few computed tokens of the model output at the current decode step.

Parameters

input_ids (list(integer()))
delta_ids (list(integer()))

Returns

boolean()

new(tokenizer, opts \\ [])

@spec new(
  term(),
  keyword()
) :: {:ok, SnakeBridge.Ref.t()} | {:error, Snakepit.Error.t()}

Initialize self. See help(type(self)) for accurate signature.

Parameters

tokenizer (term())

prepare_structured_tag(ref, original_tag, tool_server, opts \\ [])

@spec prepare_structured_tag(SnakeBridge.Ref.t(), term(), term(), keyword()) ::
  {:ok, term()} | {:error, Snakepit.Error.t()}

Instance method that is implemented for preparing the structured tag

Otherwise, None is returned

Parameters

original_tag (term())
tool_server (term())

Returns

term()

vocab(ref)

@spec vocab(SnakeBridge.Ref.t()) :: {:ok, term()} | {:error, Snakepit.Error.t()}