View Source ExOpenAI.Components.EvalRun (ex_openai.ex v2.0.0-beta2)
A schema representing an evaluation run.
Fields
:created_at- required -integer()
Unix timestamp (in seconds) when the evaluation run was created.:data_source- required -map()
Information about the run's data source.:error- required -ExOpenAI.Components.EvalApiError.t():eval_id- required -String.t()
The identifier of the associated evaluation.:id- required -String.t()
Unique identifier for the evaluation run.:metadata- required -ExOpenAI.Components.Metadata.t():model- required -String.t()
The model that is evaluated, if applicable.:name- required -String.t()
The name of the evaluation run.:object- required -:"eval.run"
The type of the object. Always "eval.run".
Allowed values:"eval.run"
Default:"eval.run":per_model_usage- required -[{:%{}, [], [{{:required, [], [:cached_tokens]}, {:integer, [], []}}, {{:required, [], [:completion_tokens]}, {:integer, [], []}}, {{:required, [], [:invocation_count]}, {:integer, [], []}}, {{:required, [], [:model_name]}, {{:., [], [{:__aliases__, [alias: false], [:String]}, :t]}, [], []}}, {{:required, [], [:prompt_tokens]}, {:integer, [], []}}, {{:required, [], [:total_tokens]}, {:integer, [], []}}]}]
Usage statistics for each model during the evaluation run.:per_testing_criteria_results- required -[{:%{}, [], [{{:required, [], [:failed]}, {:integer, [], []}}, {{:required, [], [:passed]}, {:integer, [], []}}, {{:required, [], [:testing_criteria]}, {{:., [], [{:__aliases__, [alias: false], [:String]}, :t]}, [], []}}]}]
Results per testing criteria applied during the evaluation run.:report_url- required -String.t()
The URL to the rendered evaluation run report on the UI dashboard.:result_counts- required -{:%{}, [], [{{:required, [], [:errored]}, {:integer, [], []}}, {{:required, [], [:failed]}, {:integer, [], []}}, {{:required, [], [:passed]}, {:integer, [], []}}, {{:required, [], [:total]}, {:integer, [], []}}]}
Counters summarizing the outcomes of the evaluation run.:status- required -String.t()
The status of the evaluation run.
Summary
Types
@type t() :: %ExOpenAI.Components.EvalRun{ created_at: integer(), data_source: map(), error: ExOpenAI.Components.EvalApiError.t(), eval_id: String.t(), id: String.t(), metadata: ExOpenAI.Components.Metadata.t(), model: String.t(), name: String.t(), object: :"eval.run", per_model_usage: [ %{ cached_tokens: integer(), completion_tokens: integer(), invocation_count: integer(), model_name: String.t(), prompt_tokens: integer(), total_tokens: integer() } ], per_testing_criteria_results: [ %{failed: integer(), passed: integer(), testing_criteria: String.t()} ], report_url: String.t(), result_counts: %{ errored: integer(), failed: integer(), passed: integer(), total: integer() }, status: String.t() }