Compares LLM output against an expected answer.
Correctness means the output conveys the same meaning as the expected answer. Combines factual accuracy with semantic similarity: the output must be factually equivalent even if worded differently.
Compares LLM output against an expected answer.
Correctness means the output conveys the same meaning as the expected answer. Combines factual accuracy with semantic similarity: the output must be factually equivalent even if worded differently.