{
    dataType: "NUMERIC", // or LangfuseAPI.unstable.EvaluatorOutputDataType.Numeric
    reasoning: {
        description: "Explain why the answer is correct or incorrect."
    },
    score: {
        description: "Return a score between 0 and 1."
    }
}
{
    dataType: "BOOLEAN", // or LangfuseAPI.unstable.EvaluatorOutputDataType.Boolean
    reasoning: {
        description: "Explain why the output satisfies the requirement."
    },
    score: {
        description: "Return true if the output satisfies the requirement, otherwise false."
    }
}
{
    dataType: "CATEGORICAL", // or LangfuseAPI.unstable.EvaluatorOutputDataType.Categorical
    reasoning: {
        description: "Explain which category best fits the output."
    },
    score: {
        description: "Choose the best category.",
        categories: ["correct", "partially_correct", "incorrect"],
        shouldAllowMultipleMatches: false
    }
}
Structured output definition to send when creating an evaluator.
Agent guidance:
- `dataType` is required.
- Do not send `version`; that is an internal storage detail and is not part of the public request contract.
- For `NUMERIC` and `BOOLEAN`, provide `reasoning.description` and `score.description`.
- For `CATEGORICAL`, also provide `score.categories` and `score.shouldAllowMultipleMatches`.
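
The guidance above can be sketched as a small client-side check that runs before a request is sent. This is a minimal illustration, not part of the SDK: `OutputDefinition`, `FieldDescription`, `CategoricalScore`, and `checkOutputDefinition` are hypothetical names, and treating `version` as a top-level field of the definition is an assumption.

```typescript
// Hypothetical local types mirroring the guidance; these are not the SDK's own types.
interface FieldDescription {
  description: string;
}

interface CategoricalScore extends FieldDescription {
  categories: string[];
  shouldAllowMultipleMatches: boolean;
}

type OutputDefinition =
  | { dataType: "NUMERIC" | "BOOLEAN"; reasoning: FieldDescription; score: FieldDescription }
  | { dataType: "CATEGORICAL"; reasoning: FieldDescription; score: CategoricalScore };

// Hypothetical helper: returns a list of problems; an empty list means the
// payload follows the guidance above.
function checkOutputDefinition(input: object): string[] {
  const record = input as Record<string, unknown>;
  const problems: string[] = [];

  const dataType = record["dataType"];
  if (dataType !== "NUMERIC" && dataType !== "BOOLEAN" && dataType !== "CATEGORICAL") {
    problems.push("dataType is required and must be NUMERIC, BOOLEAN, or CATEGORICAL");
  }

  // Assumption: `version` would appear at the top level of the definition.
  if ("version" in record) {
    problems.push("version must not be sent");
  }

  const reasoning = record["reasoning"] as Record<string, unknown> | undefined;
  const score = record["score"] as Record<string, unknown> | undefined;

  if (typeof reasoning?.["description"] !== "string") {
    problems.push("reasoning.description is required");
  }
  if (typeof score?.["description"] !== "string") {
    problems.push("score.description is required");
  }

  // CATEGORICAL needs the two extra score fields on top of the descriptions.
  if (dataType === "CATEGORICAL") {
    if (!Array.isArray(score?.["categories"])) {
      problems.push("score.categories is required");
    }
    if (typeof score?.["shouldAllowMultipleMatches"] !== "boolean") {
      problems.push("score.shouldAllowMultipleMatches is required");
    }
  }

  return problems;
}

// Usage: the categorical example from this section passes the check.
const categoricalExample: OutputDefinition = {
  dataType: "CATEGORICAL",
  reasoning: { description: "Explain which category best fits the output." },
  score: {
    description: "Choose the best category.",
    categories: ["correct", "partially_correct", "incorrect"],
    shouldAllowMultipleMatches: false,
  },
};

console.log(checkOutputDefinition(categoricalExample)); // prints an empty array
```

Modelling the definition as a union discriminated on `dataType` lets the compiler reject a `CATEGORICAL` definition that omits `categories`, while the runtime check covers untyped payloads such as parsed JSON.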