Score assistant responses for relevance on a strict 1-5 scale, then return strict JSON only with score, rationale, and improvement suggestions. Use when the user asks to evaluate relevance, grade relevance, or critique topical alignment.
npx skill4agent add whitespectre/ai-assistant-evals eval-relevance
Output fields: dimension ("relevance"), score, rationale, improvement_suggestions
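As a rough illustration of what "strict JSON only" could mean for a consumer of this skill, the sketch below parses a result and checks it against the four field names listed above. The field types (an integer score, a string rationale, a list of string suggestions) and the helper name `validate_relevance_result` are assumptions made for this example; the source names the fields but does not define a schema.

```python
import json

# Field names come from the skill listing; the types checked below are assumptions.
EXPECTED_FIELDS = {"dimension", "score", "rationale", "improvement_suggestions"}


def validate_relevance_result(raw: str) -> dict:
    """Parse a relevance evaluation and check it against the assumed shape.

    Hypothetical helper: raises ValueError if the text is not JSON or if
    any field is missing, extra, or of an unexpected type.
    """
    data = json.loads(raw)  # json.JSONDecodeError is a subclass of ValueError
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    if set(data) != EXPECTED_FIELDS:
        raise ValueError(f"unexpected fields: {sorted(set(data) ^ EXPECTED_FIELDS)}")
    if data["dimension"] != "relevance":
        raise ValueError("dimension must be 'relevance'")

    score = data["score"]
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(score, bool) or not isinstance(score, int) or not 1 <= score <= 5:
        raise ValueError("score must be an integer from 1 to 5")

    if not isinstance(data["rationale"], str) or not data["rationale"].strip():
        raise ValueError("rationale must be a non-empty string")

    suggestions = data["improvement_suggestions"]
    if not isinstance(suggestions, list) or not all(isinstance(s, str) for s in suggestions):
        raise ValueError("improvement_suggestions must be a list of strings")

    return data
```

A caller might run this on the raw model output and treat any `ValueError` as a failed evaluation, which keeps prose wrapped around the JSON from slipping through unnoticed.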