Loading...
Loading...
Create an AI Evals Pack (eval PRD, test set, rubric, judge plan, results + iteration loop). Use for LLM evaluation, benchmarks, rubrics, error analysis/open coding, and ship/no-ship quality gates for AI features.
npx skill4agent add oldwinter/skills ai-evalsproblem-definitionbuilding-with-llmsai-product-strategyevaluating-new-technologyai-evalsai-evalsamountdue_dateproblem-definitionbuilding-with-llmsai-evals