Loading...
Loading...
Found 44 Skills
Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.
Comprehensive multi-perspective review using specialized judges with debate and consensus building
Execute complex tasks through sequential sub-agent orchestration with intelligent model selection, meta-judge → LLM-as-a-judge verification
Launch a sub-agent judge to evaluate results produced in the current conversation
Launch a meta-judge then a judge sub-agent to evaluate results produced in the current conversation
Comprehensive multi-perspective review using specialized judges with debate and consensus building
Execute a task with sub-agent implementation and LLM-as-a-judge verification with automatic retry loop
Evaluate and improve Claude Code commands, skills, and agents. Use when testing prompt effectiveness, validating context engineering choices, or measuring improvement quality.