Found 3 Skills
Design and optimize AI agent action spaces, tool definitions, and observation formatting for higher completion rates.
Run Microsoft's eval-recipes benchmarks to validate amplihack improvements against baseline agents. Auto-activates when testing improvements, running evals, or benchmarking changes.
Agent skill for benchmark-suite - invoke with $agent-benchmark-suite