Found 2 Skills
Agent skill for benchmark-suite - invoke with $agent-benchmark-suite
Automated reproduction of comprehensive model evaluation benchmarks following Benchmark Suite V3. Auto-activates for model benchmarking, comparative evaluation, or performance testing between AI models.