Search Results: ai-evaluation

Found 16 Skills

AI & Machine Learningkeyvaluesoftwaresystems/n...

netra-best-practices

Code-first Netra best-practices playbook covering setup, instrumentation, context tracking, custom spans/metrics, integration patterns, evaluation, simulation, and troubleshooting.

🇺🇸|EnglishTranslated

AI & Machine Learningneolabhq/context-engineer...

judge-with-debate

Evaluate solutions through multi-round debate between independent judges until consensus

🇺🇸|EnglishTranslated

Testing & QAyonatangross/orchestkit

testing-llm

LLM and AI testing patterns — mock responses, evaluation with DeepEval/RAGAS, structured output validation, and agentic test patterns (generator, healer, planner). Use when testing AI features, validating LLM outputs, or building evaluation pipelines.

🇺🇸|EnglishTranslated

AI & Machine Learningaffaan-m/everything-claud...

skill-stocktake

Use when auditing Claude skills and commands for quality. Supports Quick Scan (changed skills only) and Full Stocktake modes with sequential subagent batch evaluation.

🇺🇸|EnglishTranslated

3 scripts/Attention