Search Results: agent-testing

Found 33 Skills

AI & Machine Learninggooglecloudplatform/cxas-...

cxas-sim-eval

Converts CXAS golden evaluations to SCRAPI SimulationEvals test cases. Use when generating high-level, goal-oriented test cases from turn-by-turn evaluation JSONs, and when enriching test expectations with inferred tool calls.

🇺🇸|EnglishTranslated

4 scripts/Attention

AI & Machine Learningsickn33/antigravity-aweso...

agent-orchestration-improve-agent

Systematic improvement of existing agents through performance analysis, prompt engineering, and continuous iteration.

🇺🇸|EnglishTranslated

AI & Machine Learninggalaxy-dawn/claude-schola...

agent-identifier

Use when creating or configuring Claude Code agents and their frontmatter.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningneolabhq/context-engineer...

test-prompt

Use when creating or editing any prompt (commands, hooks, skills, subagent instructions) to verify it produces desired behavior - applies RED-GREEN-REFACTOR cycle to prompt engineering using subagents for isolated testing

🇺🇸|EnglishTranslated

AI & Machine Learningneolabhq/context-engineer...

test-skill

Use when creating or editing skills, before deployment, to verify they work under pressure and resist rationalization - applies RED-GREEN-REFACTOR cycle to process documentation by running baseline without skill, writing to address failures, iterating to close loopholes

🇺🇸|EnglishTranslated

AI & Machine Learningcekura-ai/cekura-skills

cekura-onboarding

Use when the user says "get started with Cekura", "set up Cekura", "onboard to Cekura", "I'm new to Cekura", "help me set up my agent", "how do I use Cekura", "walk me through Cekura", "configure my project", "first time using Cekura", or needs guidance on initial platform setup. Covers two onboarding paths: **testing** (default — build evaluators and run simulated calls) and **observability** (ingest production call logs and evaluate them).

🇺🇸|EnglishTranslated

AI & Machine Learningfrankxai/arcanea

agent-implementer

Implementation guidance for creating individual agents in the Arcanea system with proper structure, capabilities, and integration.

🇺🇸|EnglishTranslated

AI & Machine Learninglerianstudio/ring

ring:testing-agents-with-subagents

Agent testing methodology - run agents with test inputs, observe outputs, iterate until outputs are accurate and well-structured.

🇺🇸|EnglishTranslated

AI & Machine Learningantithesishq/antithesis-s...

test:triage

This skill should be used when the user asks to "test the triage skill", "run triage tests", "validate antithesis triage", "test:triage", or "smoke test triage". Orchestrates end-to-end testing of the antithesis-triage skill by running real triage operations via sub-agents and reviewing the results for bugs, skill compliance issues, and papercuts.

🇺🇸|EnglishTranslated