Search Results: agent-testing

Found 29 Skills

AI & Machine Learninged3dai/ed3d-plugins

creating-an-agent

Use when creating specialized subagents for Claude Code plugins or the Task tool - covers description writing for auto-delegation, tool selection, prompt structure, and testing agents

🇺🇸|EnglishTranslated

AI & Machine Learningneolabhq/context-engineer...

test-prompt

Use when creating or editing any prompt (commands, hooks, skills, subagent instructions) to verify it produces desired behavior - applies RED-GREEN-REFACTOR cycle to prompt engineering using subagents for isolated testing

🇺🇸|EnglishTranslated

Testing & QAexistential-birds/beagle

pydantic-ai-testing

Test PydanticAI agents using TestModel, FunctionModel, VCR cassettes, and inline snapshots. Use when writing unit tests, mocking LLM responses, or recording API interactions.

🇺🇸|EnglishTranslated

Testing & QAincept5/eve-skillpacks

eve-verification-plans

Author agentic verification plans for Eve-compatible apps. Use when building structured test suites that verify app correctness AND Eve platform conformance — CLI parity, manifest conventions, SSO auth, managed migrations, fixture-driven ingestion, and agent efficiency.

🇺🇸|EnglishTranslated

AI & Machine Learninggoogle-gemini/gemini-cli

behavioral-evals

Guidance for creating, running, fixing, and promoting behavioral evaluations. Use when verifying agent decision logic, debugging failures, debugging prompt steering, or adding workspace regression tests.

🇺🇸|EnglishTranslated

Testing & QAveris-ai/veris-skills

agent-integration

Integrate a raw customer agent repo with Veris end to end. Installs or verifies veris-cli, logs in, creates or reuses a Veris environment, analyzes the repo, generates or updates `.veris/veris.yaml`, `.veris/Dockerfile.sandbox`, `.veris/.dockerignore`, configures runtime env vars, and can finish with `veris env push`. Use when a repo has no Veris setup yet, or when an existing `.veris/` integration is stale and needs to be refreshed.

🇺🇸|EnglishTranslated

AI & Machine Learningsickn33/antigravity-aweso...

agent-orchestration-improve-agent

Systematic improvement of existing agents through performance analysis, prompt engineering, and continuous iteration.

🇺🇸|EnglishTranslated

AI & Machine Learningadenhq/hive

hive

Complete workflow for building, implementing, and testing goal-driven agents. Orchestrates hive-* skills. Use when starting a new agent project, unsure which skill to use, or need end-to-end guidance.

🇺🇸|EnglishTranslated

AI & Machine Learningdawiddutoit/custom-claude

manage-agents

Creates, modifies, and manages Claude Code subagents by writing agent files with YAML frontmatter, system prompts, and tool configurations. Use when you need to "create an agent", "modify an agent", "set up a specialist", "I need an agent for [task]", "agent to handle [domain]", or "configure agent tools". Covers agent file format, YAML frontmatter, system prompts, tool restrictions, MCP integration, model selection, and testing.

🇺🇸|EnglishTranslated

3 scripts/Attention

AI & Machine Learningfrankxai/arcanea

agent-implementer

Implementation guidance for creating individual agents in the Arcanea system with proper structure, capabilities, and integration.

🇺🇸|EnglishTranslated

Testing & QAyonatangross/orchestkit

testing-e2e

End-to-end testing patterns with Playwright — page objects, AI agent testing, visual regression, accessibility testing with axe-core, and CI integration. Use when writing E2E tests, setting up Playwright, implementing visual regression, or testing accessibility.

🇺🇸|EnglishTranslated

AI & Machine Learningneolabhq/context-engineer...

test-skill

Use when creating or editing skills, before deployment, to verify they work under pressure and resist rationalization - applies RED-GREEN-REFACTOR cycle to process documentation by running baseline without skill, writing to address failures, iterating to close loopholes

🇺🇸|EnglishTranslated