Loading...
Loading...
[Hyper] Test Codex/agent skills for intended triggering and behavior with realistic positive, negative, boundary, and edge-case scenarios. Use when validating a skill folder, SKILL.md, rules/references/scripts/assets, trigger precision, workflow correctness, or regression coverage before shipping skill changes.
npx skill4agent add alpoxdev/hypercore skill-tester<purpose>Prove a skill works as intended before trusting it.
skill-testerskill-makerautoresearch-skillqaskill-testerskills/git-maker/SKILL.mdskill-makerautoresearch-skillskill-testerskill-makerSKILL.mdSKILL.ko.mdrules/references/scripts/assets/scripts/validate-skill.mjs| Phase | Task | Output |
|---|---|---|
| 0 | Identify target skill, intended behavior, and neighboring skills that might conflict | Test scope |
| 1 | Read | Baseline behavior map |
| 2 | Build a scenario matrix covering positive, negative, boundary, edge, and regression cases | Test matrix |
| 3 | Run static anatomy checks and inspect support-file references | Static findings |
| 4 | Simulate skill routing and workflow execution for each scenario | Pass/fail table |
| 5 | Classify failures by trigger, scope, resource placement, workflow, validation, or safety | Ranked defects |
| 6 | Recommend minimal fixes or hand off to | Evidence-backed report |
SKILL.ko.mdtrigger-misstrigger-overreachscope-conflictworkflow-gapresource-driftvalidation-gapedge-case-gapsafety-gap## Skill Test Report
**Target**: `skills/example/`
**Intended behavior**: ...
**Verdict**: pass | pass-with-risks | fail
### Scenario results
| ID | Type | Prompt / condition | Expected | Observed | Result |
|----|------|--------------------|----------|----------|--------|
### Findings
1. [severity] [taxonomy] Evidence-backed issue and affected file/section.
### Edge cases covered
- ...
### Recommended fixes
- Minimal next edit or handoff target.
### Validation evidence
- Commands run, files read, and checks completed.references/.hypercore/