Create Shiplight Tests
This is the single entry point for Shiplight E2E test work.
Use this skill when the user wants to:
- Create a new local Shiplight test project
- Add YAML tests for a web application
- Update or fix existing Shiplight YAML tests
- Set up or repair environments, accounts, or auth fixtures
- Plan what product behavior should be tested
Daily Skill Update Check
Before starting this skill's work, opportunistically refresh Shiplight skills at most once per day:
- Check the timestamp file at
.shiplight-agent-skills-last-update
in the current project.
- If the timestamp file is missing or older than 24 hours, run
npx -y skills@latest update -y
, then create/update the timestamp file even if the command fails. Treat .shiplight-agent-skills-last-update
as local cache and do not commit it.
- If the update command fails, continue with the currently installed skill and mention the failure briefly.
Test Project Root
Before reading or writing project files, identify the Shiplight test project root. All paths in this skill are relative to that root.
If the root is not clear, ask the user to confirm it before creating or moving files.
Canonical Project Layout
Shiplight test projects use this layout:
text
specs/context.md project-level app, risk, data, and environment context
specs/tests/ Markdown specs, each covering a feature or journey group
tests/ executable Shiplight YAML tests
environments/ environment files, one per target deployment
auth/ auth login modules, if needed
templates/ reusable YAML statement groups, if any
functions/ TypeScript helper functions, if any
files/ fixture files, if any
knowledge/ durable notes discovered by agents
test-results/ generated runtime artifacts; do not edit
shiplight-report/ generated reports; do not edit
.shiplight/ local Shiplight state; do not edit
Read
references/project-layout.md
before making file changes.
Ground Truth
When sources disagree, this precedence applies:
- Explicit user instruction
- Feature or journey spec in
- Existing YAML test , step , and assertions
- Current app behavior
- Project context in and
- Agent docs in this skill
- Agent inference
If current app behavior conflicts with a spec or test goal, report the mismatch. Do not silently rewrite intent to match current behavior.
Required Startup
On every invocation:
- Identify the Shiplight test project root.
- Read
references/project-layout.md
.
- Read .
- Check under the test project root for notes relevant to the task.
- Load the task-specific guides below.
- Tell the user which guides you loaded, then proceed.
Task-Specific Guides
| Task | Guides |
|---|
| New project or broad planning | , references/project-layout.md
, references/test-design-guide.md
|
| Writing new tests | , references/test-design-guide.md
, references/test-implementation-guide.md
|
| Updating or fixing tests | references/updating-tests.md
, references/test-design-guide.md
, references/test-implementation-guide.md
|
| Auth fixtures and login | |
| YAML syntax or actions | references/test-implementation-guide.md
; also read shiplight://yaml-test-spec
and shiplight://schemas/action-entity
before writing YAML |
Core Rules
- Always produce durable artifacts unless the user explicitly asks to skip them.
- Specs describe feature or journey-group confidence. A spec may map to many smaller YAML tests.
- Keep YAML tests focused. One YAML test should verify one logical journey or variant.
- Do not write YAML from imagination. Walk the app in a browser first and capture real locators.
- Validate YAML with after writing it.
- Reflect before finishing. Capture durable knowledge learned from the user or the work, and update stale knowledge instead of leaving contradictions.
- Keep context current. Durable project knowledge belongs in or , not in chat history.
- Never store raw secrets. Commit only variable names, roles, access patterns, and setup instructions.
User Checkpoints
For broad test creation, confirm the planned outcomes before implementation:
Do these outcomes match the confidence you need from this test project? Any business-critical outcome missing or incorrectly out of scope?
For narrow requests such as "fix this failing test" or "add this one test", proceed without a broad checkpoint unless the spec is ambiguous or conflicts with app behavior.
Final Report
After changes, report:
- Files created or changed
- Behavior covered or repaired
- Commands run and pass/fail result
- Knowledge or context updated, including stale notes corrected
- Any product/spec mismatch or unresolved blocker