Session Reflection
Value: Feedback -- every user intervention is a signal that the system
prompt is incomplete. Turning corrections into durable instructions creates
compound improvement across sessions.
Purpose
Teaches agents to analyze session history for recurring corrections, generate
project-specific instructions that prevent known failure modes, and maintain
working state that survives context compaction and crashes. Transforms reactive
corrections into proactive prevention.
Practices
Reflection Triggers
Reflect after: milestones (PR merged, feature complete), 3+ repeated
corrections from the user, session restart or crash recovery, every 5
completed tasks, and on explicit user request. Do not wait for a "good time"
-- reflect when triggered.
Analyze Session History
Examine conversation history, git log, memory files, WORKING_STATE.md, and
session logs. Categorize each user intervention into one of five types:
- Correction: Agent did the wrong thing (instruction gap)
- Repetition: Agent was told the same thing again (emphasis gap)
- Role Redirect: Agent stepped outside its role (boundary gap)
- Frustration Escalation: User became more forceful (decay problem)
- Workaround: User did it themselves (skill gap)
See
references/analysis-framework.md
for detailed categorization and
prioritization.
Generate or Refine Project Instructions
Route project-specific directives into the appropriate instruction file
based on content type. There is no separate system prompt file — everything
flows from CLAUDE.md → AGENTS.md → .team/coordinator-instructions.md:
- CLAUDE.md — Session management (startup procedure, compaction recovery,
state tracking), harness-specific configuration
- AGENTS.md — General project rules, dos/don'ts, coding conventions,
workflow configuration
- .team/coordinator-instructions.md — Coordinator/pipeline-controller role
distinctions, build pre-flight gates, spawn discipline, domain review
checklists
Refinement rules: add new items for new gaps, promote advisory to structural
when gaps recur, rewrite ambiguous items for clarity. Never remove items
until the gap is confirmed solved across 3+ sessions. See
references/system-prompt-patterns.md
for writing patterns and
references/launcher-templates.md
for harness-specific file routing.
Self-Reminder Protocol
Every 5-10 messages during long sessions, re-read: project instructions
(CLAUDE.md, AGENTS.md, coordinator-instructions.md as applicable), role
constraints, current task context, WORKING_STATE.md. Mandatory after context
compaction (you lose implicit context). This is the primary defense against
role drift in long sessions.
Working State Persistence
Maintain WORKING_STATE.md as insurance against context compaction and crashes.
Update after every significant state change. Always read before acting after
any interruption. Location:
.factory/WORKING_STATE.md
(pipeline) or project
root (standalone). See
references/working-state-schema.md
for format.
Do:
- Update on task start, phase change, decision made, blocker encountered
- Overwrite with current state (not append)
- Read FIRST after any interruption
Do not:
- Guess state from memory after a compaction or restart
- Use as a journal -- keep it concise and current
- Skip updates because "nothing important changed"
Post-Session Learning Loop
At session end: identify patterns from this session, update memory files,
refine project instructions if triggers were hit, archive working state.
The loop closes when the same category of intervention stops recurring. If
an intervention category persists across 3+ sessions after instruction
refinement, escalate: the gap may require a new skill or a structural
change to the workflow.
Enforcement Note
This skill provides advisory guidance. The self-reminder protocol and working
state maintenance require agent self-discipline. On Claude Code, a
pre-tool-use hook could mechanically enforce state file reads, but no such
hook is provided by default. The reflection triggers are event-driven and
cannot be mechanically enforced. If you observe the agent drifting from its
role or repeating corrected mistakes, point out the pattern and suggest a
reflection.
Verification
After completing work guided by this skill, verify:
If any criterion is not met, revisit the relevant practice before proceeding.
Dependencies
This skill works standalone. For enhanced workflows, it integrates with:
- memory-protocol: Persistent storage for session learnings and working state
- agent-coordination: Coordination patterns referenced in instruction generation
- pipeline: Pipeline controller benefits from self-reminder and crash recovery
- ensemble-team: Team retrospectives feed into session reflection analysis
Missing a dependency? Install with:
npx skills add jwilger/agent-skills --skill memory-protocol