Autonomous Goal-directed Iteration. Apply Karpathy's autoresearch principles to ANY task. Loops autonomously — modify, verify, keep/discard, repeat. Supports optional loop count via Claude Code's /loop command.
npx skill4agent add uditgoenka/autoresearch autoresearch

| Subcommand | Purpose |
|---|---|
| `/autoresearch` | Run the autonomous loop (default) |
| `/autoresearch:plan` | Interactive wizard to build Scope, Metric, Direction & Verify from a Goal |
references/plan-workflow.md

/autoresearch:plan
Goal: Make the API respond faster
/autoresearch:plan Increase test coverage to 95%
/autoresearch:plan Reduce bundle size below 200KB

`/autoresearch`, `/ug:autoresearch`, `/autoresearch:plan`, `/loop`

Requires: Claude Code v1.0.32+ (the `/loop` command was introduced in this version)

/autoresearch
Goal: Increase test coverage to 90%

/loop 25 /autoresearch
Goal: Increase test coverage to 90%

| Scenario | Recommendation |
|---|---|
| Run overnight, review in morning | Unlimited (default) |
| Quick 30-min improvement session | |
| Targeted fix with known scope | |
| Exploratory — see if approach works | |
| CI/CD pipeline integration | |
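
The two invocation forms shown above (with and without a `/loop` count) can be told apart mechanically. The following is a minimal sketch only: `parse_invocation` is a hypothetical helper introduced for illustration, not part of Claude Code or the skill, and it assumes the count is a positive integer placed directly after `/loop`, as in the example above.

```python
from typing import Optional, Tuple


def parse_invocation(line: str) -> Tuple[Optional[int], str]:
    """Split an invocation into (iteration count, command).

    Hypothetical helper for illustration only. A count of None stands for
    the unlimited default.
    """
    parts = line.split()
    if not parts:
        raise ValueError("empty invocation")
    if parts[0] == "/loop":
        # Assumed form: /loop <positive integer> <command>
        if len(parts) < 3 or not parts[1].isdigit() or int(parts[1]) < 1:
            raise ValueError("expected: /loop <positive integer> <command>")
        return int(parts[1]), " ".join(parts[2:])
    return None, " ".join(parts)


print(parse_invocation("/loop 25 /autoresearch"))  # (25, '/autoresearch')
print(parse_invocation("/autoresearch"))           # (None, '/autoresearch')
```
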
references/results-logging.md
references/autonomous-loop-protocol.md

LOOP (FOREVER or N times):
1. Review: Read current state + git history + results log
2. Ideate: Pick next change based on goal, past results, what hasn't been tried
3. Modify: Make ONE focused change to in-scope files
4. Commit: Git commit the change (before verification)
5. Verify: Run the mechanical metric (tests, build, benchmark, etc.)
6. Decide:
   - IMPROVED → Keep commit, log "keep", advance
   - SAME/WORSE → Git revert, log "discard"
   - CRASHED → Try to fix (max 3 attempts), else log "crash" and move on
7. Log: Record result in results log
8. Repeat: Go to step 1.
   - If unbounded: NEVER STOP. NEVER ASK "should I continue?"
   - If bounded (N): Stop after N iterations, print final summary

references/core-principles.md

| Domain | Metric | Scope | Verify Command |
|---|---|---|---|
| Backend code | Tests pass + coverage % | | |
| Frontend UI | Lighthouse score | | |
| ML training | val_bpb / loss | | |
| Blog/content | Word count + readability | | Custom script |
| Performance | Benchmark time (ms) | Target files | |
| Refactoring | Tests pass + LOC reduced | Target module | |
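
The loop protocol described earlier (modify, commit, verify, then keep or discard) can be sketched in a few lines. This is an illustrative sketch under stated assumptions, not the skill's implementation: `run_loop`, `LoopResult`, and `is_improved` are hypothetical names, the `modify`, `commit`, `revert`, and `verify` callables are placeholders for the real editing, Git, and metric steps, and the "try to fix (max 3 attempts)" branch is simplified to an immediate revert plus a "crash" log entry.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class LoopResult:
    """One results-log entry: iteration number, metric, and decision."""
    iteration: int
    metric: Optional[float]
    status: str  # "keep", "discard", or "crash"


def is_improved(new: float, best: float, higher_is_better: bool) -> bool:
    """Strict improvement only: an unchanged metric counts as SAME."""
    return new > best if higher_is_better else new < best


def run_loop(
    modify: Callable[[int], None],   # hypothetical: make ONE focused change
    commit: Callable[[int], None],   # hypothetical: e.g. wraps `git commit`
    revert: Callable[[], None],      # hypothetical: e.g. wraps `git revert`
    verify: Callable[[], float],     # hypothetical: runs the metric command
    baseline: float,
    higher_is_better: bool = True,
    max_iterations: Optional[int] = None,  # None means unbounded
) -> List[LoopResult]:
    log: List[LoopResult] = []
    best = baseline
    i = 0
    while max_iterations is None or i < max_iterations:
        i += 1
        modify(i)
        commit(i)  # commit before verification, as in step 4
        try:
            metric = verify()
        except Exception:
            # Assumption: a crash is reverted without fix attempts.
            revert()
            log.append(LoopResult(i, None, "crash"))
            continue
        if is_improved(metric, best, higher_is_better):
            best = metric
            log.append(LoopResult(i, metric, "keep"))
        else:
            revert()
            log.append(LoopResult(i, metric, "discard"))
    return log


if __name__ == "__main__":
    # Simulated coverage percentages stand in for a real verify command.
    scores = iter([81.0, 79.5, 84.0])
    results = run_loop(
        modify=lambda i: None, commit=lambda i: None, revert=lambda: None,
        verify=lambda: next(scores), baseline=80.0, max_iterations=3,
    )
    print([r.status for r in results])  # ['keep', 'discard', 'keep']
```

For metrics where lower is better, such as benchmark time in the table above, the same sketch applies with `higher_is_better=False`.
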