Loading...
Loading...
Design hypothesis-driven ML/AI experiments before running them. Use this skill whenever the user wants to plan experiments, ablations, baselines, metrics, controls, seeds, logging, stop conditions, reviewer-proof evidence, or an experiment matrix for a paper claim before using run-experiment or writing results.
npx skill4agent add a-green-hand-jack/ml-research-skills experiment-design-plannerresearch-project-memoryrun-experimentexperiment-report-writerpaper-reviewer-simulatorbaseline-selection-auditfigure-results-review<installed-skill-dir>/
├── SKILL.md
└── references/
├── ablation-matrix.md
├── evidence-standards.md
├── metrics-and-controls.md
└── report-template.mdreferences/evidence-standards.mdreferences/metrics-and-controls.mdreferences/ablation-matrix.mdreferences/report-template.mdmemory/research-project-memorysingleablationbenchmarktheorydiagnosticVague: Does our method work?
Testable: Does component X improve metric M over baseline B on datasets D1/D2 under the same training budget?references/evidence-standards.mdreferences/metrics-and-controls.mdreferences/ablation-matrix.mdreferences/report-template.mddocs/experiments/experiment_plan_YYYY-MM-DD_<short-name>.mdinit-python-projectnew-workspacedocs/reports/experiment_plan_YYYY-MM-DD_<short-name>.mdrun-experimentexperiment-report-writerresearch-project-memorymemory/evidence-board.mdEVD-###EXP-###memory/claim-board.mdmemory/risk-board.mdmemory/action-board.md.agent/worktree-status.mdplannedmemory/