Loading...
Loading...
Systematic incident investigation methodology. Use when investigating production issues, service degradation, errors, latency spikes, or outages.
npx skill4agent add incidentfox/incidentfox investigatequery_datadog_metricsget_cloudwatch_metricsdetect_anomaliescorrelate_metricsfind_change_pointfilter @message like /ERROR/ | stats count(*) by bin(5m)get_pod_eventsget_pod_logslist_podsget_pod_resources**Root Cause**: [Specific, actionable cause]
**Evidence**:
- [Metric/log/event that supports the cause]
- [Correlation or change point identified]
- [Timeline of events]
**Confidence**: [High/Medium/Low - explain why]
**Recommended Actions**:
1. Immediate: [Use propose_* tools if applicable]
2. Short-term: [Follow-up investigation or fixes]
3. Long-term: [Prevention measures]
**Caveats**: [What you couldn't determine]