Search Results: incident-management

Found 18 Skills

DevOps & Cloud Services | jeffallan/claude-skills

sre-engineer

Use when defining SLIs/SLOs, managing error budgets, or building reliable systems at scale. Invoke for incident management, chaos engineering, toil reduction, capacity planning.

DevOps & Cloud Services | composiohq/awesome-claude...

pagerduty-automation

Automate PagerDuty tasks via Rube MCP (Composio): manage incidents, services, schedules, escalation policies, and on-call rotations. Always search tools first for current schemas.

DevOps & Cloud Services | personamanagmentlayer/pcl

sre-expert

Expert-level site reliability engineering, SLOs, incident management, and operational excellence

DevOps & Cloud Services | aj-geddes/useful-ai-promp...

root-cause-analysis

Conduct systematic root cause analysis to identify underlying problems. Use structured methodologies to prevent recurring issues and drive improvements.

DevOps & Cloud Services | anthropics/knowledge-work...

incident-response

Triage and manage production incidents. Trigger with "we have an incident", "production is down", "something is broken", "there's an outage", "SEV1", or when the user describes a production issue needing immediate response.

DevOps & Cloud Services | yuyz0112/public-api-skill...

pagerduty-api

Describes the PagerDuty REST APIs. Use when working with the PagerDuty API or when the user needs to interact with it.

DevOps & Cloud Services | sickn33/antigravity-aweso...

incident-responder

Expert SRE incident responder specializing in rapid problem resolution, modern observability, and comprehensive incident management. Masters incident command, blameless post-mortems, error budget management, and system reliability patterns. Handles critical outages, communication strategies, and continuous improvement. Use IMMEDIATELY for production incidents or SRE practices.

DevOps & Cloud Services | 404kidwiz/claude-supercod...

devops-incident-responder

Expert in SRE practices, incident management, root cause analysis, and automated remediation.

DevOps & Cloud Services | membranedev/application-s...

better-stack

Better Stack integration. Manage incidents, users, and teams. Use when the user wants to interact with Better Stack data.

DevOps & Cloud Services | bobmatnyc/claude-mpm-skil...

emergency-release-workflow

Emergency release workflow for critical bug fixes and security patches. Use when production issues require fast-track deployment.

DevOps & Cloud Services | borghei/claude-skills

delivery-manager

Expert delivery management covering continuous delivery, release management, deployment coordination, and service operations.

Product & Design | kostja94/marketing-skills

status-page-generator

Use when the user wants to create, optimize, or structure a status page, or mentions "status page," "status.yourdomain.com," "uptime," "service health," "incident page," or "system status."
