Found 76 Skills
Expert SRE investigator for incidents and debugging. Uses hypothesis-driven methodology and systematic triage. Can query Axiom observability when available. Use for incident response, root cause analysis, production debugging, or log investigation.
Use when defining SLIs/SLOs, managing error budgets, or building reliable systems at scale. Invoke for incident management, chaos engineering, toil reduction, and capacity planning.
Observability and SRE expert. Use when setting up monitoring, logging, tracing, defining SLOs, or managing incidents. Covers Prometheus, Grafana, OpenTelemetry, and incident response best practices.
Expert-level site reliability engineering, SLOs, incident management, and operational excellence.
Use when building reliable and scalable distributed systems.
Use when building comprehensive monitoring and observability systems.
FansRevenue platform help — creator monetization through affiliate brand promotion, creator-to-creator promotion, MyEroLink link-in-bio, Creator Academy, and CrakRevenue network. Use when setting up FansRevenue brand partnerships, commission earnings aren't showing up, custom landing pages aren't converting, MyEroLink bio links need optimization, Creator Academy isn't loading, or trying to understand which offers pay highest commissions. Do NOT use for general affiliate program design strategy (use /sales-affiliate-program), influencer marketing strategy from a brand's perspective (use /sales-influencer-marketing), or CPA network selection (use /sales-performcb).
Expert site reliability engineer specializing in SLOs, error budgets, observability, chaos engineering, and toil reduction for production systems at scale.
Go back through the previous year of work and create a Notion doc that groups relevant links into projects that can then be documented as SRED projects.
Take a list of projects and their related documentation, and organize them into the SRED format for submission.
Expert Site Reliability Engineer specializing in SLOs, error budgets, and reliability engineering practices. Proficient in incident management, post-mortems, capacity planning, and building scalable, resilient systems with focus on reliability, availability, and performance.
Manage the full lifecycle of Alibaba Cloud EMR Serverless StarRocks instances — create, scale, configure, maintain and diagnose. Use this Skill when operations engineers, SREs, or architects need to manage StarRocks instances. Typical scenarios include: "create a StarRocks", "check instance status", "scale up CU", "modify configuration", "restart instance", "diagnose issues", etc. Not applicable for: writing SQL/DDL, data import/export, query tuning, materialized view configuration, or managing non-StarRocks products (EMR clusters, Spark, Milvus, ClickHouse, Doris, RDS, ECS).