Search Results: observability

Found 389 Skills

DevOps & Cloud Serviceszhanghandong/rust-skills

domain-cloud-native

Use when building cloud-native apps. Keywords: kubernetes, k8s, docker, container, grpc, tonic, microservice, service mesh, observability, tracing, metrics, health check, cloud, deployment, 云原生, 微服务, 容器

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesaj-geddes/useful-ai-promp...

prometheus-monitoring

Set up Prometheus monitoring for applications with custom metrics, scraping configurations, and service discovery. Use when implementing time-series metrics collection, monitoring applications, or building observability infrastructure.

🇺🇸|EnglishTranslated

AI & Machine Learningsickn33/antigravity-aweso...

llm-app-patterns

Production-ready patterns for building LLM applications. Covers RAG pipelines, agent architectures, prompt IDEs, and LLMOps monitoring. Use when designing AI applications, implementing RAG, building agents, or setting up LLM observability.

🇺🇸|EnglishTranslated

Backend Developmentvasilyu1983/ai-agents-pub...

software-backend

Production-grade backend service development across Node.js (Express/Fastify/NestJS/Hono), Bun, Python (FastAPI), Go, and Rust (Axum), with PostgreSQL and common ORMs (Prisma/Drizzle/SQLAlchemy/GORM/SeaORM). Use for REST/GraphQL/tRPC APIs, auth (OIDC/OAuth), caching, background jobs, observability (OpenTelemetry), testing, deployment readiness, and zero-trust defaults.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesgiuseppe-trisciuoglio/dev...

aws-cloudformation-cloudwatch

AWS CloudFormation patterns for CloudWatch monitoring, metrics, alarms, dashboards, logs, and observability. Use when creating CloudWatch metrics, alarms, dashboards, log groups, log subscriptions, anomaly detection, synthesized canaries, Application Signals, and implementing template structure with Parameters, Outputs, Mappings, Conditions, cross-stack references, and CloudWatch best practices for monitoring production infrastructure.

🇺🇸|EnglishTranslated

Backend Developmentahgraber/skills

python-runtime-operations

Use when building or reviewing service, job, or CLI runtime behavior in Python — designing startup validation, shutdown sequences, observability, and structured logging. Also use when startup crashes from late config, shutdown leaves orphaned processes, terminal states are implicit, or logs lack structure.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesbobmatnyc/claude-mpm-skil...

digitalocean-management

DigitalOcean management services for monitoring, uptime checks, and resource organization with Projects. Use when setting up observability, alerts, and operational visibility on DigitalOcean.

🇺🇸|EnglishTranslated

Backend Developmentabsolutelyskilled/absolut...

backend-engineering

Use this skill when designing backend systems, databases, APIs, or services. Triggers on schema design, database migrations, indexing strategies, distributed systems architecture, microservices, caching, message queues, observability setup, logging, metrics, tracing, SLO/SLI definition, performance optimization, query tuning, security hardening, authentication, authorization, API design (REST, GraphQL, gRPC), rate limiting, pagination, and failure handling patterns. Acts as a senior backend engineering advisor for mid-level engineers leveling up.

🇺🇸|EnglishTranslated

AI & Machine Learninggyanesh-m/skills

galileo-python-sdk

Complete reference for the Galileo AI platform Python SDK for evaluating, observing, and protecting GenAI applications. Use when building Python applications that need LLM evaluation, production observability, tracing, or runtime guardrails with Galileo.

🇺🇸|EnglishTranslated

DevOps & Cloud Serviceswshobson/agents

distributed-tracing

Implement distributed tracing with Jaeger and Tempo to track requests across microservices and identify performance bottlenecks. Use when debugging microservices, analyzing request flows, or implementing observability for distributed systems.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesaj-geddes/useful-ai-promp...

service-mesh-implementation

Implement service mesh (Istio, Linkerd) for service-to-service communication, traffic management, security, and observability.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicessickn33/antigravity-aweso...

error-debugging-error-analysis

You are an expert error analysis specialist with deep expertise in debugging distributed systems, analyzing production incidents, and implementing comprehensive observability solutions.

🇺🇸|EnglishTranslated