Search Results: cluster-management

Found 30 Skills

DevOps & Cloud Servicesrohitg00/kubectl-mcp-serv...

k8s-vind

Manage vCluster (virtual Kubernetes clusters) instances using vind. Use when creating, managing, or operating lightweight virtual clusters for development, testing, or multi-tenancy.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicessundial-org/awesome-openc...

kubernetes

Comprehensive Kubernetes and OpenShift cluster management skill covering operations, troubleshooting, manifest generation, security, and GitOps. Use this skill when: (1) Cluster operations: upgrades, backups, node management, scaling, monitoring setup (2) Troubleshooting: pod failures, networking issues, storage problems, performance analysis (3) Creating manifests: Deployments, StatefulSets, Services, Ingress, NetworkPolicies, RBAC (4) Security: audits, Pod Security Standards, RBAC, secrets management, vulnerability scanning (5) GitOps: ArgoCD, Flux, Kustomize, Helm, CI/CD pipelines, progressive delivery (6) OpenShift-specific: SCCs, Routes, Operators, Builds, ImageStreams (7) Multi-cloud: AKS, EKS, GKE, ARO, ROSA operations

🇺🇸|EnglishTranslated

6 scripts/Attention

DevOps & Cloud Servicesawslabs/agent-plugins

hyperpod-ssm

Remote command execution and file transfer on SageMaker HyperPod cluster nodes via AWS Systems Manager (SSM). This is the primary interface for accessing HyperPod nodes — direct SSH is not available. Use when any skill, workflow, or user request needs to execute commands on cluster nodes, upload files to nodes, read/download files from nodes, run diagnostics, install packages, or perform any operation requiring shell access to HyperPod instances. Other HyperPod skills depend on this skill for all node-level operations.

🇺🇸|EnglishTranslated

3 scripts/Attention

DevOps & Cloud Servicesvllm-project/vllm-skills

vllm-deploy-k8s

Deploy vLLM to Kubernetes (K8s) with GPU support, health probes, and OpenAI-compatible API endpoint. Use this skill whenever the user wants to deploy, run, or serve vLLM on a Kubernetes cluster, including creating deployments, services, checking existing deployments, or managing vLLM on K8s.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesspjoshis/claude-code-plug...

kubernetes-orchestration

Master Kubernetes with pods, deployments, services, ingress, ConfigMaps, secrets, and production cluster management.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesmicrosoftdocs/agent-skill...

azure-service-fabric

Expert knowledge for Azure Service Fabric development including troubleshooting, best practices, decision making, architecture & design patterns, limits & quotas, security, configuration, integrations & coding patterns, and deployment. Use when building Service Fabric clusters, Reliable Actors/Collections, reverse proxy, remoting, or Azure-integrated apps, and other Azure Service Fabric related development tasks. Not for Azure Kubernetes Service (AKS) (use azure-kubernetes-service), Azure App Service (use azure-app-service), Azure Container Apps (use azure-container-apps), Azure Red Hat OpenShift (use azure-redhat-openshift).

🇺🇸|EnglishTranslated

DevOps & Cloud Servicestruefoundry/tfy-deploy-sk...

truefoundry-workspaces

Lists TrueFoundry workspaces and clusters. Provides workspace FQNs for deployment, cluster connectivity status, available GPU types, and base domains.

🇺🇸|EnglishTranslated

2 scripts/Attention

DevOps & Cloud Servicesaffaan-m/everything-claud...

uncloud

Use when managing an Uncloud cluster — deploying services, configuring Caddy ingress, adding static proxy routes for non-cluster devices, publishing ports, scaling, inspecting logs, or managing machines and volumes with the `uc` CLI.

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesancoleman/ai-design-compo...

implementing-gitops

Implement GitOps continuous delivery for Kubernetes using ArgoCD or Flux. Use for automated deployments with Git as single source of truth, pull-based delivery, drift detection, multi-cluster management, and progressive rollouts.

🇺🇸|EnglishTranslated

4 scripts/Attention

DevOps & Cloud Servicesrohitg00/kubectl-mcp-serv...

k8s-troubleshoot

Debug Kubernetes pods, nodes, and workloads. Use when pods are failing, containers crash, nodes are unhealthy, or users mention debugging, troubleshooting, or diagnosing Kubernetes issues.

🇺🇸|EnglishTranslated

2 scripts/Attention

Data Processingpersonamanagmentlayer/pcl

databricks-expert

Expert-level Databricks platform, Apache Spark, Delta Lake, MLflow, notebooks, and cluster management

🇺🇸|EnglishTranslated

DevOps & Cloud Servicesaliyun/alibabacloud-aiops...

alibabacloud-emr-cluster-manage

Manage the full lifecycle of Alibaba Cloud E-MapReduce (EMR) ECS clusters—creation, scaling, renewal, and status queries. Use this Skill when users want to set up big data clusters, view cluster status, add nodes, release nodes, configure auto-scaling, check cluster and node states, or diagnose creation failures. Also applicable for scenarios like "create a Hadoop cluster", "data lake cluster", "running out of resources", "check my cluster", "renew", etc. NOTE: This Skill does NOT support cluster deletion, release, or termination under any circumstances. Any request to delete or terminate a cluster will be refused and redirected to the EMR console.

🇺🇸|EnglishTranslated