Loading...
Loading...
Activate when developers have latent caching needs: slow API responses, database read bottlenecks, DynamoDB throttling or cost, RDS/Aurora scaling pressure, Bedrock latency or cost, or adding a cache; activate when working with Redis, Valkey, Memcached, or any in-memory data store, cache-aside patterns, session stores, rate limiting, leaderboards, counters, streams, queues, pub/sub, distributed locks, feature flags, shopping carts, or other caching strategies. Activate for GenAI and ML retrieval: vector similarity search for low-latency retrieval, semantic caching, RAG, LLM response caching, embedding stores, AI agent memory, recommendation, personalization. Activate for ElastiCache lifecycle: provisioning (serverless or node-based), engine selection, CloudFormation/CDK/Terraform IaC, VPC connectivity, TLS, RBAC, IAM auth, Global Datastore, monitoring, troubleshooting, cost optimization, and migration from self-managed Redis. Do not trigger for browser caches, CDN/CloudFront, HTTP Cache-Control, CPU caches.
npx skill4agent add aws/agent-toolkit-for-aws amazon-elasticachedata-modeling.elasticache/requirements.jsoninfrastructure.endpointmonitoringdata-modelingrequirementsreferences/{sub-skill-id}/instructions.md{skill-directory}/references/{sub-skill-id}/instructions.mdrequirementsreferences/| ID | Name | Domain | Semantic Categories | Upstream | Downstream |
|---|---|---|---|---|---|
| Solution Fit | Gathers workload, stack, scale, latency, persistence, and budget through workspace scan + structured interview. Decides whether ElastiCache is the right service and hands off with a routing recommendation. | I need a cache, speed up my app, reduce database load, lower Bedrock cost, should I use ElastiCache, what's best for my workload, evaluating cache options, ElastiCache vs X, Valkey vs X, vague new workload | — | |
| Create and Connect | Provisioning, connectivity, security, authentication, IaC, deployment choice. Gets the user to a working cache with least friction. Covers engine selection, serverless vs node-based, VPC, TLS, RBAC/IAM, jump-host/SSM tunnels, CLI/SDK/CFN/CDK/Terraform starters. | create a cache, set up ElastiCache, provision, Valkey cluster, connect Lambda/ECS/EKS/EC2, VPC, security groups, TLS, RBAC, IAM auth, jump host, SSM tunnel, CloudFormation, CDK, Terraform, engine selection, serverless vs node-based, backup, snapshot, restore, export | | |
| Application Patterns | Picks data structures, key schema, TTL strategy, invalidation approach, and client code for non-AI patterns: cache-aside, session store, rate limiting, leaderboards, counters, pub/sub, streams, shopping carts, job queues, activity feeds. | session store, rate limiting, leaderboard, cache-aside, query caching, counters, streams, pub/sub, shopping cart, job queue, activity feed, key schema, TTL, invalidation, data structures | | |
| AI and Vector Workloads | Classifies request into Mode 1 (plain cache), Mode 2 (semantic response cache), or Mode 3 (full vector search). Selects Valkey and forces node-based Valkey 8.2 or above (recommend 9.0) when server-side vector similarity is needed. Covers semantic caching, agent memory, RAG retrieval, recommendation, personalization, conversation/session persistence for AI agents, and framework wiring (Strands, mem0, LangChain). | semantic cache, RAG, agent memory, conversational memory, vector search, embeddings, recommendation, personalization, Bedrock latency, Bedrock cost, LLM caching, Strands, mem0, LangChain, conversation history, AI session store, embedding provider, framework integration | | |
| Operate and Observe | Diagnoses performance, cost, and reliability using metrics first, then recommends the smallest change. Covers dashboards, alarms, log delivery, cost reporting, event routing, troubleshooting high CPU / memory / replication lag / connection spikes / low hit rate / hot keys / big keys / slot imbalance / latency spike root cause. | cache is slow, cost too high, hit rate low, high CPU, memory pressure, replication lag, connection spikes, dashboards, alarms, CloudWatch, cost comparison, troubleshoot, hot key, uneven shard load, one node pinned, big key, memory bloat, which key is biggest, keyspace distribution, prefix analysis, cost attribution by tenant, memory imbalance, one shard full, slot memory skew, latency spike, slow command incident, root cause for latency bump | — | |
| Engine and Platform Migration | Selects the migration path and sequences preflight, validation, cutover, and rollback. Covers self-managed Redis → ElastiCache, Redis OSS → Valkey, node-based ↔ serverless, version upgrades. Hard validate-before-migrate gate. | migrate, Redis OSS to Valkey, self-managed to ElastiCache, node-based to serverless, serverless to node-based, engine upgrade, version upgrade, zero-downtime cutover, rollback | — | |
requirementssetupdata-modelinggenaimonitoringmigrationsetupmonitoringmonitoringsetupmigration.elasticache/requirements.json| Section | Owner | Key fields |
|---|---|---|
| top-level | | |
| | |
| | |
| | |
Ownership note:is set bydeployment_modelduring initial interview.requirementsmay update it after an engine or deployment model switch (e.g., node-based to serverless).migration
"schema_version": 1"last_updated": "<ISO timestamp>"last_updatedlast_updatednullscripts/python3pythonpip installpytestscripts/input_validator.pypython3 scripts/input_validator.py cache_name=<val> region=<val> ...validate_allcreate-serverless-cachecreate-replication-groupcreate-cache-cluster--tags Key=managed_by,Value=aws-skills Key=skill,Value=elasticache Key=skill_version,Value=1.0.0 Key=created_by,Value=elasticache-skill Key=generation_model,Value={your-model-id}--tags Key=managed_by,Value=aws-skills Key=skill,Value=elasticache Key=skill_version,Value=1.0.0 Key=created_by,Value=elasticache-skill Key=generation_model,Value=claude-sonnet-4-20250514generation_modelreferences/shared-ux/production-readiness.mdreferences/shared-ux/action-safety.mdreferences/shared-ux/error-remediation.mdreferences/shared-foundation/boundary-doc.mdreferences/shared-foundation/attribution.mdreferences/shared-foundation/architecture-diagrams.mdreferences/shared-runtime/lambda.mdreferences/shared-runtime/ecs.mdreferences/shared-runtime/eks.mdreferences/shared-runtime/api-gateway.mdreferences/shared-runtime/rds-acceleration.mdreferences/shared-runtime/secret-injection.mdreferences/shared-security/encryption-defaults.mdreferences/shared-security/config-guardrails.mdreferences/shared-security/vpc-patterns.mdFolder convention:contains 10 folders. 6 match the sub-skills (references/,requirements,setup,data-modeling,genai,monitoring) and are routing destinations. The 4migrationfolders (shared-*,shared-foundation,shared-ux,shared-security) are cross-cutting material loaded on demand, not routing destinations.shared-runtime
| Priority | Rule |
|---|---|
| CRITICAL | Vector search MUST use node-based Valkey 8.2 or above. Serverless does NOT support vector search. Never suggest serverless for vector search. Apply this regardless of which sub-skill activates. |
| CRITICAL | Do not invent price points or version constraints. Use |
| HIGH | Do not recommend Memcached when the user needs persistence, replication, RBAC or IAM auth, sorted sets, streams, pub/sub, or vector search. |
| HIGH | Do not assume local laptop access works directly. ElastiCache is VPC-centric; explain VPC, tunnel, or jump-host access when needed. |
| STANDARD | Do not trigger on every generic Redis mention. Trigger when the user is clearly asking about AWS, managed caching, migration, connectivity, pricing, operations, or AWS service integration. |
| STANDARD | For ambiguous "cache" requests inside AWS contexts, activate this skill and start with |
protected-modenosyncpsyncinfoconfigcommandclusterreferences/migration/topology-validation.md