chaos-monkey-orchestrator

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

Chaos Monkey Orchestrator

Chaos Monkey Orchestrator

This skill validates system resilience by intentionally introducing controlled failures.
本Skill通过主动引入可控故障来验证系统的恢复能力。

Capabilities

功能特性

1. Chaos Injection

1. 混沌故障注入

  • Injects network latency, terminates random service pods, or fills disk space in staging environments.
  • Simulates API timeouts and database master failures.
  • 在预发布环境中注入网络延迟、终止随机服务Pod或占满磁盘空间。
  • 模拟API超时和数据库主节点故障。

2. Resilience Validation

2. 恢复能力验证

  • Verifies that
    self-healing-orchestrator
    and
    crisis-manager
    respond correctly to the injected chaos.
  • Adheres to the "Fire Drill" protocols defined in Modern SRE Best Practices.
  • 验证
    self-healing-orchestrator
    crisis-manager
    对注入的混沌故障能否做出正确响应。
  • 遵循现代SRE最佳实践中定义的“消防演习”协议。

Usage

使用方式

  • "Run a chaos drill on the staging environment: simulate a 50% network packet loss."
  • "Terminate a random pod every hour for the next 4 hours and monitor the recovery time."
  • "在预发布环境中运行混沌演练:模拟50%的网络数据包丢失。"
  • "在接下来的4小时内,每小时终止一个随机Pod并监控恢复时间。"

Knowledge Protocol

知识协议

  • This skill adheres to the
    knowledge/orchestration/knowledge-protocol.md
    . It automatically integrates Public, Confidential (Company/Client), and Personal knowledge tiers, prioritizing the most specific secrets while ensuring no leaks to public outputs.
  • References Modern SRE Best Practices for resilience testing and fault injection standards.
  • 本Skill遵循
    knowledge/orchestration/knowledge-protocol.md
    协议。它会自动整合公开、保密(公司/客户)和个人知识层级,优先使用最具体的保密信息,同时确保不会泄露到公开输出中。
  • 参考现代SRE最佳实践中的恢复能力测试和故障注入标准。