data-center-design-execution-lead


Data Center Design Execution Lead

When to Use

  • Define requirements for a new data hall, cage, or colo deployment
  • Compare build vs colo vs cloud extension for a workload footprint
  • Size power, cooling, and rack count for current and 3–5 year growth
  • Review mechanical/electrical single-line diagrams and rack elevations
  • Plan hot/cold aisle, containment, and cable pathways
  • Specify network demarc, cross-connects, and latency to cloud regions
  • Run design → procurement → install → commissioning → handoff
  • Prepare acceptance tests and operations runbooks for the facilities team

When NOT to Use

  • Terraform, cloud networking, managed Kubernetes → infrastructure-engineer
  • Helm, cluster add-ons, pod troubleshooting → cluster-deployment-engineer
  • Application integration ADRs → senior-system-architecture
  • Multi-team software program RAID → technical-program-manager
  • SOC 2 control catalog without facility scope → compliance-engineer
  • Physical security policy program only → cybersecurity, information-security-engineer
  • Executive comms for launches → communication-lead
  • Utilization, GPU supply forecast, consolidation, refresh → data-center-compute-supply-efficiency
  • Multi-site roadmap, portfolio funding, steering → data-center-portfolio-planning-execution-lead
  • Delivery schedule, rack-ready gates, construction RAID → senior-data-center-capacity-delivery-manager

Related skills

Need | Skill
Hybrid on-prem virtualization patterns | infrastructure-engineer
Workloads hosted in on-prem K8s | cluster-deployment-engineer
Large build program coordination | technical-program-manager
Physical/logical security controls | cybersecurity, information-security-engineer
Compliance evidence for facilities | compliance-engineer
Stakeholder and exec updates | communication-lead
Rollout of apps into new site | deployment-strategist
Compute supply and resource efficiency | data-center-compute-supply-efficiency
Enterprise DC portfolio planning | data-center-portfolio-planning-execution-lead
Capacity delivery program | senior-data-center-capacity-delivery-manager
On-site install, labeling, acceptance | field-services-engineer

Core Workflows

1. Requirements and site strategy

Capture:
  • Workload type (enterprise IT, HPC/GPU, edge)
  • Availability target (tier intent, RTO/RPO for facility)
  • Growth: racks, kW, network ports by year
  • Constraints: geography, latency to users/cloud, sustainability, budget
Decide: own build, colo cage, or cloud-only with a small edge footprint.
See references/facility_requirements.md.
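The growth capture (racks and kW by year) can be tabulated with a short projection. This is a minimal sketch that assumes a constant compound annual growth rate and a fixed kW per rack; the function name `project_growth` and all input values are hypothetical, not taken from the reference file.

```python
import math

def project_growth(start_racks, kw_per_rack, annual_growth, years):
    """Return (year, racks, total_kw) rows; year 0 is the current footprint.

    Assumes compound growth and a constant kW per rack (both simplifications).
    """
    rows = []
    for year in range(years + 1):
        # Round up: a partially filled rack still needs a full position.
        racks = math.ceil(start_racks * (1 + annual_growth) ** year)
        rows.append((year, racks, racks * kw_per_rack))
    return rows

# Hypothetical inputs: 20 racks today, 8 kW each, 25% growth per year.
rows = project_growth(start_racks=20, kw_per_rack=8.0, annual_growth=0.25, years=5)
for year, racks, total_kw in rows:
    print(f"year {year}: {racks} racks, {total_kw:.0f} kW")
```

Real plans usually replace the single growth rate with per-year commitments from workload owners; the table shape stays the same.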

2. Power and cooling

  • Design power chain: utility → ATS/STS → UPS → PDU → rack
  • Size for peak kW/rack (air vs liquid, GPU trays)
  • Cooling: CRAC/CRAH, containment, setpoints per ASHRAE class
  • Target PUE and measurement points
See references/power_cooling.md.
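The sizing and PUE items above reduce to a few lines of arithmetic. The sketch below assumes that essentially all IT power becomes heat the cooling plant must remove, and that the UPS is sized so the IT load stays at or below a chosen loading fraction; the 0.8 default and the function names are illustrative assumptions, not values from the reference file. PUE is total facility power divided by IT power.

```python
KW_PER_TON = 3.517  # 1 ton of refrigeration = 12,000 BTU/h, about 3.517 kW

def size_power_cooling(racks, peak_kw_per_rack, ups_max_load_fraction=0.8):
    """Rough first-pass sizing from rack count and peak kW per rack."""
    it_kw = racks * peak_kw_per_rack
    return {
        "it_kw": it_kw,
        # UPS capacity needed so the IT load sits at the chosen fraction.
        "ups_kw": it_kw / ups_max_load_fraction,
        # Heat to reject, assuming IT power is fully converted to heat.
        "cooling_tons": it_kw / KW_PER_TON,
    }

def pue(total_facility_kw, it_kw):
    """Power Usage Effectiveness: total facility power / IT power."""
    if it_kw <= 0:
        raise ValueError("IT load must be positive")
    return total_facility_kw / it_kw

# Hypothetical hall: 40 racks at 10 kW peak, 560 kW measured at the utility feed.
sizing = size_power_cooling(racks=40, peak_kw_per_rack=10.0)
print(sizing)
print(f"PUE: {pue(560.0, sizing['it_kw']):.2f}")
```

The measurement points matter: the numerator should come from the utility or switchgear meter and the denominator from the PDU or UPS output, otherwise the ratio is not comparable between sites.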

3. Physical layout

  • Rack rows, aisle width, floor load (lbs/sq ft)
  • Hot/cold containment; overhead vs underfloor delivery
  • Cable trays, fiber pathways, ladder rack
  • Staging, spares, and restricted zones
See references/physical_layout.md.
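A quick floor-load screen in lbs/sq ft can be sketched as follows. It treats the rack weight as spread evenly over its own footprint, which is a simplification: a structural review also considers point loads at casters or feet and how load spreads into the aisles. The rack dimensions and floor rating below are hypothetical.

```python
def floor_load_psf(rack_weight_lbs, width_in, depth_in):
    """Distributed load over the rack footprint, in lbs per square foot."""
    area_sqft = (width_in * depth_in) / 144.0  # 144 square inches per square foot
    return rack_weight_lbs / area_sqft

def within_rating(rack_weight_lbs, width_in, depth_in, floor_rating_psf):
    """True if the footprint load does not exceed the floor rating."""
    return floor_load_psf(rack_weight_lbs, width_in, depth_in) <= floor_rating_psf

# Hypothetical rack: 2,000 lbs loaded, 24 in wide by 48 in deep, 250 psf floor.
print(floor_load_psf(2000, 24, 48))
print(within_rating(2000, 24, 48, floor_rating_psf=250))
```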

4. Network and connectivity

  • Meet-me room, carrier diversity, cross-connect model
  • Core/distribution/ToR switching for facility (distinct from app network design in cloud)
  • Latency and bandwidth to primary cloud region for hybrid
See references/network_connectivity.md.
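For the hybrid latency item, a lower bound on round-trip time can be estimated from the fiber route length, since light in optical fiber travels at roughly two thirds of its vacuum speed, or about 5 µs per km one way. The sketch below is an estimate under that assumption; the per-hop delay parameter and the example distances are hypothetical, and measured latency will be higher because of routing, queuing, and provider equipment.

```python
FIBER_US_PER_KM = 5.0  # approximate one-way propagation delay in optical fiber

def fiber_rtt_ms(route_km, hops=0, per_hop_us=0.0):
    """Estimated round-trip time in milliseconds over a fiber route.

    route_km is the actual fiber path length, which is usually longer
    than the straight-line distance between the two sites.
    """
    one_way_us = route_km * FIBER_US_PER_KM + hops * per_hop_us
    return 2 * one_way_us / 1000.0

# Hypothetical: 100 km of fiber to the cloud on-ramp, 4 devices at 10 µs each.
print(f"{fiber_rtt_ms(100):.2f} ms propagation only")
print(f"{fiber_rtt_ms(100, hops=4, per_hop_us=10.0):.2f} ms with device delay")
```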

5. Execution and commissioning

Phase | Deliverables
Design | BoM, drawings, capacity model sign-off
Procure | Long-lead gear tracked (switching, PDUs, genset)
Install | MEP fit-out, rack/stack, labeling standard
Commission | IST/FAT, red-tag clearance, integrated tests
Handoff | DCIM, monitoring, as-built, ops training
See references/execution_commissioning.md.
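One way to keep the Commission-to-Handoff gate explicit is to track each test as a record with an owner and a pass/fail state, and block handoff while anything is open. This is a sketch only; the `CheckItem` fields and the sample test names are assumptions for illustration, not the checklist from the reference file.

```python
from dataclasses import dataclass

@dataclass
class CheckItem:
    name: str
    owner: str
    passed: bool

def open_items(checklist):
    """Items that have not passed yet, in checklist order."""
    return [item for item in checklist if not item.passed]

def ready_for_handoff(checklist):
    """Handoff requires a non-empty checklist with every item passed."""
    return bool(checklist) and not open_items(checklist)

# Hypothetical commissioning items.
checklist = [
    CheckItem("UPS transfer on utility loss", owner="electrical", passed=True),
    CheckItem("CRAH failover under load", owner="mechanical", passed=False),
    CheckItem("Red-tag clearance", owner="site lead", passed=True),
]
for item in open_items(checklist):
    print(f"OPEN: {item.name} (owner: {item.owner})")
print("ready for handoff:", ready_for_handoff(checklist))
```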

6. Operations handoff

  • DCIM asset records match labels and diagrams
  • Environmental and power alarms to NOC
  • Maintenance windows and vendor SLAs documented
  • Runbooks: escort, smart hands, emergency power-down
See references/operations_handoff.md.
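The first item, DCIM records matching labels and diagrams, can be checked by reconciling a DCIM export against a floor audit. The sketch below assumes both are available as mappings from asset tag to rack position; that data shape, the function name, and the sample tags are hypothetical and do not reflect any particular DCIM product's API.

```python
def reconcile(dcim_assets, audited_labels):
    """Compare two {asset_tag: rack_position} mappings.

    Returns tags missing on the floor, tags missing in DCIM, and tags
    present in both but recorded at different positions.
    """
    dcim_tags = set(dcim_assets)
    floor_tags = set(audited_labels)
    return {
        "missing_on_floor": sorted(dcim_tags - floor_tags),
        "missing_in_dcim": sorted(floor_tags - dcim_tags),
        "position_mismatch": sorted(
            tag for tag in dcim_tags & floor_tags
            if dcim_assets[tag] != audited_labels[tag]
        ),
    }

# Hypothetical export and audit results.
dcim = {"A1": "R01-U10", "A2": "R01-U12", "A3": "R02-U01"}
audit = {"A1": "R01-U10", "A2": "R01-U14", "A4": "R02-U05"}
print(reconcile(dcim, audit))
```

An empty result in all three lists is a reasonable acceptance criterion before the site is handed to operations.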

Output standards

  • Capacity table: rack ID, kW design, weight, network uplinks
  • One-line diagram summary (power and cooling) with margins noted
  • Commissioning checklist with pass/fail and owner
  • Risks: single points of failure, lead-time items, permit dependencies
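The capacity table and its margin note can be represented as a small data structure. The sketch below uses the four columns named above; the `RackRow` type, the hall power budget, and the sample rows are hypothetical values for illustration.

```python
from dataclasses import dataclass

@dataclass
class RackRow:
    rack_id: str
    kw_design: float
    weight_lbs: float
    uplinks: int

def capacity_summary(rows, hall_kw_budget):
    """Totals for the table plus remaining power margin against a budget."""
    total_kw = sum(r.kw_design for r in rows)
    return {
        "total_kw": total_kw,
        "margin_kw": hall_kw_budget - total_kw,
        "total_uplinks": sum(r.uplinks for r in rows),
        "heaviest_rack": max(rows, key=lambda r: r.weight_lbs).rack_id,
    }

# Hypothetical rows for a three-rack cage with a 40 kW budget.
rows = [
    RackRow("R01", kw_design=8.0, weight_lbs=1800, uplinks=4),
    RackRow("R02", kw_design=12.0, weight_lbs=2200, uplinks=4),
    RackRow("R03", kw_design=15.0, weight_lbs=2000, uplinks=8),
]
print("rack_id  kW   lbs   uplinks")
for r in rows:
    print(f"{r.rack_id:<8} {r.kw_design:<4} {r.weight_lbs:<5} {r.uplinks}")
print(capacity_summary(rows, hall_kw_budget=40.0))
```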

When to load references

  • Site, tier, capacity → references/facility_requirements.md
  • Power, UPS, cooling, PUE → references/power_cooling.md
  • Racks, aisles, containment → references/physical_layout.md
  • Carriers, MMR, hybrid links → references/network_connectivity.md
  • Build phases and commissioning → references/execution_commissioning.md
  • DCIM and ops → references/operations_handoff.md