scribe
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseZoom AI Services Scribe
Zoom AI Services Scribe
Background reference for Zoom AI Services Scribe across:
- synchronous single-file transcription ()
POST /aiservices/scribe/transcribe - asynchronous batch jobs ()
/aiservices/scribe/jobs* - browser microphone pseudo-streaming via repeated short file uploads
- webhook-driven batch status updates
- Build-platform JWT generation and credential handling
Official docs:
Zoom AI Services Scribe的背景参考,涵盖以下方面:
- 同步单文件转录 ()
POST /aiservices/scribe/transcribe - 异步批处理作业 ()
/aiservices/scribe/jobs* - 通过重复上传短文件实现的浏览器麦克风伪流
- Webhook驱动的批处理状态更新
- Build平台JWT生成与凭证处理
官方文档:
Routing Guardrail
路由规则
- If the user needs uploaded or stored media transcribed into text, route here first.
- If the user needs live meeting media without file-based upload/batch jobs, route to ../rtms/SKILL.md.
- If the user needs Zoom REST API inventory for AI Services paths, chain ../rest-api/SKILL.md.
- If the user needs webhook signature patterns or generic HMAC receiver hardening, optionally chain ../webhooks/SKILL.md.
- 如果用户需要将上传或存储的媒体转录为文本,优先路由到此处。
- 如果用户需要无需基于文件上传/批处理作业的实时会议媒体,路由到 ../rtms/SKILL.md。
- 如果用户需要AI服务路径的Zoom REST API清单,链式调用 ../rest-api/SKILL.md。
- 如果用户需要Webhook签名模式或通用HMAC接收器加固,可选择链式调用 ../webhooks/SKILL.md。
Quick Links
快速链接
- concepts/auth-and-processing-modes.md
- scenarios/high-level-scenarios.md
- examples/fast-mode-node.md
- examples/batch-webhook-pipeline.md
- references/api-reference.md
- references/environment-variables.md
- references/samples-validation.md
- references/versioning-and-drift.md
- troubleshooting/common-drift-and-breaks.md
- RUNBOOK.md
- concepts/auth-and-processing-modes.md
- scenarios/high-level-scenarios.md
- examples/fast-mode-node.md
- examples/batch-webhook-pipeline.md
- references/api-reference.md
- references/environment-variables.md
- references/samples-validation.md
- references/versioning-and-drift.md
- troubleshooting/common-drift-and-breaks.md
- RUNBOOK.md
Core Workflow
核心工作流
- Get Build-platform credentials and generate an HS256 JWT.
- Choose fast mode for one short file or batch mode for stored archives / large sets.
- Submit the transcription request.
- For batch jobs, poll job/file status or receive webhook notifications.
- Persist and post-process transcript JSON.
- 获取Build平台凭证并生成HS256 JWT。
- 单个短文件选择快速模式,存储归档/大量文件选择批处理模式。
- 提交转录请求。
- 对于批处理作业,轮询作业/文件状态或接收Webhook通知。
- 持久化并后处理转录JSON结果。
Hosted Fast-Mode Guardrail
托管快速模式注意事项
- The formal fast-mode API limits are and
100 MB, but hosted browser flows can still time out before the upstream response returns.2 hours - Current deployed-sample observations:
- ~17.2 MB MP4 completed in about
26s - ~38.6 MB MP4 completed in about
26-37s - ~59.2 MB MP4 completed in about on the backend
32-34s - some ~59.2 MB browser requests still surfaced as frontend while backend logs later showed
504200
- ~17.2 MB MP4 completed in about
- Treat frontend plus backend
504as a browser/edge timeout race, not an automatic transcription failure.200 - For hosted UIs, prefer an async request/polling wrapper for fast mode instead of holding the browser open for the full upstream response.
- For larger or less predictable media, prefer batch mode even when the file is still within the formal fast-mode size limit.
- 官方快速模式API限制为和
100 MB,但托管浏览器流可能在上游响应返回前就超时。2小时 - 当前已部署示例的观测数据:
- 约17.2 MB的MP4文件大约在内完成
26秒 - 约38.6 MB的MP4文件大约在内完成
26-37秒 - 约59.2 MB的MP4文件后端大约在内完成
32-34秒 - 部分约59.2 MB的浏览器请求仍然返回前端,但后端日志后续显示
504200
- 约17.2 MB的MP4文件大约在
- 将前端+后端
504的情况视为浏览器/边缘超时竞争,不要直接判定为转录失败。200 - 对于托管UI,优先为快速模式使用异步请求/轮询封装,而不是让浏览器一直等待完整的上游响应。
- 对于更大或不可预测的媒体,即使文件仍在官方快速模式大小限制内,也优先选择批处理模式。
Browser Microphone Pattern
浏览器麦克风实现模式
- does not expose a documented real-time streaming API surface.
scribe - If you want a browser microphone experience, use pseudo-streaming:
- capture microphone audio in short chunks
- upload each chunk through the async fast-mode wrapper
- poll for completion
- append chunk transcripts in sequence
- Recommended starting cadence:
- chunk size:
5 seconds - acceptable range:
5-10 seconds - in-flight chunk requests:
2-3
- chunk size:
- This is a practical UI pattern for incremental transcript updates, not a substitute for .
rtms - Treat this as a fallback demo pattern, not the preferred production architecture.
- It adds repeated upload overhead, chunk-boundary drift, browser codec/container variability, and transcript stitching complexity.
- If the user asks for actual live stream ingestion, low-latency continuous media, or server-push media transport, route to ../rtms/SKILL.md instead.
- 没有公开的官方实时流API接口。
scribe - 如果你需要浏览器麦克风体验,使用伪流方案:
- 将麦克风音频捕获为短分片
- 通过异步快速模式封装上传每个分片
- 轮询完成状态
- 按顺序拼接分片转录结果
- 推荐的初始参数:
- 分片大小:
5秒 - 可接受范围:
5-10秒 - 处理中的分片请求数:
2-3
- 分片大小:
- 这是实现增量转录更新的实用UI模式,不能替代。
rtms - 将其视为 fallback 演示模式,而不是首选的生产架构。
- 它会带来重复上传开销、分片边界偏移、浏览器编解码器/容器差异以及转录拼接复杂度。
- 如果用户需要真正的直播流摄入、低延迟连续媒体或服务器推送媒体传输,请路由到 ../rtms/SKILL.md。
Endpoint Surface
接口清单
| Mode | Method | Path | Use |
|---|---|---|---|
| Fast | | | Synchronous transcription for one file |
| Batch | | | Submit asynchronous batch job |
| Batch | | | List jobs |
| Batch | | | Inspect job summary/state |
| Batch | | | Cancel queued/processing job |
| Batch | | | Inspect per-file results |
| 模式 | 请求方法 | 路径 | 用途 |
|---|---|---|---|
| 快速模式 | | | 单文件同步转录 |
| 批处理 | | | 提交异步批处理作业 |
| 批处理 | | | 列出作业 |
| 批处理 | | | 查询作业概要/状态 |
| 批处理 | | | 取消队列中/处理中的作业 |
| 批处理 | | | 查询单文件结果 |
High-Level Scenarios
常见场景
- On-demand clip transcription after a user uploads one recording.
- Batch transcription of stored S3 call archives.
- Webhook-driven ETL pipeline that writes transcripts to your database/search index.
- Re-transcription of Zoom-managed recordings after exporting them to your own storage.
- Offline compliance or QA workflows that need timestamps, channel separation, and speaker hints.
- 用户上传单个录制文件后的按需剪辑转录。
- 存储在S3的通话归档文件的批量转录。
- Webhook驱动的ETL管道,将转录结果写入数据库/搜索索引。
- Zoom托管的录制文件导出到自有存储后的重新转录。
- 需要时间戳、声道分离和说话人提示的离线合规或QA工作流。
Chaining
链式调用
- Stored Zoom recordings -> ../rest-api/SKILL.md +
scribe - Webhook verification hardening -> ../webhooks/SKILL.md
- Real-time live transcript/media -> ../rtms/SKILL.md
- Cross-product routing -> ../general/SKILL.md
- 存储的Zoom录制文件 -> ../rest-api/SKILL.md +
scribe - Webhook验证加固 -> ../webhooks/SKILL.md
- 实时直播转录/媒体 -> ../rtms/SKILL.md
- 跨产品路由 -> ../general/SKILL.md
Operations
运维
- RUNBOOK.md - 5-minute preflight and debugging checklist.
- RUNBOOK.md - 5分钟预检查和调试清单。