videocut-subtitle

Compare original and translation side by side

🇺🇸

Original

English
🇨🇳

Translation

Chinese

字幕

Subtitle

转录 → 纠错 → 审核 → 匹配 → 烧录
Transcription → Correction → Review → Matching → Burning

流程

Workflow

1. 转录视频(Whisper)
2. 词典纠错 + 分句
3. 输出字幕稿(纯文本,一句一行)
【用户审核修改】
4. 用户给回修改后的文本
5. 我匹配时间戳 → 生成 SRT
6. 烧录字幕(FFmpeg)
1. 转录视频(Whisper)
2. 词典纠错 + 分句
3. 输出字幕稿(纯文本,一句一行)
【用户审核修改】
4. 用户给回修改后的文本
5. 我匹配时间戳 → 生成 SRT
6. 烧录字幕(FFmpeg)

转录

Transcription

使用 OpenAI Whisper 模型进行语音转文字:
bash
whisper video.mp4 --model medium --language zh --output_format json
模型用途
medium
默认,平衡速度与准确率
large-v3
高精度,较慢
输出 JSON 包含逐词时间戳,用于后续 SRT 生成。

Use OpenAI Whisper model for speech to text conversion:
bash
whisper video.mp4 --model medium --language zh --output_format json
ModelUse Case
medium
Default, balances speed and accuracy
large-v3
High precision, slower
The output JSON contains word-level timestamps for subsequent SRT generation.

字幕规范

Subtitle Specifications

规则说明
一屏一行不换行,不堆叠
≤15字/行超过15字必须拆分(4:3竖屏)
句尾无标点
你好
不是
你好。
句中保留标点
先点这里,再点那里

RuleDescription
One line per screenNo line breaks, no stacking
≤15 Chinese characters per lineMust be split if exceeding 15 characters (4:3 vertical screen)
No punctuation at the end of sentences
Hello
instead of
Hello.
Punctuation is retained within sentences
Click here first, then click there

词典纠错

Dictionary Correction

读取
词典.txt
,每行一个正确写法:
skills
Claude
iPhone
我自动识别变体:
claude
Claude

Read
dictionary.txt
, each line records a correct spelling:
skills
Claude
iPhone
I automatically identify variants:
claude
Claude

字幕稿格式

Subtitle Draft Format

我给用户的(纯文本,≤15字/行):
今天给大家分享一个技巧
很多人可能不知道
其实这个功能
藏在设置里面
你只要点击这里
就能看到了
用户修改后给回我,我再匹配时间戳生成 SRT。

What I send to users (plain text, ≤15 characters per line):
Today I will share a tip with you
Many people may not know
Actually this feature
Is hidden in settings
You just need to click here
And you will see it
Users send back the modified text to me, then I match timestamps to generate SRT.

样式

Style

默认:24号白字、黑色描边、底部居中
可选样式:
样式说明
默认白字黑边
黄字黄字黑边(醒目)
用户可说:
  • "字大一点" → 32号
  • "放顶部" → 顶部居中
  • "黄色字幕" → 黄字黑边

Default: 24pt white text, black stroke, bottom centered
Optional styles:
StyleDescription
DefaultWhite text with black border
Yellow textYellow text with black border (eye-catching)
Users can request:
  • "Larger font" → 32pt
  • "Put on top" → Top centered
  • "Yellow subtitles" → Yellow text with black border

输出

Output

01-xxx_字幕稿.txt   # 纯文本,用户编辑
01-xxx.srt          # 字幕文件
01-xxx-字幕.mp4     # 带字幕视频
01-xxx_subtitle_draft.txt   # Plain text, for user editing
01-xxx.srt          # Subtitle file
01-xxx-with_subtitles.mp4     # Video with embedded subtitles