`agent-media audio transcribe --in <path> [options]`

| Option | Required | Description |
|---|---|---|
| `--in <path>` | Yes | Input audio file path or URL (supports mp3, wav, m4a, ogg) |
| `--diarize` | No | Enable speaker identification |
| `--language` | No | Language code (auto-detected if not provided) |
| `--speakers` | No | Number of speakers hint for diarization |
| | No | Output path, filename or directory (default: ./) |
| `--provider` | No | Provider to use (local, fal, replicate, runpod) |
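
The options above can be assembled programmatically. The following is a minimal Python sketch, not part of the tool itself: `build_transcribe_args` is a hypothetical helper name, and it only uses flags that appear in the usage line and examples of this page (`--in`, `--diarize`, `--language`, `--speakers`, `--provider`). The output-path option is left out because its flag name is not given here.

```python
from typing import List, Optional


def build_transcribe_args(
    in_path: str,
    diarize: bool = False,
    language: Optional[str] = None,
    speakers: Optional[int] = None,
    provider: Optional[str] = None,
) -> List[str]:
    """Build an argv list for `agent-media audio transcribe` (hypothetical helper)."""
    args = ["agent-media", "audio", "transcribe", "--in", in_path]
    if diarize:
        args.append("--diarize")  # enable speaker identification
    if language is not None:
        args += ["--language", language]  # omit to rely on auto-detection
    if speakers is not None:
        args += ["--speakers", str(speakers)]  # hint for diarization
    if provider is not None:
        args += ["--provider", provider]  # local, fal, replicate, or runpod
    return args


# Reproduces one of the example invocations as a single command line.
print(" ".join(build_transcribe_args("podcast.mp3", diarize=True, language="en", speakers=3)))
```

Returning a list rather than a string means the result can be handed directly to `subprocess.run` without shell quoting concerns.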

Example output:

    {
      "ok": true,
      "media_type": "audio",
      "action": "transcribe",
      "provider": "fal",
      "output_path": "transcription_123_abc.json",
      "transcription": {
        "text": "Full transcription text...",
        "language": "en",
        "segments": [
          { "start": 0.0, "end": 2.5, "text": "Hello.", "speaker": "SPEAKER_0" },
          { "start": 2.5, "end": 5.0, "text": "Hi there.", "speaker": "SPEAKER_1" }
        ]
      }
    }
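
A result shaped like the sample output can be consumed with a few lines of Python. This is a sketch under assumptions: `format_transcript` and `speaking_time` are hypothetical helper names, the field names are taken from the sample above, and treating a missing `speaker` field as possible (for runs without diarization) is a guess rather than documented behavior.

```python
import json
from typing import Dict, List

# Sample result copied from the output format shown above.
SAMPLE = """
{
  "ok": true,
  "media_type": "audio",
  "action": "transcribe",
  "provider": "fal",
  "output_path": "transcription_123_abc.json",
  "transcription": {
    "text": "Full transcription text...",
    "language": "en",
    "segments": [
      { "start": 0.0, "end": 2.5, "text": "Hello.", "speaker": "SPEAKER_0" },
      { "start": 2.5, "end": 5.0, "text": "Hi there.", "speaker": "SPEAKER_1" }
    ]
  }
}
"""


def load_segments(raw: str) -> List[dict]:
    """Parse the JSON result and return its segments, failing if `ok` is not true."""
    result = json.loads(raw)
    if not result.get("ok"):
        raise ValueError("result does not report ok: true")
    return result["transcription"]["segments"]


def format_transcript(raw: str) -> List[str]:
    """Render one line per segment: time range, speaker label, text."""
    lines = []
    for seg in load_segments(raw):
        speaker = seg.get("speaker", "UNKNOWN")  # assumption: may be absent
        lines.append(f"[{seg['start']:.1f}-{seg['end']:.1f}] {speaker}: {seg['text']}")
    return lines


def speaking_time(raw: str) -> Dict[str, float]:
    """Sum segment durations in seconds per speaker label."""
    totals: Dict[str, float] = {}
    for seg in load_segments(raw):
        speaker = seg.get("speaker", "UNKNOWN")
        totals[speaker] = totals.get(speaker, 0.0) + (seg["end"] - seg["start"])
    return totals


for line in format_transcript(SAMPLE):
    print(line)
```

Checking `ok` before reading `transcription` keeps a failed run from surfacing as a confusing `KeyError` later.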

Examples:

- `agent-media audio transcribe --in interview.mp3`
- `agent-media audio transcribe --in meeting.wav --diarize`
- `agent-media audio transcribe --in podcast.mp3 --diarize --language en --speakers 3`
- `agent-media audio transcribe --in audio.wav --provider replicate`

Providers:

- local: message `mutex lock failed`; output field `"ok": true`; example: `agent-media audio transcribe --in audio.mp3 --provider local`
- fal: API key `FAL_API_KEY`; models `wizper`, `whisper`
- replicate: API token `REPLICATE_API_TOKEN`; model `whisper-diarization`
- runpod: API key `RUNPOD_API_KEY`; model `pruna/whisper-v3-large`; example: `agent-media audio transcribe --in audio.mp3 --provider runpod`
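
The provider credentials listed above suggest a simple pre-flight check before choosing a `--provider` value. The sketch below is illustrative only: `usable_providers` is a hypothetical helper, reading the credential names from environment variables is an assumption, and so is treating `local` as needing no credential (none is listed for it here).

```python
import os
from typing import List, Mapping

# Credential names as they appear on this page. Reading them from the
# environment is an assumption, not confirmed behavior of the tool.
PROVIDER_CREDENTIALS = {
    "fal": "FAL_API_KEY",
    "replicate": "REPLICATE_API_TOKEN",
    "runpod": "RUNPOD_API_KEY",
}


def usable_providers(env: Mapping[str, str]) -> List[str]:
    """Return providers whose credential is set and non-empty in `env`.

    `local` is always included, on the assumption that it needs no credential.
    """
    providers = ["local"]
    for name, variable in PROVIDER_CREDENTIALS.items():
        if env.get(variable):
            providers.append(name)
    return providers


print(usable_providers(os.environ))
```

Passing the environment as a parameter, instead of reading `os.environ` inside the function, keeps the check easy to exercise with a plain dictionary.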