Loading...
Loading...
Generate (TTS), Transcribe (STT), and Clone voices using Google's GenAI and Cloud Speech SDKs. Supports Gemini-TTS, Chirp 3, and Instant Custom Voice.
npx skill4agent add cnemri/google-genai-skills speech-useuvGOOGLE_API_KEYGOOGLE_CLOUD_PROJECTGOOGLE_APPLICATION_CREDENTIALStexttospeech.googleapis.comspeech.googleapis.comuv run skills/speech-use/scripts/generate_speech.py "Hello world, this is a test." --voice Puck --output hello.wavuv run skills/speech-use/scripts/generate_speech.py "This is my custom voice speaking." --voice-cloning-key "YOUR_KEY_HERE" --output custom.wavvoiceCloningKeyreference.wavconsent.wavuv run skills/speech-use/scripts/create_custom_voice.py --reference-audio reference.wav --consent-audio consent.wavgenerate_speech.pyuv run skills/speech-use/scripts/transcribe_audio.py audio.wav --language en-US --output transcript.txt--voiceKorePuckFenrirAoede--voice-cloning-keycreate_custom_voice.py--modelgemini-2.5-flash-preview-tts--modelchirp_3--languageauto--locationusBefore running scripts, review the reference guides for available voices and options.