Search Results: qwen

Found 104 Skills

AI & Machine Learning | thepexcel/agent-skills

art-director

Creates professional AI image/video prompts with a photographer's and cinematographer's eye. Specializes in composition, lighting, color grading, and storytelling. Use when generating AI images/videos with artistic vision or working with models like Nano Banana Pro, Qwen, Sora2, and Wan 2.2. For graphic design work (thumbnails, banners, layouts), use /graphic-designer instead.

AI & Machine Learning | pixverseai/skills

pixverse-ai-image-and-video-generator

PixVerse CLI — generate AI videos and images from the command line. Supports PixVerse, Veo, Sora, Kling, Hailuo, Wan, and more video models; Nano Banana (Gemini), Seedream, Qwen image models; and PixVerse's rich effect template library. Start here.

AI & Machine Learning | iamzhihuix/happy-claude-s...

happy-audio-gen

Universal AI voice / text-to-speech skill supporting OpenAI TTS (gpt-4o-mini-tts, tts-1), ElevenLabs multilingual TTS with voice cloning, Bailian Qwen TTS (qwen-tts / qwen3-tts-vd with voice-design custom voices, long-text chunking built in), MiniMax speech-02-hd, SiliconFlow CosyVoice / SenseVoice, and PlayHT 2.0. Use this skill whenever the user asks to read text aloud, synthesize speech, generate narration, create voice-over, dub a script, or turn any text into audio (mp3 / wav / ogg / flac). Typical phrases include "read this aloud", "generate voice for ...", "create a narration of ...", "tts this", "把这段念出来" ("read this passage aloud"), "做个配音" ("make a voice-over"), "合成语音" ("synthesize speech"), or mentions of voices / TTS model names like Alloy, Ash, Cherry, Rachel, CosyVoice, PlayHT. Always use this skill even if the user does not specify a provider: pick one from EXTEND.md defaults or available env keys.

10 scripts/Attention

Testing & QA | cinience/alicloud-skills

alicloud-ai-audio-tts-realtime-test

Minimal realtime TTS smoke test for Model Studio Qwen TTS Realtime.

AI & Machine Learning | aliyun/alibabacloud-aiops...

alibabacloud-avatar-video

Use Alibaba Cloud DashScope API and LingMou to generate AI video and speech. Seven capabilities — (1) LivePortrait talking-head (image + audio → video, two-step), (2) EMO talking-head, (3) AA/AnimateAnyone full-body animation (three-step), (4) T2I text-to-image (Wan 2.x, default wan2.2-t2i-flash), (5) I2V image-to-video (Wan 2.x, default wan2.7-i2v-flash, supports T2I→I2V pipeline), (6) Qwen TTS (auto model/voice by scene, default qwen3-tts-vd-realtime-2026-01-15), (7) LingMou digital-human template video with random template, public-template copy, and script confirmation. Trigger when the user needs talking-head, portrait, full-body animation, text-to-image, text-to-video, or speech synthesis.

9 scripts/Attention

AI & Machine Learning | nvidia/skills

perf-moe-long-context

Long-context MoE training guidance for Megatron Bridge. Covers CP sizing, selective recompute, dispatcher choices, and practical patterns from DSV3, Qwen3, and Qwen3-Next long-context experiments.

AI & Machine Learning | ob-labs/agentseek

langchain-dev-guide

LangChain / LangGraph engineering pitfalls and verified fixes. Covers DeepAgents, OpenAI-compatible model integration (including Chinese provider adapters: DeepSeek, Qwen, GLM, etc.), middleware, streaming, multi-agent orchestration, and other common development issues. Use when hitting unexpected behavior, making architecture decisions, or integrating Chinese LLM providers during LangChain development.

1 scripts/Checked

AI & Machine Learning | countbot-ai/countbot

image-analysis

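Image analysis and recognition. Can analyze local images, web images, videos, and files. Suited to OCR, object recognition, scene understanding, and similar tasks. This skill must be used whenever the user sends an image or asks for an image to be analyzed.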

2 scripts/Checked