Search Results: gemini

Found 644 Skills

video-transcripts

Generate structured video transcripts from local files or video URLs using Gemini Files API. Use when a GitHub or Linear tracker item, comment, or attachment includes a screen recording, .mov, .mp4, or tracker-hosted video and you need a <video-transcripts> block instead of hand-written notes.

🇺🇸|EnglishTranslated

1 scripts/Attention

AI & Machine Learningresciencelab/opc-skills

nanobanana

Generate and edit images using Google Gemini 3 Pro Image (Nano Banana Pro). Supports text-to-image, image editing, various aspect ratios, and high-resolution output (2K/4K).

🇺🇸|EnglishTranslated

2 scripts/Checked

AI & Machine Learningskindhu/skind-skills

us-stock-researcher

US Stock Investment Research Assistant. Supports Gemini Deep Research or Claude Native Deep Research (7-Phase + GoT). Use when analyzing 10-K/10-Q reports or generating investment research reports.

🇺🇸|EnglishTranslated

3 scripts/Attention

AI & Machine Learningbinhmuc/autobot-review

ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (better image analysis than Claude models, captioning, reasoning, object detection, design extraction, OCR, visual Q&A, segmentation, handle multiple images), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image with Imagen 4, editing, composition, refinement), generate videos (text-to-video with Veo 3, 8-second clips with native audio). Use when working with audio/video files, analyzing images or screenshots (instead of default vision capabilities of Claude, only fallback to Claude's vision capabilities if needed), processing PDF documents, extracting structured data from media, creating images/videos from text prompts, or implementing multimodal AI features. Supports Gemini 3/2.5, Imagen 4, and Veo 3 models with context windows up to 2M tokens.

🇺🇸|EnglishTranslated

7 scripts/Attention

AI & Machine Learningjohnlindquist/claude

think

Deep multi-framework reasoning using Gemini. Use for complex problem analysis, challenging ideas, and evaluating multiple options with structured thinking.

🇺🇸|EnglishTranslated

AI & Machine Learningsundial-org/awesome-openc...

antigravity-image-gen

Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningyelban/orz99-skills

cine-shot

電影感 AI 繪圖提示詞生成器。根據使用者的場景描述，自動選配攝影機模組、光影預設，產出 Midjourney 與 Gemini 3 Pro 雙平台格式的 cinematic prompt。

🇺🇸|EnglishTranslated

AI & Machine Learningdaiki48/dotfiles

quad-fact-check

Cross fact-checking with 4 models: Claude + Gemini + Codex. Conduct independent checks with Opus itself, Gemini Flash, Gemini Pro, and Codex (gpt-5.3-codex) → extract issues → discuss → output a consensus report.

🇨🇳|ChineseTranslated

AI & Machine Learningwlzh/skills

image-generator

通用图片生成 Skill，支持多种 AI 模型（ModelScope、Gemini 等），可被其他 Skills 调用

🇺🇸|EnglishTranslated

2 scripts/Checked

Tools & Utilitiestddworks/asc-cli-skills

asc-app-shots

App Store screenshot generation skill with two workflows: (A) AI-powered: fetches app metadata via `asc` CLI, analyzes screenshots with Claude vision, writes a ScreenPlan JSON, then generates final marketing screenshots via Gemini (`asc app-shots generate`), and optionally translates them (`asc app-shots translate`). (B) HTML-based (deterministic): writes a CompositionPlan JSON with precise device placement, text overlays, and backgrounds, then runs `asc app-shots html` to produce a self-contained HTML page with real device mockup frames and client-side PNG export — no AI needed. Use this skill when: (1) User asks to "create App Store screenshots" or "generate screenshot plan" (2) User asks to "make an HTML screenshot page" or "compose screenshots with mockups" (3) User mentions "asc-app-shots", "app-shots html", "composition plan", or screenshot marketing (4) User wants deterministic, reproducible screenshot layouts with device mockups (5) User wants AI-generated screenshots via Gemini

🇺🇸|EnglishTranslated

AI & Machine Learningjamditis/claude-skills-jo...

nano-banana-image-gen

Use when generating images with Gemini models, choosing between Nano Banana 1/2/Pro, optimizing image generation costs, writing image prompts, or needing visual grounding with real-world reference images

🇺🇸|EnglishTranslated

AI & Machine Learningnansen-ai/nansen-cli

nansen-web-fetch

Fetch and analyze content from one or more URLs using AI (Gemini 2.5 Flash). Use when you have specific URLs and need to extract or summarize their content. Pairs well with `nansen web search` results.

🇺🇸|EnglishTranslated