Search Results: gemini

Found 484 Skills

gemini

Execute Gemini CLI for AI-powered code analysis and generation. Use when you need to leverage Google's Gemini models for complex reasoning tasks.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningbinhmuc/autobot-review

ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (better image analysis than Claude models, captioning, reasoning, object detection, design extraction, OCR, visual Q&A, segmentation, handle multiple images), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image with Imagen 4, editing, composition, refinement), generate videos (text-to-video with Veo 3, 8-second clips with native audio). Use when working with audio/video files, analyzing images or screenshots (instead of default vision capabilities of Claude, only fallback to Claude's vision capabilities if needed), processing PDF documents, extracting structured data from media, creating images/videos from text prompts, or implementing multimodal AI features. Supports Gemini 3/2.5, Imagen 4, and Veo 3 models with context windows up to 2M tokens.

🇺🇸|EnglishTranslated

7 scripts/Attention

AI & Machine Learningdelneg/nano-banana-pro-sk...

gemini3pro

Query Google Gemini 3 Pro via grsai.com API for text generation and image analysis. Use for text generation, Q&A, summarization, code generation, creative writing, image analysis/vision, complex reasoning, and structured document generation. Triggers on "ask gemini", "use gemini", "query gemini", "analyze this image with gemini", or when a second opinion from another LLM is needed. Optionally accepts an image input for vision tasks.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningaia-11-hn-mib/mib-mockint...

gemini-document-processing

Guide for implementing Google Gemini API document processing - analyze PDFs with native vision to extract text, images, diagrams, charts, and tables. Use when processing documents, extracting structured data, summarizing PDFs, answering questions about document content, or converting documents to structured formats. (project)

🇺🇸|EnglishTranslated

AI & Machine Learningyelban/orz99-skills

cine-shot

電影感 AI 繪圖提示詞生成器。根據使用者的場景描述，自動選配攝影機模組、光影預設，產出 Midjourney 與 Gemini 3 Pro 雙平台格式的 cinematic prompt。

🇺🇸|EnglishTranslated

AI & Machine Learningmelodic-software/claude-c...

gemini-token-optimization

Optimize token usage when delegating to Gemini CLI. Covers token caching, batch queries, model selection (Flash vs Pro), and cost tracking. Use when planning bulk Gemini operations.

🇺🇸|EnglishTranslated

AI & Machine Learningdaiki48/dotfiles

quad-fact-check

Cross fact-checking with 4 models: Claude + Gemini + Codex. Conduct independent checks with Opus itself, Gemini Flash, Gemini Pro, and Codex (gpt-5.3-codex) → extract issues → discuss → output a consensus report.

🇨🇳|ChineseTranslated

AI & Machine Learninggoogle-gemini/gemini-skil...

vertex-ai-api-dev

Guides the usage of Gemini API on Google Cloud Vertex AI with the Gen AI SDK. Use when the user asks about using Gemini in an enterprise environment or explicitly mentions Vertex AI. Covers SDK usage (Python, JS/TS, Go, Java, C#), capabilities like Live API, tools, multimedia generation, caching, and batch prediction.

🇺🇸|EnglishTranslated

Tools & Utilitiesyeachan-heo/oh-my-claudec...

ask-gemini

Ask Gemini via local CLI and capture a reusable artifact

🇺🇸|EnglishTranslated

Tools & Utilitiesykdojo/claude-code-tips

reddit-fetch

Fetch content from Reddit using Gemini CLI when WebFetch is blocked. Use when accessing Reddit URLs, researching topics on Reddit, or when Reddit returns 403/blocked errors.

🇺🇸|EnglishTranslated

Tools & Utilitiestddworks/asc-cli-skills

asc-app-shots

App Store screenshot generation skill with two workflows: (A) AI-powered: fetches app metadata via `asc` CLI, analyzes screenshots with Claude vision, writes a ScreenPlan JSON, then generates final marketing screenshots via Gemini (`asc app-shots generate`), and optionally translates them (`asc app-shots translate`). (B) HTML-based (deterministic): writes a CompositionPlan JSON with precise device placement, text overlays, and backgrounds, then runs `asc app-shots html` to produce a self-contained HTML page with real device mockup frames and client-side PNG export — no AI needed. Use this skill when: (1) User asks to "create App Store screenshots" or "generate screenshot plan" (2) User asks to "make an HTML screenshot page" or "compose screenshots with mockups" (3) User mentions "asc-app-shots", "app-shots html", "composition plan", or screenshot marketing (4) User wants deterministic, reproducible screenshot layouts with device mockups (5) User wants AI-generated screenshots via Gemini

🇺🇸|EnglishTranslated

AI & Machine Learningjamditis/claude-skills-jo...

nano-banana-image-gen

Use when generating images with Gemini models, choosing between Nano Banana 1/2/Pro, optimizing image generation costs, writing image prompts, or needing visual grounding with real-world reference images

🇺🇸|EnglishTranslated