Search Results: gemini

Found 644 Skills

Tools & Utilitiesagricidaniel/claude-blog

blog-audio

Generate audio narration of blog posts using Google Gemini TTS. Supports summary narration, full article read-aloud, and two-speaker podcast/dialogue mode with 30 voice options. Outputs MP3 with HTML5 audio embed code. Works standalone via /blog audio or internally from blog-write. Falls back gracefully when API key is not configured. Use when user says "blog audio", "narrate blog", "audio version", "text to speech", "tts", "podcast mode", "read aloud", "audio narration", "voice", "narration", "generate audio".

🇺🇸|EnglishTranslated

4 scripts/Checked

AI & Machine Learningalirezarezvani/claude-ski...

cross-eval

/cs:cross-eval <memo> — Multi-model consensus on a board memo or strategy brief. Claude + Codex + Gemini cross-review with graceful degradation.

🇺🇸|EnglishTranslated

AI & Machine Learningivanleomk/aie-workshop-20...

prompt_to_production

A complete workshop curriculum for building an agentic application using the Gemini Interactions API. Guides the user from basic API calls to a full production coding agent.

🇺🇸|EnglishTranslated

13 scripts/Attention

AI & Machine Learningmrgoonie/claudekit-skills

ai-multimodal

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (captioning, object detection, OCR, visual Q&A, segmentation), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image, editing, composition, refinement). Use when working with audio/video files, analyzing images or screenshots, processing PDF documents, extracting structured data from media, creating images from text prompts, or implementing multimodal AI features. Supports multiple models (Gemini 2.5/2.0) with context windows up to 2M tokens.

🇺🇸|EnglishTranslated

6 scripts/Attention

AI & Machine Learningjulianobarbosa/claude-cod...

consulting-design

Consult Gemini AI for architecture alternatives, design trade-offs, and brainstorming. Use when seeking different perspectives on design, evaluating architectural approaches, comparing solutions, or generating creative ideas.

🇺🇸|EnglishTranslated

AI & Machine Learningdnyoussef/context-cascade

multi-model-discovery

Use Gemini to find existing solutions before building from scratch. Leverages Google Search grounding to discover code examples, libraries, and best practices to avoid reinventing the wheel.

🇺🇸|EnglishTranslated

AI & Machine Learningnicepkg/ai-workflow

nano-banana

REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.

🇺🇸|EnglishTranslated

AI & Machine Learningcluesmith/codev

generate-image

AI image generation CLI using Gemini. Use when generating images, checking syntax for resolution, aspect ratio, and reference image options.

🇺🇸|EnglishTranslated

AI & Machine Learningsamuraigpt/generative-med...

muapi-nano-banana

Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningintellectronica/agent-ski...

nano-banana-2

Generate and edit images using Google's Nano Banana 2 (Gemini 3.1 Flash Image Preview) API. This skill should be used when the user asks to create or modify images, especially when they need fast iteration, explicit aspect-ratio control, or resolution control from 512px to 4K.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningdinghuanghao/openword

openword-player

Operate OpenWord end-to-end for live adventure sessions. Use when Codex needs to download/install/start OpenWord, guide a human player in the browser, or play autonomously through REST API (create/load game, do_action loop, state/image retrieval), including configuring GEMINI_API_KEY and sharing interesting scenes and choices during play.

🇺🇸|EnglishTranslated

1 scripts/Attention

Automationsupercent-io/skills-templ...

ralphmode

Configure Claude Code, Codex CLI, and Gemini CLI for Ralph-style automation with fewer approval prompts while keeping project boundaries, secret denylists, and sandbox-first safety rules intact.

🇺🇸|EnglishTranslated