Search Results: gemini-api

Found 35 Skills

AI & Machine Learningbinhmuc/autobot-review

ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up to 9.5 hours), understand images (better image analysis than Claude models, captioning, reasoning, object detection, design extraction, OCR, visual Q&A, segmentation, handle multiple images), process videos (scene detection, Q&A, temporal analysis, YouTube URLs, up to 6 hours), extract from documents (PDF tables, forms, charts, diagrams, multi-page), generate images (text-to-image with Imagen 4, editing, composition, refinement), generate videos (text-to-video with Veo 3, 8-second clips with native audio). Use when working with audio/video files, analyzing images or screenshots (instead of default vision capabilities of Claude, only fallback to Claude's vision capabilities if needed), processing PDF documents, extracting structured data from media, creating images/videos from text prompts, or implementing multimodal AI features. Supports Gemini 3/2.5, Imagen 4, and Veo 3 models with context windows up to 2M tokens.

🇺🇸|EnglishTranslated

7 scripts/Attention

AI & Machine Learningfeedtailor/ccskill-nanoba...

nano-banana-pro

Skill for image generation. Uses Google Nano Banana Pro (Gemini 3 Pro Image) API to generate high-quality images. Supports logos, infographics, illustrations, photorealistic images, and more.

🇺🇸|EnglishTranslated

AI & Machine Learninggoogle-gemini/gemini-skil...

vertex-ai-api-dev

Guides the usage of Gemini API on Google Cloud Vertex AI with the Gen AI SDK. Use when the user asks about using Gemini in an enterprise environment or explicitly mentions Vertex AI. Covers SDK usage (Python, JS/TS, Go, Java, C#), capabilities like Live API, tools, multimedia generation, caching, and batch prediction.

🇺🇸|EnglishTranslated

AI & Machine Learningyhx0516/ai-toolkits

sofunny-image

Generate or edit images using Gemini's native `generateContent` via New-API. Suitable for scenarios requiring text-to-image generation, reference image editing, local PNG output, and those who want to reuse the `.sofunny-image.env` file or current shell environment variables.

🇨🇳|ChineseTranslated

1 scripts/Attention

AI & Machine Learningdcaigc/dc-skills

gemini-infographic

Generate infographic images from user descriptions using Gemini API (Nano Banana Pro). Converts natural language descriptions into structured infographic prompts, then calls Gemini image generation to produce PNG images. Supports 11 visual styles (sketchnote, kawaii, professional, scientific, anime, claymation, editorial, storyboard, bento grid, bricks), 3 orientations (landscape/portrait/square), 3 detail levels (brief/standard/detailed), and multiple languages. Use when user asks to create infographics, generate visual summaries, make data visualizations, or produce illustrated explanations. Trigger words include 信息图, infographic, 生成图, 可视化, visual summary, data visualization.

🇺🇸|EnglishTranslated

1 scripts/Attention

Marketing & Growthsharadchaturveda-coder/ag...

agency-carousel-growth-engine

Autonomous TikTok and Instagram carousel generation specialist. Analyzes any website URL with Playwright, generates viral 6-slide carousels via Gemini image generation, publishes directly to feed via Upload-Post API with auto trending music, fetches analytics, and iteratively improves through a data-driven learning loop.

🇺🇸|EnglishTranslated

Marketing & Growthdev-dennis-040/openclaw-a...

marketing-carousel-growth-engine

You are an autonomous growth machine that turns any website into viral TikTok and Instagram carousels. You think in 6-slide narratives, obsess over hook psychology, and let data drive every creativ...

🇺🇸|EnglishTranslated

AI & Machine Learninggoogle-antigravity/antigr...

google-antigravity-sdk

Design, implement, and debug autonomous AI agents and multi-agent systems using the Google Antigravity (AGY) SDK. ACTIVATE this skill when the user wants to create, configure, or orchestrate Google Antigravity agents.

🇺🇸|EnglishTranslated

AI & Machine Learningjohnlindquist/claude

brainstorm

Generate ideas and explore possibilities with AI. Use for creative problem solving, generating alternatives, and expanding on concepts.

🇺🇸|EnglishTranslated

AI & Machine Learningaaaaqwq/claude-code-skill...

google-web-search

Enables grounded question answering by automatically executing the Google Search tool within Gemini models. Use when the required information is recent (post knowledge cutoff) or requires verifiable citation.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningjezweb/claude-skills

ai-image-generator

Generate AI images using Gemini or GPT APIs directly. Covers model selection (Gemini for scenes, GPT for transparent icons), the 5-part prompting framework, API calling patterns, multi-turn editing, and quality assurance. Produces photorealistic scenes, icons, illustrations, OG images, and product shots. Use when building websites that need images, creating marketing assets, or generating visual content. Triggers: 'generate image', 'ai image', 'create hero image', 'make an icon', 'generate illustration', 'create og image', 'ai art', 'image generation'.

🇺🇸|EnglishTranslated

Tools & Utilitiespedronauck/skills

viz

Transforms content (URLs, uploaded documents, pasted text, meeting transcripts) into professional visualizations across four output modes. Accepts a mode argument or a keyword trigger in the user message. Mode "diagram" produces an Excalidraw diagram via Excalidraw:create_view. Mode "infographic" generates a Swiss Pulse PNG via the Gemini image-generation API. Mode "visualize" renders an inline Visualizer widget (SVG or HTML) via visualize:show_widget. Mode "publish" ships an interactive Swiss Pulse HTML visual to HeyGenverse via HeyGenverse:create_app and returns a shareable link. Keywords that activate the skill: "diagram it", "excalidraw this", "draw a diagram of this", "nano this", "vis it", "ver it", "hey it", "heygenverse this". Do not use for plain-text summaries, code explanations, prose responses, or generic chat visualizations without a chosen output format.

🇺🇸|EnglishTranslated