Search Results: model-comparison

Found 8 Skills

AI & Machine Learningdatocms/agent-skills

eval-triggers

Run the trigger evaluation pipeline — classify, analyze, and optionally compare against a baseline. Only run when explicitly asked — evals are expensive.

🇺🇸|EnglishTranslated

AI & Machine Learninghuggingface/skills

huggingface-best

Use when the user asks about finding the best, top, or recommended model for a task, wants to know what AI model to use, or wants to compare models by benchmark scores. Triggers on: "best model for X", "what model should I use for", "top models for [task]", "which model runs on my laptop/machine/device", "recommend a model for", "what LLM should I use for", "compare models for", "what's state of the art for", or any question about choosing an AI model for a specific use case. Always use this skill when the user wants model recommendations or comparisons, even if they don't explicitly mention HuggingFace or benchmarks.

🇺🇸|EnglishTranslated

AI & Machine Learninghoodini/ai-agents-skills

nano-banana-pro

Generate images with Google's Nano Banana Pro (Gemini 3 Pro Image). Use when generating AI images via Gemini API, creating professional visuals, or building image generation features. Triggers on Nano Banana Pro, Gemini 3 Pro Image, gemini-3-pro-image-preview, Google image generation.

🇺🇸|EnglishTranslated

AI & Machine Learningreplicate/skills

compare-models

Compare Replicate models by cost, speed, quality, and capabilities.

🇺🇸|EnglishTranslated

AI & Machine Learningopenrouterteam/skills

openrouter-models

Query OpenRouter for available AI models, pricing, capabilities, throughput, and provider performance. Use when the user asks about available OpenRouter models, model pricing, model context lengths, model capabilities, provider latency or uptime, throughput limits, supported parameters, wants to search/filter/compare models, or find the fastest provider for a model.

🇺🇸|EnglishTranslated

6 scripts/Attention

AI & Machine Learningeyadsibai/ltk

nemo-evaluator

Use when evaluating LLMs, running benchmarks like MMLU/HumanEval/GSM8K, setting up evaluation pipelines, or asking about "NeMo Evaluator", "LLM benchmarking", "model evaluation", "MMLU", "HumanEval", "GSM8K", "benchmark harnesses"

🇺🇸|EnglishTranslated

AI & Machine Learninglebsral/dspy-programming-...

ai-tracking-experiments

Track which optimization experiment was best. Use when you've run multiple optimization passes, need to compare experiments, want to reproduce past results, need to pick the best prompt configuration, track experiment costs, manage optimization artifacts, decide which optimized program to deploy, or justify your choice to stakeholders. Covers experiment logging, comparison, and promotion to production.

🇺🇸|EnglishTranslated

AI & Machine Learningshiqkuangsan/oh-my-daily-...

tooyoung:easy-openrouter

Quickly test and compare LLM models via OpenRouter. Find the fastest/cheapest model, compare response quality. Trigger words: openrouter, test model, compare models, find fastest model, find cheapest model

🇺🇸|EnglishTranslated

1 scripts/Attention