Search Results: llm

Found 1,211 Skills

architecture-review

Staff-level codebase health review. Finds monolithic modules, silent failures, type safety gaps, test coverage holes, and LLM-friendliness issues.

🇺🇸|EnglishTranslated

AI & Machine Learningblockrunai/clawrouter

clawrouter

Smart LLM router — save 78% on inference costs. Routes every request to the cheapest capable model across 30+ models from OpenAI, Anthropic, Google, DeepSeek, and xAI.

🇺🇸|EnglishTranslated

AI & Machine Learningpluginagentmarketplace/cu...

data-engineering

Data engineering, machine learning, AI, and MLOps. From data pipelines to production ML systems and LLM applications.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningkriscard/kriscard-claude-...

prompt-engineer

AI/LLM: Use when crafting system prompts, optimizing LLM outputs, or improving agent instructions. NOT for general coding.

🇺🇸|EnglishTranslated

AI & Machine Learningjezweb/claude-skills

mcp-builder

Build MCP servers in Python with FastMCP. Workflow: define tools and resources, build server, test locally, deploy to FastMCP Cloud or Docker. Use when creating MCP servers, exposing tools/resources/prompts to LLMs, building Claude integrations, or troubleshooting FastMCP module-level server, storage, lifespan, middleware, OAuth, or deployment errors.

🇺🇸|EnglishTranslated

13 scripts/Attention

AI & Machine Learningmaragudk/evals-skills

prompt-engineering

Use this skill when crafting, reviewing, or improving prompts for LLM pipelines — including task prompts, system prompts, and LLM-as-Judge prompts. Triggers include: requests to write or refine a prompt, diagnose why an LLM produces inconsistent or incorrect outputs, bridge the gap between intent and model behavior, reduce ambiguity in instructions, add few-shot examples, structure complex prompts, or improve output formatting. Also use when the user needs help distinguishing specification failures (unclear instructions) from generalization failures (model limitations), or when iterating on prompts based on observed failure modes. Do NOT use for general coding tasks, document creation, or non-LLM writing.

🇺🇸|EnglishTranslated

AI & Machine Learningancoleman/ai-design-compo...

evaluating-llms

Evaluate LLM systems using automated metrics, LLM-as-judge, and benchmarks. Use when testing prompt quality, validating RAG pipelines, measuring safety (hallucinations, bias), or comparing models for production deployment.

🇺🇸|EnglishTranslated

9 scripts/Attention

AI & Machine Learningmelvynx/aiblueprint

prompt-creator

Expert prompt engineering for creating effective prompts for Claude, GPT, and other LLMs. Use when writing system prompts, user prompts, few-shot examples, or optimizing existing prompts for better performance.

🇺🇸|EnglishTranslated

AI & Machine Learningmlflow/skills

agent-evaluation

Use this when you need to EVALUATE OR IMPROVE or OPTIMIZE an existing LLM agent's output quality - including improving tool selection accuracy, answer quality, reducing costs, or fixing issues where the agent gives wrong/incomplete responses. Evaluates agents systematically using MLflow evaluation with datasets, scorers, and tracing. Covers end-to-end evaluation workflow or individual components (tracing setup, dataset creation, scorer definition, evaluation execution).

🇺🇸|EnglishTranslated

12 scripts/Attention

AI & Machine Learningmlflow/skills

querying-mlflow-metrics

Fetches aggregated trace metrics (token usage, latency, trace counts, quality evaluations) from MLflow tracking servers. Triggers on requests to show metrics, analyze token usage, view LLM costs, check usage trends, or query trace statistics.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningmgratzer/bloomery

bloomery

Interactive tutorial that guides engineers through building their own coding agent (agentic loop) from scratch using raw HTTP calls to an LLM API. Supports Gemini, OpenAI (and compatible endpoints), and Anthropic. Supports TypeScript, Python, Go, and Ruby. Detects progress automatically. Use when someone says "build an agent", "teach me agents", or "/build-agent".

🇺🇸|EnglishTranslated

AI & Machine Learningprefactordev/typescript-s...

instrument-existing-agent-with-prefactor-sdk

Use when an existing agent already works without Prefactor and you need to add tracing for runs, llm calls, tool calls, and failures with minimal behavior changes.

🇺🇸|EnglishTranslated