Search Results: llm

Found 1,211 Skills

AI & Machine Learningdavila7/claude-code-templ...

llama-factory

Expert guidance for fine-tuning LLMs with LLaMA-Factory - WebUI no-code, 100+ models, 2/3/4/5/6/8-bit QLoRA, multimodal support

🇺🇸|EnglishTranslated

AI & Machine Learningportkey-ai/skills

portkey-python-sdk

Complete reference for the Portkey AI Gateway Python SDK with unified API access to 200+ LLMs, automatic fallbacks, caching, and full observability. Use when building Python applications that need LLM integration with production-grade reliability.

🇺🇸|EnglishTranslated

Data Processingjackspace/claudeskillz

firecrawl-scraper

Complete knowledge domain for Firecrawl v2 API - web scraping and crawling that converts websites into LLM-ready markdown or structured data. Use when: scraping websites, crawling entire sites, extracting web content, converting HTML to markdown, building web scrapers, handling dynamic JavaScript content, bypassing anti-bot protection, extracting structured data from web pages, or when encountering "content not loading", "JavaScript rendering issues", or "blocked by bot detection". Keywords: firecrawl, firecrawl api, web scraping, web crawler, scrape website, crawl website, extract content, html to markdown, site crawler, content extraction, web automation, firecrawl-py, firecrawl-js, llm ready data, structured data extraction, bot bypass, javascript rendering, scraping api, crawling api, map urls, batch scraping

🇺🇸|EnglishTranslated

Automationeveryinc/compound-enginee...

ce-optimize

Run metric-driven iterative optimization loops. Define a measurable goal, build measurement scaffolding, then run parallel experiments that try many approaches, measure each against hard gates and/or LLM-as-judge quality scores, keep improvements, and converge toward the best solution. Use when optimizing clustering quality, search relevance, build performance, prompt quality, or any measurable outcome that benefits from systematic experimentation. Inspired by Karpathy's autoresearch, generalized for multi-file code changes and non-ML domains.

🇺🇸|EnglishTranslated

3 scripts/Attention

AI & Machine Learningneolabhq/context-engineer...

implement-task

Implement a task with automated LLM-as-Judge verification for critical steps

🇺🇸|EnglishTranslated

AI & Machine Learningvasilyu1983/ai-agents-pub...

ai-prompt-engineering

Operational prompt engineering for production LLM apps: structured outputs (JSON/schema), deterministic extractors, RAG grounding/citations, tool/agent workflows, prompt safety (injection/exfiltration), and prompt evaluation/regression testing. Use when designing, debugging, or standardizing prompts for Codex CLI, Claude Code, and OpenAI/Anthropic/Gemini APIs.

🇺🇸|EnglishTranslated

Testing & QAgiuseppe-trisciuoglio/dev...

langchain4j-testing-strategies

Testing strategies for LangChain4j-powered applications. Mock LLM responses, test retrieval chains, and validate AI workflows. Use when testing AI-powered features reliably.

🇺🇸|EnglishTranslated

Security & Compliancemastepanoski/claude-skill...

owasp-llm-top10

Security audit for LLM and GenAI applications using OWASP Top 10 for LLM Apps 2025. Assess prompt injection, data leakage, supply chain, and 7 more critical vulnerabilities.

🇺🇸|EnglishTranslated

AI & Machine Learningyonatangross/orchestkit

llm-safety-patterns

Security patterns for LLM integrations including prompt injection defense and hallucination prevention. Use when implementing context separation, validating LLM outputs, or protecting against prompt injection attacks.

🇺🇸|EnglishTranslated

2 scripts/Checked

Code Qualitysimhacker/moollm

sniffable-python

Structure Python so LLMs can understand it in 50 lines.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learninglaunchdarkly/agent-skills

aiconfig-online-evals

Attach judges to AI Config variations for automatic LLM-as-a-judge evaluation. Create custom judges, configure sampling rates, and monitor quality scores.

🇺🇸|EnglishTranslated

AI & Machine Learningflora131/atomic

evaluation

This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines. Part of the context engineering skill suite — also activates when the user mentions "context engineering" or "context-engineering" in the context of measuring agent effectiveness.

🇺🇸|EnglishTranslated

1 scripts/Checked