Search Results: llm-fine-tuning

Found 20 Skills

AI & Machine Learning | qodex-ai/ai-agent-skills

llm-fine-tuning-guide

Master fine-tuning of large language models for specific domains and tasks. Covers data preparation, training techniques, optimization strategies, and evaluation methods. Use when adapting models for specialized applications, reducing inference costs, or improving domain-specific performance.

5 scripts/Attention

AI & Machine Learning | jg-chalk-io/nora-livekit

moai-ml-llm-fine-tuning

Enterprise LLM Fine-Tuning with LoRA, QLoRA, and PEFT techniques

AI & Machine Learning | firecrawl/firecrawl-workf...

firecrawl-knowledge-base

Build a knowledge base from web content with Firecrawl. Use for local reference docs, RAG-ready chunks, fine-tuning datasets, documentation mirrors, topic corpora, or LLM-ready markdown organized from web sources.

526

AI & Machine Learning | huggingface/skills

trl-training

Train and fine-tune transformer language models using TRL (Transformer Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO, and reward model training via CLI commands.

AI & Machine Learning | davila7/claude-code-templ...

llama-factory

Expert guidance for fine-tuning LLMs with LLaMA-Factory - WebUI no-code, 100+ models, 2/3/4/5/6/8-bit QLoRA, multimodal support

AI & Machine Learning | jeffallan/claude-skills

fine-tuning-expert

Use when fine-tuning LLMs, training custom models, or optimizing model performance for specific tasks. Invoke for parameter-efficient methods, dataset preparation, or model adaptation.

AI & Machine Learning | itsmostafa/llm-engineerin...

mlx

Running and fine-tuning LLMs on Apple Silicon with MLX. Use when working with models locally on Mac, converting Hugging Face models to MLX format, fine-tuning with LoRA/QLoRA on Apple Silicon, or serving models via HTTP API.

AI & Machine Learning | sundial-org/skills

tinker-training-cost

Calculate training costs for Tinker fine-tuning jobs. Use when estimating costs for Tinker LLM training, counting tokens in datasets, or comparing Tinker model training prices. Tokenizes datasets using the correct model tokenizer and provides accurate cost estimates.

AI & Machine Learning | huggingface/skills

hugging-face-model-trainer

This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs package, UV scripts with PEP 723 format, dataset preparation and validation, hardware selection, cost estimation, Trackio monitoring, Hub authentication, and model persistence. Should be invoked for tasks involving cloud GPU training, GGUF conversion, or when users mention training on Hugging Face Jobs without local GPU setup.

6 scripts/Checked

AI & Machine Learning | rightnow-ai/openfang

llm-finetuning

LLM fine-tuning expert for LoRA, QLoRA, dataset preparation, and training optimization

AI & Machine Learning | davila7/claude-code-templ...

rwkv-architecture

RNN+Transformer hybrid with O(n) inference: linear time, infinite context, no KV cache. Trains like GPT (parallel), infers like an RNN (sequential). Linux Foundation AI project. In production at Windows, Office, and NeMo. RWKV-7 (March 2025). Models up to 14B parameters.

AI & Machine Learning | orchestra-research/ai-res...

axolotl

Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO, multimodal support
