Search Results: fine-tuning

Found 94 Skills

AI & Machine Learningjeffallan/claude-skills

fine-tuning-expert

Use when fine-tuning LLMs, training custom models, or optimizing model performance for specific tasks. Invoke for parameter-efficient methods, dataset preparation, or model adaptation.

🇺🇸|EnglishTranslated

AI & Machine Learningorchestra-research/ai-res...

fine-tuning-serving-openpi

Fine-tune and serve Physical Intelligence OpenPI models (pi0, pi0-fast, pi0.5) using JAX or PyTorch backends for robot policy inference across ALOHA, DROID, and LIBERO environments. Use when adapting pi0 models to custom datasets, converting JAX checkpoints to PyTorch, running policy inference servers, or debugging norm stats and GPU memory issues.

🇺🇸|EnglishTranslated

AI & Machine Learningqodex-ai/ai-agent-skills

llm-fine-tuning-guide

Master fine-tuning of large language models for specific domains and tasks. Covers data preparation, training techniques, optimization strategies, and evaluation methods. Use when adapting models for specialized applications, reducing inference costs, or improving domain-specific performance.

🇺🇸|EnglishTranslated

5 scripts/Attention

AI & Machine Learningorchestra-research/ai-res...

fine-tuning-openvla-oft

Fine-tunes and evaluates OpenVLA-OFT and OpenVLA-OFT+ policies for robot action generation with continuous action heads, LoRA adaptation, and FiLM conditioning on LIBERO simulation and ALOHA real-world setups. Use when reproducing OpenVLA-OFT paper results, training custom VLA action heads (L1 or diffusion), deploying server-client inference for ALOHA, or debugging normalization, LoRA merge, and cross-GPU issues.

🇺🇸|EnglishTranslated

AI & Machine Learningtogethercomputer/skills

together-fine-tuning

LoRA, full fine-tuning, DPO preference tuning, VLM training, function-calling tuning, reasoning tuning, and BYOM uploads on Together AI. Reach for it whenever the user wants to adapt a model on custom data rather than only run inference, evaluate outputs, or host an existing model.

🇺🇸|EnglishTranslated

5 scripts/Checked

AI & Machine Learningadaptationio/skrillz

bedrock-fine-tuning

Amazon Bedrock Model Customization with fine-tuning, continued pre-training, reinforcement fine-tuning (NEW 2025 - 66% accuracy gains), and distillation. Create customization jobs, monitor training, deploy custom models, and evaluate performance. Use when customizing Claude, Titan, or other Bedrock models for domain-specific tasks, adapting to proprietary data, improving accuracy on specialized workflows, or distilling large models to smaller ones.

🇺🇸|EnglishTranslated

AI & Machine Learninglebsral/dspy-programming-...

ai-fine-tuning

Fine-tune models on your data to maximize quality and cut costs. Use when prompt optimization hit a ceiling, you need domain specialization, you want cheaper models to match expensive ones, you heard "fine-tuning will make us AI-native", you have 500+ training examples, or you need to train on proprietary data. Covers DSPy BootstrapFinetune, BetterTogether, model distillation, and when to fine-tune vs optimize prompts.

🇺🇸|EnglishTranslated

AI & Machine Learningjg-chalk-io/nora-livekit

moai-ml-llm-fine-tuning

Enterprise LLM Fine-Tuning with LoRA, QLoRA, and PEFT techniques

🇺🇸|EnglishTranslated

AI & Machine Learningdavila7/claude-code-templ...

peft-fine-tuning

Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B) with limited GPU memory, when you need to train <1% of parameters with minimal accuracy loss, or for multi-adapter serving. HuggingFace's official library integrated with transformers ecosystem.

🇺🇸|EnglishTranslated

AI & Machine Learningpluginagentmarketplace/cu...

fine-tuning

LLM fine-tuning with LoRA, QLoRA, and instruction tuning for domain adaptation.

🇺🇸|EnglishTranslated

1 scripts/Checked

AI & Machine Learningkiterlin/intelligent-dete...

fine-tuning-with-trl

Fine-tune LLMs using reinforcement learning with TRL - SFT for instruction tuning, DPO for preference alignment, PPO/GRPO for reward optimization, and reward model training. Use when need RLHF, align model with preferences, or train from human feedback. Works with HuggingFace Transformers.

🇺🇸|EnglishTranslated

AI & Machine Learningmarkpitt/claude-skills

fine-tuning-data-generator

Generates comprehensive synthetic fine-tuning datasets in ChatML format (JSONL) for use with Unsloth, Axolotl, and similar training frameworks. Gathers requirements, creates datasets with diverse examples, validates quality, and provides framework integration guidance.

🇺🇸|EnglishTranslated

2 scripts/Checked