Search Results: llm-training

Found 17 Skills

AI & Machine Learningpromptingcompany/nv-skill...

nemo-mbridge-perf-activation-recompute

Validate and use selective and full activation recompute in Megatron Bridge to reduce GPU memory usage at the cost of extra compute.

🇺🇸|EnglishTranslated

AI & Machine Learningorchestra-research/ai-res...

axolotl

Expert guidance for fine-tuning LLMs with Axolotl - YAML configs, 100+ models, LoRA/QLoRA, DPO/KTO/ORPO/GRPO, multimodal support

🇺🇸|EnglishTranslated

AI & Machine Learningdavila7/claude-code-templ...

unsloth

Expert guidance for fast fine-tuning with Unsloth - 2-5x faster training, 50-80% less memory, LoRA/QLoRA optimization

🇺🇸|EnglishTranslated

AI & Machine Learningorchestra-research/ai-res...

constitutional-ai

Anthropic's method for training harmless AI through self-improvement. Two-phase approach - supervised learning with self-critique/revision, then RLAIF (RL from AI Feedback). Use for safety alignment, reducing harmful outputs without human labels. Powers Claude's safety system.

🇺🇸|EnglishTranslated

AI & Machine Learningtheneoai/awesome-skills

autoresearch

AI autonomous research agent for LLM training optimization using opencode as the agent. The agent autonomously modifies train.py, runs experiments, evaluates val_bpb, and iterates to find the best model. Use when: "run autoresearch", "start experiment", "train model", "autonomous research", "optimize LLM training".

🇺🇸|EnglishTranslated

2 scripts/Attention