Search Results: fp8-optimization

Found 2 Skills

AI & Machine Learning | scientiacapital/skills

unsloth-training

Fine-tune LLMs with Unsloth using GRPO or SFT. Supports FP8, vision models, mobile deployment, Docker, packing, GGUF export. Use when: train with GRPO, fine-tune, reward functions, SFT training, FP8 training, vision fine-tuning, phone deployment, docker training, packing, export to GGUF.

5 scripts | Checked

AI & Machine Learning | nvidia/skills

perf-moe-optimization-workflow

Systematic workflow for MoE training optimization in Megatron Bridge, based on the Megatron-Core MoE paper. Covers the Three Walls framework, parallel folding, recompute strategy, dispatcher choice, and CUDA-graph bring-up.
