Loading...
Loading...
Found 2 Skills
Train and fine-tune transformer language models using TRL (Transformers Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO and Reward Model training via CLI commands.
Train SONA neural patterns from successful task completions, view learned patterns, and optimize the intelligence pipeline