Loading...
Loading...
Generates a Jupyter notebook that fine-tunes a base model using SageMaker serverless training jobs. Use when the user says "start training", "fine-tune my model", "I'm ready to train", or when the plan reaches the finetuning step. Supports SFT, DPO, and RLVR trainers, including RLVR Lambda reward function creation.
npx skill4agent add awslabs/agent-plugins finetuninguse_case_spec.mduse-case-specificationfinetuning-setupfinetuning-setupfinetuning-setup[title]_finetuning.ipynb[title]references/sft_example.mdreferences/dpo_example.mdreferences/rlvr_example.md[title]_finetuning.ipynbuse_case_spec.md[a-zA-Z0-9](-*[a-zA-Z0-9]){0,62}customer-support-chatbot-v1references/rlvr_reward_function.mdCUSTOM_REWARD_FUNCTIONevaluator.arnACCEPT_EULATrueA Jupyter notebook has now been generated which will help you finetune your model. You are free to run it now. Please let me know once the training is complete.rlvr_reward_function.mdtemplates/rlvr_reward_function_source_template.pytemplates/nova_rlvr_reward_function_source_template.pysft_example.mddpo_example.mdrlvr_example.md