Search Results: lora-finetuning

Found 4 Skills

AI & Machine Learning | davila7/claude-code-templ...

gptq

Post-training 4-bit quantization for LLMs with minimal accuracy loss. Use for deploying large models (70B, 405B) on consumer GPUs, when you need 4× memory reduction with <2% perplexity degradation, or for faster inference (3-4× speedup over FP16). Integrates with transformers and PEFT for QLoRA fine-tuning.


108

AI & Machine Learning | davila7/claude-code-templ...

implementing-llms-litgpt

Implements and trains LLMs using Lightning AI's LitGPT with 20+ pretrained architectures (Llama, Gemma, Phi, Qwen, Mistral). Use when you need clean model implementations, an educational understanding of architectures, or production fine-tuning with LoRA/QLoRA. Single-file implementations with no abstraction layers.


AI & Machine Learning | awslabs/agent-plugins

model-deployment

Generates a Jupyter notebook that deploys fine-tuned models from SageMaker Serverless Model Customization to SageMaker endpoints or Bedrock. Use when the user says "deploy my model", "create an endpoint", "make it available", or asks about deployment options. Identifies the correct deployment pathway (Nova vs OSS), generates deployment code, and handles endpoint configuration.


4 scripts/Checked

AI & Machine Learning | replicate/skills

prompt-images

Prompting techniques for AI image generation and editing models on Replicate. Use when writing prompts for image models or building image generation features.
