Unsloth now supports full fine-tuning, 8-bit and all models! 🦥

Tutorial: Train your own Reasoning model with GRPO

Beginner's Guide to transforming a model like Llama 3.1 (8B) into a reasoning model by using Unsloth and GRPO.

Last updated

Was this helpful?