Inference
Learn how to run your finetuned model.
Last updated
Was this helpful?
Learn how to run your finetuned model.
Last updated
Was this helpful?
Unsloth supports natively 2x faster inference. For our inference only notebook, click .
All QLoRA, LoRA and non LoRA inference paths are 2x faster. This requires no change of code or any new dependencies.
Sometimes when you execute a cell can appear. To solve this, in a new cell, run the below: