
🔮 All Our Models

The Unsloth model catalog lists all our Dynamic GGUF, 4-bit, and 16-bit models on Hugging Face.


GGUFs let you run models in tools like Ollama, Open WebUI, and llama.cpp. Instruct (4-bit) safetensors can be used for inference or fine-tuning.
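
As a rough illustration of the split described above, the sketch below models one catalog row and picks the GGUF repo for local runners or the Instruct (4-bit) repo for inference and fine-tuning. `CatalogEntry`, `repo_for`, `load_4bit`, and the purpose labels are hypothetical names introduced here, and the two repo ids are assumptions that should be confirmed on Hugging Face. The loader assumes the `unsloth` package and a supported GPU are available.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CatalogEntry:
    """One catalog row: a model variant and its Hugging Face repos (hypothetical type)."""
    model: str
    variant: str
    gguf_repo: Optional[str] = None           # for Ollama, Open WebUI, llama.cpp
    instruct_4bit_repo: Optional[str] = None  # safetensors for inference or fine-tuning


def repo_for(entry: CatalogEntry, purpose: str) -> str:
    """Return the repo that matches the intended use.

    "local-run" selects the GGUF; "inference" or "fine-tune" selects
    the Instruct (4-bit) safetensors.
    """
    if purpose == "local-run":
        repo = entry.gguf_repo
    elif purpose in ("inference", "fine-tune"):
        repo = entry.instruct_4bit_repo
    else:
        raise ValueError(f"unknown purpose: {purpose!r}")
    if repo is None:
        raise LookupError(f"{entry.model} {entry.variant} has no repo for {purpose}")
    return repo


def load_4bit(repo_id: str, max_seq_length: int = 2048):
    """Load a 4-bit checkpoint with Unsloth; returns (model, tokenizer)."""
    # Imported lazily so the helpers above work without unsloth installed.
    from unsloth import FastLanguageModel
    return FastLanguageModel.from_pretrained(
        model_name=repo_id,
        max_seq_length=max_seq_length,
        load_in_4bit=True,
    )


# Assumed repo ids, for illustration only; confirm them on Hugging Face.
QWEN3_SMALL = CatalogEntry(
    model="Qwen3",
    variant="0.6B",
    gguf_repo="unsloth/Qwen3-0.6B-GGUF",
    instruct_4bit_repo="unsloth/Qwen3-0.6B-unsloth-bnb-4bit",
)
```

With this in place, `load_4bit(repo_for(QWEN3_SMALL, "fine-tune"))` would fetch the 4-bit safetensors, while the repo returned for `"local-run"` is the one to hand to a GGUF runner.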

Model | Variant | GGUF | Instruct (4-bit)
Gemma 3n | E2B
DeepSeek-R1-0528 | R1-0528-Qwen3-8B
DeepSeek-R1-0528 | R1-0528
Mistral | Small 3.2 24B (2506)
Mistral | Magistral Small (2506)
FLUX.1 | Kontext-dev
Qwen3 | 0.6B
Qwen3 | 30B-A3B
Qwen3 | 235B-A22B
Llama 4 | Scout 17B-16E
Llama 4 | Maverick 17B-128E
Qwen 2.5 Omni | 3B
Qwen 2.5 Omni | 7B
Phi-4 | Reasoning-plus
Phi-4 | Reasoning

DeepSeek models:

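Model | Variant | GGUF | Instruct (4-bit)
DeepSeek-V3 | V3-0324
DeepSeek-V3 | V3
DeepSeek-R1 | R1-0528
DeepSeek-R1 | R1-0528-Qwen3-8B
DeepSeek-R1 | R1
DeepSeek-R1 | R1 Zero
DeepSeek-R1 | Distill Llama 3 8B
DeepSeek-R1 | Distill Llama 3.3 70B
DeepSeek-R1 | Distill Qwen 2.5 1.5B
DeepSeek-R1 | Distill Qwen 2.5 7B
DeepSeek-R1 | Distill Qwen 2.5 14B
DeepSeek-R1 | Distill Qwen 2.5 32B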

Llama models:

Model | Variant | GGUF | Instruct (4-bit)
Llama 4 | Scout 17B-16E
Llama 4 | Maverick 17B-128E
Llama 3.3 | 70B
Llama 3.2 | 1B
Llama 3.2 | 11B Vision
Llama 3.2 | 90B Vision
Llama 3.1 | 8B
Llama 3.1 | 70B
Llama 3.1 | 405B
Llama 3 | 8B
Llama 3 | 70B
Llama 2 | 7B
Llama 2 | 13B
CodeLlama | 7B
CodeLlama | 13B
CodeLlama | 34B

Gemma models:

Model | Variant | GGUF | Instruct (4-bit)
Gemma 3n | E2B
Gemma 3 | 1B
MedGemma | 4B (vision)
MedGemma | 27B (vision)
Gemma 2 | 2B
Gemma 2 | 9B
Gemma 2 | 27B

Qwen models:

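Model | Variant | GGUF | Instruct (4-bit)
Qwen 3 | 0.6B
Qwen 3 | 30B-A3B
Qwen 3 | 235B-A22B
Qwen 2.5 Omni | 3B
Qwen 2.5 Omni | 7B
Qwen 2.5 VL | 3B
Qwen 2.5 | 0.5B
Qwen 2.5 | 1.5B
Qwen 2.5 | 3B
Qwen 2.5 | 7B
Qwen 2.5 | 14B
Qwen 2.5 | 32B
Qwen 2.5 | 72B
Qwen 2.5 Coder (128K) | 0.5B
QwQ | 32B
QVQ (preview) | 72B
Qwen 2 (chat) | 1.5B
Qwen 2 (chat) | 7B
Qwen 2 (chat) | 72B
Qwen 2 VL | 2B
Qwen 2 VL | 7B
Qwen 2 VL | 72B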

Mistral models:

Model | Variant | GGUF | Instruct (4-bit)
Mistral Small | 3.2-24B (2506)
Mistral Small | 3.1-24B (2503)
Mistral Small | 3-24B (2501)
Magistral | Small-24B (2506)
Devstral | Small-24B (2507)
Devstral | Small-24B (2505)
Pixtral | 12B (2409)
Mistral Small | 2409-22B
Mistral NeMo | 12B (2407)
Mistral Large | 2407
Mistral 7B | v0.3
Mistral 7B | v0.2
Mixtral | 8×7B

Phi models:

Model | Variant | GGUF | Instruct (4-bit)
Phi-4 | Reasoning-plus
Phi-4 | Reasoning
Phi-4 | Mini-Reasoning
Phi-4 | Phi-4 (instruct)
Phi-4 | mini (instruct)
Phi-3.5 | mini
Phi-3 | mini
Phi-3 | medium

Other (Orpheus, Smol, LLaVA, etc.) models:

Model | Variant | GGUF | Instruct (4-bit)
Hunyuan | A13B
Orpheus | 0.1-ft (3B)
LLaVA | 1.5 (7B)
LLaVA | 1.6 Mistral (7B)
TinyLlama | Chat
SmolLM 2 | 135M
Zephyr-SFT | 7B
Yi | 6B (v1.5)
Yi | 6B (v1.0)
Yi | 34B (chat)
Yi | 34B (base)
