Back to Quant Hub

Mistral Small 24B Instruct

24B

Mistral AI

Mistral's efficient 24B. Strong multilingual; fits on 24GB with Q4.

4.4K HF downloads53 likesbartowski/Mistral-Small-Instruct-2409-GGUF· stats from 6/24/2026
Consumer GPUPro GPU

33K

Max Context

3

Quant Variants

EXL2 4.65bpw

Best Quality

97.8%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.8514.2 GB2.9%62 tok/s
AWQINT4414.2 GB3.8%78 tok/s
EXL24.65bpw4.6513.5 GB2.2%88 tok/s