Back to Quant Hub

Gemma 2 27B Instruct

27B

Google Gemma 2

Largest open Gemma 2. Strong reasoning; needs 24GB+ VRAM at Q4.

7.1K HF downloads174 likesbartowski/gemma-2-27b-it-GGUF· stats from 6/24/2026
Consumer GPUPro GPU

8K

Max Context

3

Quant Variants

GGUF Q5_K_M

Best Quality

98.7%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.8518.5 GB2.9%48 tok/s
GGUFQ5_K_M5.6821.2 GB1.3%42 tok/s
AWQINT4416.2 GB4.0%58 tok/s