WizardLM-2 7B

Microsoft / WizardLM

A Mistral-based 7B model fine-tuned with Evol-Instruct. Handles complex instructions well.

Consumer GPU, Mac / Apple Silicon, CPU / VPS

Max Context: 33K tokens
Quant Variants: 2
Best Quality: GGUF Q4_K_M
Accuracy Retained: 96.9%

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

Format	Level	BPW	VRAM	PPL Loss	Speed
GGUF	Q4_K_M	4.85	5.4 GB	3.1%	152 tok/s
AWQ	INT4	4	4.8 GB	4.3%	218 tok/s
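The BPW and VRAM columns can be related with a rough back-of-the-envelope estimate. The sketch below is illustrative only: `PARAM_COUNT` is an assumed parameter count of about 7.24 billion (typical of Mistral-7B-class models, not stated on this page), and the helper names are hypothetical. It computes the weight-only footprint from bits per weight and treats the remainder of the listed VRAM as overhead (KV cache, activations, runtime buffers). It also assumes that "Accuracy Retained" is simply 100% minus the PPL loss, which matches the Q4_K_M figures shown here but is not confirmed by the page.

```python
# Rough weight-only memory estimate from bits per weight (BPW).
# ASSUMPTION: ~7.24e9 parameters, typical of a Mistral-7B-class model.
PARAM_COUNT = 7.24e9


def weight_gb(bpw: float, params: float = PARAM_COUNT) -> float:
    """Approximate size of the quantized weights in GB (10^9 bytes)."""
    return params * bpw / 8 / 1e9


def accuracy_retained(ppl_loss_pct: float) -> float:
    """ASSUMPTION: accuracy retained is 100% minus the perplexity loss."""
    return 100.0 - ppl_loss_pct


# (format, level, BPW, listed VRAM in GB, PPL loss in %) copied from the table.
VARIANTS = [
    ("GGUF", "Q4_K_M", 4.85, 5.4, 3.1),
    ("AWQ", "INT4", 4.0, 4.8, 4.3),
]

if __name__ == "__main__":
    for fmt, level, bpw, listed_vram, ppl_loss in VARIANTS:
        weights = weight_gb(bpw)
        # Whatever the weights do not account for is treated as overhead.
        overhead = listed_vram - weights
        print(
            f"{fmt} {level}: weights ~{weights:.2f} GB, "
            f"overhead ~{overhead:.2f} GB, "
            f"accuracy retained ~{accuracy_retained(ppl_loss):.1f}%"
        )
```

Under these assumptions the weights alone come out below the listed VRAM for both variants, which is consistent with the table's figures including some runtime overhead on top of the weights.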