Back to Quant Hub

InternLM2 7B Chat

7B

Shanghai AI Lab

Strong bilingual (EN/ZH) 7B from Shanghai AI Lab. Competitive with Qwen 7B.

⬇ 1.0K HF downloads♥ 0 likesbartowski/internlm2_5-7b-chat-GGUF· stats from 6/24/2026

Consumer GPUMac / Apple SiliconCPU / VPS

33K

Max Context

2

Quant Variants

GGUF Q4_K_M

Best Quality

96.9%

Accuracy Retained

Calculate VRAM Hugging Face Compare

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

Format	Level	BPW	VRAM	PPL Loss	Speed	Actions
GGUF	Q4_K_M	4.85	5.5 GB	3.1%	148 tok/s	Calc HF
AWQ	INT4	4	4.9 GB	4.3%	215 tok/s	Calc HF