Back to Quant Hub

InternLM2 7B Chat

7B

Shanghai AI Lab

Strong bilingual (EN/ZH) 7B from Shanghai AI Lab. Competitive with Qwen 7B.

1.0K HF downloads0 likesbartowski/internlm2_5-7b-chat-GGUF· stats from 6/24/2026
Consumer GPUMac / Apple SiliconCPU / VPS

33K

Max Context

2

Quant Variants

GGUF Q4_K_M

Best Quality

96.9%

Accuracy Retained

Quantization Variants

Per-quant VRAM, quality loss, and inference speed on RTX 4090

FormatLevelBPWVRAMPPL LossSpeedActions
GGUFQ4_K_M4.855.5 GB3.1%148 tok/s
AWQINT444.9 GB4.3%215 tok/s