Open Source · Edge Deployment · Geek-First
Quantize
Everything.
Bridge the gap between research papers and real-world deployment. Run state-of-the-art LLMs on consumer hardware.
GGUF
AWQ
EXL2
GPTQ
HQQ
& more
10 Models Indexed
5 Formats Tracked
33 GPUs in Database
99.2% Avg Accuracy Retained
Today's Quant Feed
Latest community quantization releases
- NEW · Qwen2.5-72B-Instruct · GGUF · Q4_K_M · 43.6 GB · bartowski · RTX 4090 ×2 · 2h ago
- HOT · DeepSeek-R1-Distill-Qwen-14B · EXL2 · 4.65bpw · 9.8 GB · turboderp · RTX 4090 · 5h ago
- NEW · Llama-3.3-70B-Instruct · GGUF · Q5_K_M · 50.1 GB · unsloth · A100 80G · 8h ago
- UPD · Mistral-Small-24B-Instruct · AWQ · INT4 · 14.2 GB · city96 · RTX 3090 · 12h ago
- HOT · Qwen2.5-Coder-32B-Instruct · GGUF · Q4_K_M · 22.0 GB · bartowski · RTX 4090 · 18h ago
Format Heat Index
Community adoption · this week
- 1. GGUF · 89% · +3%
- 2. AWQ · 45% · +7%
- 3. EXL2 · 32% · 0%
- 4. GPTQ · 28% · -2%
- 5. HQQ · 18% · +12%
vs last week
Format Intelligence Radar
Compare quantization formats across 6 key dimensions