DeepSeek-R1-Distill-Llama-70B
70B
DeepSeek
DeepSeek-R1 reasoning distilled into the Llama 70B architecture. A top open reasoning model for dual-GPU setups.
Pro GPU
Max Context: 131K
Quant Variants: 2
Best Quality: GGUF Q4_K_M
Accuracy Retained: 97.6%