§01·compatibility · /check
gpt-oss 20B on RTX 4070
? untestedllmactive40 series12GB VRAM
§02·benchmarks
No verified benchmarks yet. Be the first to contribute one.
§03·related recipes
- llmintermediate12GB+
gpt-oss 20B on RTX 4070: MXFP4 Chat in 12 GB via llama.cpp Expert Offload
- llmintermediate12GB+
gpt-oss 20B on RTX 5070: MXFP4 Chat in 12 GB via llama.cpp Expert Offload
- llmbeginner16GB+
gpt-oss 20B on RTX 5070 Ti: MXFP4 Chat at 156 tok/s via Ollama or vLLM
- llmbeginner16GB+
gpt-oss 20B on RTX 4080 SUPER: MXFP4 chat at 139 tok/s via Ollama or vLLM
- llmbeginner16GB+
gpt-oss 20B on RTX 4070 Ti SUPER: MXFP4 chat at 129 tok/s via Ollama or vLLM