§01·compatibility · /check

Qwen3-8B on RTX 4090

? untestedllmactive40 series24GB VRAM

model

name: Qwen3-8B
slug: qwen3-8b
vertical: llm
status: active
repo: huggingface.co ↗

open detail ↗

gpu

name: RTX 4090
slug: rtx-4090
vram: 24 GB
series: 40

open detail ↗

§02·benchmarks

No verified benchmarks yet. Be the first to contribute one.

§03·related recipes

llmbeginner6GB+
Qwen3-8B on RTX 5090: Q4_K_M GGUF with 26 GB of Headroom for Colocation, BF16, or Full 131K Context
llmbeginner6GB+
Qwen3-8B on RTX 3090: Q4_K_M GGUF with 18 GB of Headroom for Colocation or Long Context
llmbeginner6GB+
Qwen3-8B on RTX 4090: Q4_K_M GGUF via Ollama or llama.cpp
llmbeginner16GB+
Qwen3-8B on RTX 4060 Ti 16GB: Q4_K_M GGUF via Ollama or llama.cpp
llmbeginner16GB+
Qwen3-8B on RTX 5060 Ti: Q4_K_M GGUF via Ollama or llama.cpp