self-hosted
/ai
GPUs
Models
Compare
Tools
Search
⌘K
§01
·
compatibility · /check
Qwen3 32B
on
RTX 4090
? untested
llm
active
40 series
24GB VRAM
model
name
Qwen3 32B
slug
qwen3-32b
vertical
llm
status
active
repo
huggingface.co ↗
open detail ↗
gpu
name
RTX 4090
slug
rtx-4090
vram
24 GB
series
40
open detail ↗
§02
·
benchmarks
No verified benchmarks yet. Be the first to contribute one.
§03
·
related recipes
llm
intermediate
29GB+
Qwen3-32B on RTX 5090: Q6_K_XL GGUF via llama.cpp (with AWQ-INT4 + 128K context alternative)
llm
intermediate
22GB+
Qwen3-32B on RTX 3090: UD-Q4_K_XL GGUF via llama.cpp
llm
intermediate
22GB+
Qwen3-32B on RTX 4090: UD-Q4_K_XL GGUF via llama.cpp