§01·spec · /gpus
RTX 5090
nvidia50 series32GB VRAM
§02·models that run on this GPU
12 total| Model | Vertical | Best speed | Min VRAM | Works | Benchmarks | |
|---|---|---|---|---|---|---|
| gpt-oss 20B | llm | 298.2tokens/s | ✓ | 3 | check ↗ | |
| Qwen3 30B-A3B | llm | 226.1tokens/s | ✓ | 2 | check ↗ | |
| Qwen3-8B | llm | 200.4tokens/s | ✓ | 3 | check ↗ | |
| Gemma 4 26B MoE | llm | 180.3tokens/s | ✓ | 2 | check ↗ | |
| Qwen3.5 35B | llm | 165.2tokens/s | ✓ | 2 | check ↗ | |
| Qwen3 30B | llm | 141.63tokens/s | ✓ | 1 | check ↗ | |
| Qwen3 14B | llm | 123.8tokens/s | ✓ | 3 | check ↗ | |
| Qwen3 32B | llm | 61.4tokens/s | ✓ | 3 | check ↗ | |
| Gemma4 31B | llm | 61.1tokens/s | ✓ | 2 | check ↗ | |
| Qwen3.5 27B | llm | 58.8tokens/s | ✓ | 2 | check ↗ | |
| Flux.1 Dev | image | 9.55s | 32GB | ✓ | 7 | check ↗ |
| Stable Diffusion XL | image | 8.8it/s | 32GB | ✓ | 2 | check ↗ |
§03·tested recipes
showing 6 of 17- imageintermediate21GB+recipe
Qwen-Image on RTX 5090: 20B Text-to-Image via ComfyUI FP8 (Blackwell Native Path)
- imagebeginner9GB+recipe
Flux.2 Klein 4B on RTX 5090: FP8 1.2-Second Generation, Blackwell-Native Speed Win
- imageintermediate18GB+recipe
LongCat-Image (base T2I) on RTX 5090: Bilingual 6B Text-to-Image via diffusers BF16 with 14 GB Headroom
- imageintermediate24GB+recipe
Chroma1-Base (V48) on RTX 5090: Uncensored 8.9B FLUX.1-Schnell De-Distillation via Diffusers BF16
- imageintermediate17GB+recipe
HiDream-O1-Image on RTX 5090: 2048×2048 Text-to-Image with MXFP8 Blackwell-Native Acceleration in ComfyUI
- llmintermediate29GB+recipe
Qwen3-32B on RTX 5090: Q6_K_XL GGUF via llama.cpp (with AWQ-INT4 + 128K context alternative)