self-hosted/ai
§01·spec · /gpus

RTX 3090

nvidia30 series24GB VRAM
§02·models that run on this GPU
17 total
ModelVerticalBest speedMin VRAMWorksBenchmarks
Qwen3 30B-A3Bllm153.6tokens/s24GB2check ↗
gpt-oss 20Bllm147.5tokens/s24GB2check ↗
Gemma 4 26B MoEllm119.4tokens/s24GB2check ↗
Qwen3-8Bllm115.3tokens/s24GB2check ↗
Qwen3.5 35Bllm111.2tokens/s24GB2check ↗
Llama 3.1 7Bllm95tok/s24GB1check ↗
Qwen3 14Bllm70tokens/s24GB2check ↗
Llama 3.1 13Bllm55tok/s24GB1check ↗
Flux.1 Devimage40s24GB3check ↗
Qwen3 32Bllm35.1tokens/s24GB2check ↗
Gemma4 31Bllm34.7tokens/s24GB2check ↗
Qwen3.5 27Bllm33.5tokens/s24GB2check ↗
Llama 3.1 34Bllm28tok/s24GB1check ↗
Llama 3.1 70Bllm10tok/s24GB1check ↗
Wan 2.2 14Bvideo724GB1check ↗
Stable Diffusion XLimage5.6s24GB2check ↗
Kokoro TTStts96GB1check ↗
§03·tested recipes
showing 6 of 20