self-hosted/ai
§01·spec · /gpus

RTX 4080

nvidia40 series16GB VRAM
§02·models that run on this GPU
3 total
ModelVerticalBest speedMin VRAMWorksBenchmarks
gpt-oss 20Bllm136.5tokens/s16GB2check ↗
Qwen3-8Bllm102.7tokens/s16GB2check ↗
Qwen3 14Bllm62tokens/s16GB2check ↗