§01·spec

RTX 4070 Super

NVIDIA · 40 series · 12GB VRAM
§02·models that run on this GPU
6 total
Model           Vertical   Best speed      Min VRAM   Works   Benchmarks
Qwen3-8B        llm        75.4 tokens/s   12GB       2       check ↗
Llama 3.1 7B    llm        75 tokens/s     12GB       1       check ↗
Qwen3 14B       llm        45.5 tokens/s   12GB       2       check ↗
Llama 3.1 13B   llm        40 tokens/s     12GB       1       check ↗
CogVideoX 1.5   video      15 min          12GB       1       check ↗
Llama 3.1 34B   llm        -               12GB       1       check ↗