self-hosted/ai
§01·spec · /gpus

RTX 3060 Ti

nvidia30 series8GB VRAM
§02·models that run on this GPU
11 total
ModelVerticalBest speedMin VRAMWorksBenchmarks
llama2 7bllm73.07tokens/s8GB1check ↗
llava 7bmultimodal72tokens/s8GB1check ↗
wizardlm2 7bllm70.79tokens/s8GB1check ↗
qwen2 7bllm63.73tokens/s8GB1check ↗
Qwen2.5 7Bllm58.13tokens/s8GB1check ↗
Llama 3.1 8Bllm57.34tokens/s8GB1check ↗
gemma 7bllm31.95tokens/s8GB1check ↗
falcon2 11bllm31.2tokens/s8GB1check ↗
Gemma 2 9Bllm23.8tokens/s8GB1check ↗
stablelm2 12bllm18.73tokens/s8GB1check ↗
llama2 13bllm9.25tokens/s8GB1check ↗