self-hosted/ai
§01·model · /models

Llama 3.1 8B

llmactive
§02·GPUs that run this model
7 total
GPUVRAMSeriesBest speedMin VRAMWorksBenchmarks
RTX 409024GB4096.12tokens/s1check ↗
RTX 3060 Ti8GB3057.34tokens/s8GB1check ↗
RTX 5060 Ti16GB5055.5tokens/s16GB2check ↗
RTX 407012GB4052.1tokens/s1check ↗
RX 7900 XTX24GBamd51.3tokens/s24GB1check ↗
RTX 306012GB3042tokens/s1check ↗
RTX 4060 Ti 16GB16GB4034tokens/s16GB1check ↗