§01·model

Qwen3 14B

llm · active
§02·GPUs that run this model
16 total
GPU                VRAM   Series  Best speed       Min VRAM  Works  Benchmarks
RTX 5090           32GB   50      123.8 tokens/s   -         3      check ↗
RTX 5080           16GB   50      80.6 tokens/s    16GB      2      check ↗
RTX 3090 Ti        24GB   30      76.2 tokens/s    24GB      2      check ↗
RTX 5070 Ti        16GB   50      74.3 tokens/s    16GB      2      check ↗
RTX 3090           24GB   30      70 tokens/s      24GB      2      check ↗
RTX 3080 Ti        12GB   30      69.9 tokens/s    -         2      check ↗
RTX 4080 Super     16GB   40      64.2 tokens/s    16GB      2      check ↗
RTX 4080           16GB   40      62 tokens/s      16GB      2      check ↗
Apple M4 Max       0GB    apple   60 tokens/s      9GB       1      check ↗
RTX 4070 Ti Super  16GB   40      58.1 tokens/s    16GB      2      check ↗
RTX 5070           12GB   50      54.2 tokens/s    12GB      2      check ↗
RTX 4070 Ti        12GB   40      45.8 tokens/s    12GB      2      check ↗
RTX 4070 Super     12GB   40      45.5 tokens/s    12GB      2      check ↗
RTX 5060 Ti        16GB   50      41.1 tokens/s    16GB      2      check ↗
RTX 4060 Ti 16GB   16GB   40      27.4 tokens/s    16GB      2      check ↗
RTX 3060           12GB   30      22.7 tokens/s    -         1      check ↗
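
For readers who want to query the listing rather than scan it, the rows above can be loaded into a small script. The following is only a minimal sketch: the GPU names, VRAM sizes, and best speeds are copied from the table, while the names `ROWS`, `gpus_at_least`, and `fastest_per_vram` are hypothetical helpers made up for illustration and are not part of the site.

```python
# Rows copied from the table above: (GPU, listed VRAM in GB, best speed in tokens/s).
# The Apple M4 Max is listed with 0GB VRAM in the source, so it is kept as 0 here.
ROWS = [
    ("RTX 5090", 32, 123.8),
    ("RTX 5080", 16, 80.6),
    ("RTX 3090 Ti", 24, 76.2),
    ("RTX 5070 Ti", 16, 74.3),
    ("RTX 3090", 24, 70.0),
    ("RTX 3080 Ti", 12, 69.9),
    ("RTX 4080 Super", 16, 64.2),
    ("RTX 4080", 16, 62.0),
    ("Apple M4 Max", 0, 60.0),
    ("RTX 4070 Ti Super", 16, 58.1),
    ("RTX 5070", 12, 54.2),
    ("RTX 4070 Ti", 12, 45.8),
    ("RTX 4070 Super", 12, 45.5),
    ("RTX 5060 Ti", 16, 41.1),
    ("RTX 4060 Ti 16GB", 16, 27.4),
    ("RTX 3060", 12, 22.7),
]


def gpus_at_least(min_tokens_per_s):
    """Return (name, speed) pairs whose best speed meets the threshold, fastest first."""
    hits = [(name, speed) for name, _vram, speed in ROWS if speed >= min_tokens_per_s]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


def fastest_per_vram():
    """Map each listed VRAM size (GB) to the fastest GPU listed at that size."""
    best = {}
    for name, vram, speed in ROWS:
        if vram not in best or speed > best[vram][1]:
            best[vram] = (name, speed)
    return best


if __name__ == "__main__":
    for name, speed in gpus_at_least(60):
        print(f"{name}: {speed} tokens/s")
```

Calling `fastest_per_vram()` groups the listing by VRAM size, which is a quick way to see the best reported speed at each memory tier.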