§01·model · /models
Qwen3 14B
llmactive
§02·GPUs that run this model
16 total| GPU | VRAM | Series | Best speed | Min VRAM | Works | Benchmarks | |
|---|---|---|---|---|---|---|---|
| RTX 5090 | 32GB | 50 | 123.8tokens/s | ✓ | 3 | check ↗ | |
| RTX 5080 | 16GB | 50 | 80.6tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 3090 Ti | 24GB | 30 | 76.2tokens/s | 24GB | ✓ | 2 | check ↗ |
| RTX 5070 Ti | 16GB | 50 | 74.3tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 3090 | 24GB | 30 | 70tokens/s | 24GB | ✓ | 2 | check ↗ |
| RTX 3080 Ti | 12GB | 30 | 69.9tokens/s | ✓ | 2 | check ↗ | |
| RTX 4080 Super | 16GB | 40 | 64.2tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 4080 | 16GB | 40 | 62tokens/s | 16GB | ✓ | 2 | check ↗ |
| Apple M4 Max | 0GB | apple | 60tokens/s | 9GB | ✓ | 1 | check ↗ |
| RTX 4070 Ti Super | 16GB | 40 | 58.1tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 5070 | 12GB | 50 | 54.2tokens/s | 12GB | ✓ | 2 | check ↗ |
| RTX 4070 Ti | 12GB | 40 | 45.8tokens/s | 12GB | ✓ | 2 | check ↗ |
| RTX 4070 Super | 12GB | 40 | 45.5tokens/s | 12GB | ✓ | 2 | check ↗ |
| RTX 5060 Ti | 16GB | 50 | 41.1tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 4060 Ti 16GB | 16GB | 40 | 27.4tokens/s | 16GB | ✓ | 2 | check ↗ |
| RTX 3060 | 12GB | 30 | 22.7tokens/s | ✓ | 1 | check ↗ |