self-hosted/ai
§01·spec

Apple M4 Max

apple · apple series · 0GB VRAM
§02·models that run on this GPU
5 total
Model           Vertical   Best speed      Min VRAM   Works
Llama 3.2 8B    llm        95 tokens/s     5GB        1
Qwen3 14B       llm        60 tokens/s     9GB        1
Qwen3 32B       llm        35 tokens/s     20GB       1
Llama 3.3 70B   llm        16.5 tokens/s   40GB       1
DeepSeek V3     llm        6.5 tokens/s    90GB       1
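
The listing above can be queried programmatically. The following is a minimal sketch, not part of the site: it copies the five rows into Python and filters them by a memory budget. The names ModelRow, models_that_fit, and seconds_for are hypothetical, and the budget is something the caller supplies, since the spec line reports 0GB VRAM for this GPU. The time estimate assumes the listed best speed is sustained for the whole generation, which is optimistic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRow:
    """One row of the listing (hypothetical container, not a site API)."""
    name: str
    tokens_per_s: float  # "Best speed" column
    min_vram_gb: int     # "Min VRAM" column


# Values copied verbatim from the listing above.
ROWS = [
    ModelRow("Llama 3.2 8B", 95, 5),
    ModelRow("Qwen3 14B", 60, 9),
    ModelRow("Qwen3 32B", 35, 20),
    ModelRow("Llama 3.3 70B", 16.5, 40),
    ModelRow("DeepSeek V3", 6.5, 90),
]


def models_that_fit(budget_gb, rows=ROWS):
    """Return the rows whose minimum VRAM is within the given budget (GB)."""
    return [row for row in rows if row.min_vram_gb <= budget_gb]


def seconds_for(row, n_tokens):
    """Estimate generation time, assuming the best speed is sustained."""
    return n_tokens / row.tokens_per_s


if __name__ == "__main__":
    # Assumed budget of 48GB; adjust to the memory actually available.
    for row in models_that_fit(48):
        print(f"{row.name}: ~{seconds_for(row, 1000):.1f}s per 1000 tokens")
```

With a 48GB budget, models_that_fit returns the first four rows, because DeepSeek V3 lists a 90GB minimum.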