Drop a GPU, a model, and the numbers you measured. A source link — a forum post, a gist, a screenshot — helps us cross-check before the entry shows up in the dataset.

open dataCC BY-SA

Submit a benchmark

Will it run on your GPU?

Have a GPU

Want a model

Comparing options

Training a reusable character LoRA for Z-Image-Turbo

Qwen3.5-35B-A3B on RTX 5090: Blackwell MXFP4 MoE Chat at 165 tok/s

Qwen3.5 27B on RTX 5090: Q4_K GGUF local chat via llama.cpp

LTX-2.3 on RTX 4060 Ti 16GB: 22B Audio-Video at the 16 GB Floor via Distilled GGUF + Streamed Encoder

Llama 3.3 70B on RTX 4090: 70B-Class Chat on One 24 GB Card (Q4 Offload or Fully-On-GPU IQ2)

Qwen3-14B on RTX 4060 Ti 16GB: Q4_K_M GGUF via Ollama or llama.cpp

Ran a benchmark? Share the numbers.

Ran a benchmark?
Share the numbers.