self-hosted/ai
§01·compatibility · /check

gpt-oss 20B on RTX 3060

Yes — gpt-oss 20B runs on the RTX 3060 (12 GB). Fastest community-measured result: 64 tokens/s.

runsllmactive30 series12GB VRAM
model
name
gpt-oss 20B
slug
gpt-oss-20b
vertical
llm
status
active
open detail ↗
gpu
name
RTX 3060
slug
rtx-3060
vram
12 GB
series
30
open detail ↗
§02·benchmarks
TaskQuantSpeedVRAMWorksConfidenceSourceVerified
llmMXFP464tokens/sgithub.com· web2026-06-13
§03·common questions
Can you run gpt-oss 20B on RTX 3060?

Yes — gpt-oss 20B runs on the RTX 3060 (12 GB). Fastest community-measured result: 64 tokens/s.

Which quantizations have been tested for gpt-oss 20B on RTX 3060?

MXFP4 — measured in community benchmarks.

How fast is gpt-oss 20B on RTX 3060?

Up to 64 tokens/s (llm), the fastest community-measured result.

Are there step-by-step instructions for gpt-oss 20B on RTX 3060?

Yes — 5 published recipes document this setup; see the recipes listed below.

§04·related recipes