§01·model

gpt-oss 20B

llm · active
§02·GPUs that run this model
10 total
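| GPU | VRAM | Series | Best speed | Min VRAM | Works | Benchmarks |
|---|---|---|---|---|---|---|
| RTX 4080 Super | 16GB | 40 | 6364 prefill tokens/s | 16GB | 1 | check ↗ |
| RTX 5090 | 32GB | 50 | 298.2 tokens/s | | 3 | check ↗ |
| RTX 5080 | 16GB | 50 | 172.4 tokens/s | 16GB | 2 | check ↗ |
| RTX 3090 Ti | 24GB | 30 | 160.3 tokens/s | 24GB | 2 | check ↗ |
| RTX 5070 Ti | 16GB | 50 | 156 tokens/s | 16GB | 2 | check ↗ |
| RTX 3090 | 24GB | 30 | 147.5 tokens/s | 24GB | 2 | check ↗ |
| RTX 4080 | 16GB | 40 | 136.5 tokens/s | 16GB | 2 | check ↗ |
| RTX 4070 Ti Super | 16GB | 40 | 128.9 tokens/s | 16GB | 2 | check ↗ |
| RTX 5060 Ti | 16GB | 50 | 92.1 tokens/s | 16GB | 2 | check ↗ |
| RTX 4060 Ti 16GB | 16GB | 40 | 63.2 tokens/s | 16GB | 2 | check ↗ |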