§01·model · /models

Ornith 1.0 9B

llm · active · MIT

The small-rig member of DeepReinforce's open (MIT-licensed) Ornith 1.0 agentic-coding family: a ~9B dense model (Qwen3.5 + Gemma 4 lineage) with a 262K context window, <think> reasoning, and tool calling. It scores 69.4 on SWE-bench Verified. It runs locally via llama.cpp or Ollama from GGUF files; quants range from Q4_K_M at ~5.63 GB (fits an 8 GB card) to Q8_0 at ~9.53 GB. It is the companion to Ornith 1.0 35B for cards below the 35B's 24 GB floor.
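The fit claims above come down to simple arithmetic: quant file size plus some runtime headroom, compared against the card's VRAM. The sketch below is one rough way to check that, not the site's own method. The two sizes are the ones quoted in the description (the other variants are not listed here, so they are left out). The 1.5 GB overhead is an assumed placeholder for KV cache and runtime buffers; the real figure depends on context length and backend, and usable VRAM is usually a little below the nominal figure.

```python
# Quant sizes quoted in the description above (GB on disk).
QUANT_SIZES_GB = {"Q4_K_M": 5.63, "Q8_0": 9.53}

# ASSUMPTION: a flat allowance for KV cache, activations, and runtime buffers.
# This is a placeholder, not a measured value; it grows with context length.
OVERHEAD_GB = 1.5


def fits(vram_gb, quant, overhead_gb=OVERHEAD_GB):
    """Return True if the quant's weights plus the assumed overhead fit in VRAM."""
    return QUANT_SIZES_GB[quant] + overhead_gb <= vram_gb


def largest_fitting_quant(vram_gb, overhead_gb=OVERHEAD_GB):
    """Return the largest listed quant that fits, or None if none does."""
    candidates = [q for q in QUANT_SIZES_GB if fits(vram_gb, q, overhead_gb)]
    return max(candidates, key=QUANT_SIZES_GB.get, default=None)


if __name__ == "__main__":
    for vram in (8, 12, 16):
        print(f"{vram} GB card: {largest_fitting_quant(vram)}")
```

Under these assumptions an 8 GB card takes Q4_K_M but not Q8_0, which matches the description. Raising `overhead_gb` is a quick way to see how a longer context would squeeze the smaller cards.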

Download · 5 variants

huggingface.co ↗ · deep-reinforce.com ↗

§02·GPUs that run this model

19 total

GPU	VRAM	Series	Works	Recipe	Check
Apple M2 Pro	16GB	apple	~	recipe	check ↗
RTX 3060	12GB	30	~	recipe	check ↗
RTX 3060 Ti	8GB	30	~	recipe	check ↗
RTX 3080 Ti	12GB	30	~	recipe	check ↗
RTX 4060	8GB	40	~	recipe	check ↗
RTX 4060 Ti 16GB	16GB	40	~	recipe	check ↗
RTX 4060 Ti 8GB	8GB	40	~	recipe	check ↗
RTX 4070	12GB	40	~	recipe	check ↗
RTX 4070 Super	12GB	40	~	recipe	check ↗
RTX 4070 Ti	12GB	40	~	recipe	check ↗
RTX 4070 Ti Super	16GB	40	~	recipe	check ↗
RTX 4080	16GB	40	~	recipe	check ↗
RTX 4080 Super	16GB	40	~	recipe	check ↗
RTX 5060	8GB	50	~	recipe	check ↗
RTX 5060 Ti	16GB	50	~	recipe	check ↗
RTX 5070	12GB	50	~	recipe	check ↗
RTX 5070 Ti	16GB	50	~	recipe	check ↗
RTX 5080	16GB	50	~	recipe	check ↗
RX 7800 XT	16GB	amd	~	recipe	check ↗

✓ benchmarked · ~ runs via recipe (not benchmarked) · — untested · ✕ doesn't fit