§01·tested · /gpus/rtx-5060-ti
Recipes tested on RTX 5060 Ti
20 community-tested setups — recipes and guides whose author ran them on this exact card.
nvidia50 series16GB VRAM
- llmbeginner16GB+recipe
Qwen3-8B on RTX 5060 Ti: Q4_K_M GGUF via Ollama or llama.cpp
- multimodalbeginner6GB+recipe
Gemma 4 E4B on RTX 5060 Ti: Multimodal Inference with transformers or llama.cpp
- ttsintermediate5GB+recipe
OpenAudio S1 Mini on RTX 5060 Ti: 13-Language Distilled TTS in ~5 GB VRAM
- ttsintermediate4GB+recipe
OmniVoice on RTX 5060 Ti: Zero-Shot Voice Cloning Across 646 Languages
- imageintermediate12GB+recipe
SenseNova U1 (8B-MoT) on RTX 5060 Ti: VAE-Free Unified Image Gen + Understanding via Q4 GGUF
- imageintermediate16GB+recipe
LongCat-Image (base T2I) on RTX 5060 Ti: Bilingual 6B Text-to-Image at 16 GB via ComfyUI GGUF
- ttsintermediate8GB+recipe
Foundation-1 on RTX 5060 Ti: Structured Music Sample Generation
- ttsintermediate12GB+recipe
ACE-Step 1.5 XL on RTX 5060 Ti: Text-to-Music Generation in ComfyUI
- ttsintermediate8GB+recipe
Qwen3-TTS 1.7B-Base on RTX 5060 Ti: Multilingual Voice Cloning in 10 Languages
- imageintermediate16GB+recipe
Chroma V48 on RTX 5060 Ti: Uncensored 8.9B Flux.1-Schnell De-Distillation via GGUF in ComfyUI
- ttsintermediate12GB+recipe
MOSS-Audio 4B-Instruct on RTX 5060 Ti: local audio understanding in ~12 GB
- videointermediate8GB+recipe
Wan 2.2 TI2V-5B on RTX 5060 Ti: 720p Text/Image-to-Video in ComfyUI
- imageintermediate13GB+recipe
Qwen-Image on RTX 5060 Ti: 20B Text-to-Image via GGUF Quantization
- ttsbeginner2GB+recipe
Kokoro TTS on RTX 5060 Ti: 82M-Parameter Text-to-Speech, 47 Voices, Under 3 GB VRAM
- ttsbeginner8GB+recipe
VoxCPM2 on RTX 5060 Ti: 30-Language 48kHz Voice Cloning in ~8 GB VRAM
- videointermediate8GB+recipe
LightX2V on RTX 5060 Ti: 4-Step Text-to-Video with Distilled Wan2.1-14B
- videoadvanced16GB+recipe
Sulphur 2 on RTX 5060 Ti: Uncensored LTX-2.3 Video via GGUF in ComfyUI
- imageintermediate16GB+recipe
Juggernaut Z on RTX 5060 Ti: Cinematic Photoreal Fine-Tune of Z-Image Base
- ttsintermediate10GB+recipe
Voxtral Mini 3B on RTX 5060 Ti: local speech understanding in ~9.5 GB
- imageintermediate10GB+recipe
HiDream-O1-Image on RTX 5060 Ti: 2048×2048 Text-to-Image with FP8 in ComfyUI