Skip to content
llm-speed
Leaderboard/models/qwen3-6-27b-q4-k-m

Qwen3.6-27B-Q4_K_M.gguf

1 workload result across 1 hardware configuration.

Fastest known config

66.3 decode tok/s

on RTX 5090 (32GB) + AMD Ryzen 7 9850X3D 8-Core Processor (8c) + 30GB via llama.cpp see full run

Qwen3.6-27B-Q4_K_M.gguf on hardware