Qwen3-Coder-30B-A3B-Instruct vs Codestral-22B-v0.1
Side-by-side decode tok/s, prefill tok/s, and TTFT for Qwen3-Coder-30B-A3B-Instruct and Codestral-22B-v0.1, sourced from community-submitted runs of the llm-speed suite. Every number on this page links back to the run it came from.
Verdict
On the M3 Ultra (60-core GPU), Qwen3-Coder-30B-A3B-Instruct decodes at 112 tok/s versus 47 tok/s for Codestral-22B-v0.1, roughly 2.4× as fast (+64.7 tok/s). That is the only hardware with runs for both models so far; the row below links to both source runs. Submit a run on other hardware to widen the comparison.
Hardware with data on both models
| Hardware | Qwen3-Coder-30B-A3B-Instruct decode (tok/s) | Codestral-22B-v0.1 decode (tok/s) | Δ (tok/s) | Source runs |
|---|---|---|---|---|
| M3 Ultra (60-core GPU) | 112.2 | 47.49 | +64.7 | r_fpsca03u2o_ · r_79dvtag5fd_ |
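
The Δ column and the speedup quoted in the verdict can be reproduced from the two decode figures in the row above. The sketch below is only an illustration of that arithmetic; the function name `compare_decode` is hypothetical and is not part of the llm-speed suite.

```python
def compare_decode(a_tok_s: float, b_tok_s: float) -> tuple[float, float]:
    """Return (absolute difference in tok/s, ratio a/b) for two decode rates."""
    if a_tok_s <= 0 or b_tok_s <= 0:
        raise ValueError("decode rates must be positive")
    return a_tok_s - b_tok_s, a_tok_s / b_tok_s


# Decode rates from the M3 Ultra (60-core GPU) row of the table.
delta, ratio = compare_decode(112.2, 47.49)
print(f"delta = {delta:+.1f} tok/s, ratio = {ratio:.1f}x")
```

The ratio is computed from the unrounded table values (112.2 / 47.49), then rounded to one decimal place for display.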
See also: Qwen3-Coder-30B-A3B-Instruct benchmarks · Codestral-22B-v0.1 benchmarks · All hardware · All models · Methodology