Skip to content
llm-speed

RTX 5090 vs RTX 4090

A single shareable card for the RTX 5090 vs RTX 4090 matchup. Numbers are the best decode tok/s submitted on the llm-speed suite — every side links back to the run it came from.

RTX 5090 vs RTX 4090

Best decode tok/s across every model submitted on each rig.

NVIDIA Ada/Blackwell · 32 GB VRAM
356tok/s
decode (best submitted run)
source: r_q9f15lz6831
NVIDIA Ada/Blackwell · 24 GB VRAM
161tok/s
decode (best submitted run)
source: r_mv8n8k9wu1e
RTX 5090 ← faster +195 tok/s (2.21× faster)

Need the long-form table? Open the RTX 5090 vs RTX 4090 comparison for every overlapping (model × hardware) row, source runs, and methodology.