RTX 5090 vs RTX 3090
Signed, community-submitted decode tok/s, prefill, and TTFT for RTX 5090 vs RTX 3090 — every number links to the run it came from.
Verdict
RTX 5090 and RTX 3090 both have submitted runs, but no single model has been measured on both yet — so there is no apples-to-apples row. Each side's measured decode tok/s is listed separately below (peaks of 356 and 189 tok/s). Submit a shared workload to turn this into a direct comparison.
RTX 5090 benchmarks — no shared model with RTX 3090 yet
RTX 3090 benchmarks — no shared model with RTX 5090 yet
| Model | decode tok/s | Workload | Run |
|---|---|---|---|
| deepseek-coder-v2 | 189.5tok/s | chat-short | r_o2-1w665rtq |
| qwen2.5-coder | 139.2tok/s | concurrent-decode | r_pb0jpnbujji |
| llama3.1 | 136.2tok/s | chat-short | r_9hurqggbshk |
See also: RTX 5090 benchmarks · RTX 3090 benchmarks · All hardware · All models · Methodology