Skip to content
llm-speed

RTX 5090 vs RTX 3090

Signed, community-submitted decode tok/s, prefill, and TTFT for RTX 5090 vs RTX 3090 — every number links to the run it came from.

Verdict

RTX 5090 and RTX 3090 both have submitted runs, but no single model has been measured on both yet — so there is no apples-to-apples row. Each side's measured decode tok/s is listed separately below (peaks of 356 and 189 tok/s). Submit a shared workload to turn this into a direct comparison.

hardware
RTX 5090
NVIDIA Ada/Blackwell · 32 GB VRAM
View RTX 5090 page →
hardware
RTX 3090
NVIDIA Ampere · 24 GB VRAM
View RTX 3090 page →

RTX 5090 benchmarks — no shared model with RTX 3090 yet

RTX 3090 benchmarks — no shared model with RTX 5090 yet

Modeldecode tok/sWorkloadRun
deepseek-coder-v2189.5tok/schat-shortr_o2-1w665rtq
qwen2.5-coder139.2tok/sconcurrent-decoder_pb0jpnbujji
llama3.1136.2tok/schat-shortr_9hurqggbshk

See also: RTX 5090 benchmarks · RTX 3090 benchmarks · All hardware · All models · Methodology