Skip to content
llm-speed

RTX 4090 vs M3 Max

Side-by-side decode tok/s, prefill tok/s, and TTFT for RTX 4090 and M3 Max, sourced from community-submitted runs of the llm-speed suite. Every number on this page links back to the run it came from.

hardware
RTX 4090
NVIDIA Ada/Blackwell · 24 GB VRAM
View RTX 4090 page →
hardware
M3 Max
Apple M3 · up to 128 GB unified memory
View M3 Max page →

No overlapping RTX 4090 ↔ M3 Max benchmarks yet.

We don't have a submitted run that covers both sides of this comparison yet. Run the suite on either side to populate this page:

$ pipx install llm-speed && llm-speed bench

See also: RTX 4090 benchmarks · M3 Max benchmarks · All hardware · All models · Methodology