DeepSeek-V3 vs Llama-3.1-405B-Instruct
Side-by-side decode tok/s, prefill tok/s, and TTFT for DeepSeek-V3 and Llama-3.1-405B-Instruct, sourced from community-submitted runs of the llm-speed suite. Every number on this page links back to the run it came from.
No overlapping DeepSeek-V3 ↔ Llama-3.1-405B-Instruct benchmarks yet.
We don't have a submitted run that covers both sides of this comparison yet. Run the suite on either side to populate this page:
$ pipx install llm-speed && llm-speed bench
See also: DeepSeek-V3 benchmarks · Llama-3.1-405B-Instruct benchmarks · All hardware · All models · Methodology