llm-speed

Llama-3.3-70B-Instruct vs Llama-4-Scout

Side-by-side decode tok/s, prefill tok/s, and TTFT for Llama-3.3-70B-Instruct and Llama-4-Scout, sourced from community-submitted runs of the llm-speed suite. Every number on this page links back to the run it came from.
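For readers unfamiliar with the three metrics, the sketch below shows one common way they are derived from per-request timestamps. It is an illustration only: the `RunTiming` record, its field names, and the exact formulas are assumptions made here, not the llm-speed suite's actual schema or definitions, which may differ (for example in whether the first output token counts toward decode throughput).

```python
from dataclasses import dataclass


@dataclass
class RunTiming:
    """Hypothetical timing record for one request (not the llm-speed schema)."""
    prompt_tokens: int      # tokens in the input prompt
    output_tokens: int      # tokens generated, including the first one
    request_start_s: float  # when the request was sent
    first_token_s: float    # when the first output token arrived
    last_token_s: float     # when the last output token arrived


def ttft_s(t: RunTiming) -> float:
    """Time to first token: delay between sending the request and the first token."""
    return t.first_token_s - t.request_start_s


def prefill_tok_per_s(t: RunTiming) -> float:
    """Prompt tokens processed per second, assuming TTFT is dominated by prefill."""
    elapsed = ttft_s(t)
    if elapsed <= 0:
        raise ValueError("first token must arrive after the request starts")
    return t.prompt_tokens / elapsed


def decode_tok_per_s(t: RunTiming) -> float:
    """Tokens generated per second after the first token has arrived."""
    elapsed = t.last_token_s - t.first_token_s
    if elapsed <= 0 or t.output_tokens < 2:
        raise ValueError("need at least two output tokens over a positive interval")
    # The first token is attributed to prefill, so it is excluded here.
    return (t.output_tokens - 1) / elapsed


if __name__ == "__main__":
    # Made-up timings, purely to exercise the functions.
    sample = RunTiming(prompt_tokens=1000, output_tokens=101,
                       request_start_s=0.0, first_token_s=0.5, last_token_s=5.5)
    print(ttft_s(sample), prefill_tok_per_s(sample), decode_tok_per_s(sample))
```

The sample values are invented for the demonstration and are not benchmark results for either model.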

Llama-3.3-70B-Instruct
Meta · 70B

Llama-4-Scout
Meta · 109B-A17B

No overlapping Llama-3.3-70B-Instruct ↔ Llama-4-Scout benchmarks yet.

No submitted run covers both models in this comparison yet. Run the suite against either model to help populate this page:

$ pipx install llm-speed && llm-speed bench
