llm-speed

LLM speed benchmarks: community-verified tok/s for every model + hardware combo

The benchmark suite for local + hosted LLM inference. Run it. See your tok/s. Compare your rig.

$ pipx install llm-speed && llm-speed bench

Runs in about a minute. Auto-detects your hardware and backends. Open source.

See a signed 287 tok/s run
58 signed runs · 6 hosts · 42 (model × hardware) cells · Apache-2.0
Fastest local signed run: 192 tok/s, mlx-community/stable-code-instruct-3b-4bit on M3 Ultra

Latest LLM benchmarks

20 runs