xy
1 workload result across 1 hardware configuration.
Local runs (1 run)
Runs from contributors' own machines via MLX, llama.cpp, vLLM, exllamav2, or ollama. Signed on the submitter's hardware.
Pentest-Bench
| Workload | Backend | Quant | Decode (tok/s) | Prefill (tok/s) | TTFT (ms) | Run |
|---|---|---|---|---|---|---|
| chat-short | llama.cpp@b9999 | Q4_K_M | 42.00 | 100.0 | 50.0 | r_2nqkbpdq-dk |
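
To give a feel for what these figures mean in practice, the sketch below turns the chat-short row (42.00 tok/s decode, 50.0 ms TTFT) into a rough end-to-end latency estimate. The function name `estimate_latency_s` and the model behind it are assumptions for illustration, not part of the benchmark: it assumes TTFT already covers the first generated token and that decode speed stays constant for the rest of the response.

```python
def estimate_latency_s(ttft_ms: float, decode_tok_s: float, new_tokens: int) -> float:
    """Rough wall-clock estimate for generating `new_tokens` tokens.

    Assumption (not from the benchmark): TTFT includes the first token,
    and the remaining tokens arrive at the steady decode rate.
    """
    if new_tokens < 1:
        raise ValueError("new_tokens must be at least 1")
    if decode_tok_s <= 0:
        raise ValueError("decode_tok_s must be positive")
    return ttft_ms / 1000.0 + (new_tokens - 1) / decode_tok_s


# Figures taken from the chat-short row above.
TTFT_MS = 50.0
DECODE_TOK_S = 42.00

# Estimated seconds to produce a 43-token reply under the stated assumptions.
estimate = estimate_latency_s(TTFT_MS, DECODE_TOK_S, new_tokens=43)
print(f"{estimate:.2f} s")
```

Real responses will deviate from this estimate, since decode throughput typically varies with context length and system load.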