
Qwen 3 8B — Apple Silicon Benchmarks

Measured inference speed for Qwen 3 8B on one Apple Silicon chip, in tokens per second, at the Q4_K_M quantization level. Real runs, not estimates.

Quantizations measured: Q4_K_M

Benchmark rows: 1
Chip tiers covered: 1
Fastest avg tok/s: 63.1 (M4 Max (128 GB))
Minimum RAM observed

Benchmark results for Qwen 3 8B

Rows are sorted by avg tok/s, descending. Click the source badge to see the original measurement page.

| Chip | Quant | RAM req. | Context | Avg tok/s | Prompt tok/s | Runtime | Source |
|---|---|---|---|---|---|---|---|
| M4 Max (128 GB) | Q4_K_M | - | 10k | 63.1 tok/s | - | LM Studio | ref |
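
The summary figures and the sort order used on this page can be reproduced from the rows themselves. The following is a minimal sketch, not the site's actual code: the field names (`chip`, `quant`, `avg_tok_s`) and the helper functions are hypothetical, and the schema of the published data files may differ.

```python
from typing import Dict, List, Union

# Hypothetical row shape; the real dataset may use different keys.
Row = Dict[str, Union[str, float]]

# The single row shown in the table above.
ROWS: List[Row] = [
    {"chip": "M4 Max (128 GB)", "quant": "Q4_K_M", "avg_tok_s": 63.1},
]


def sort_by_speed(rows: List[Row]) -> List[Row]:
    """Return a new list of rows ordered by avg tok/s, fastest first."""
    return sorted(rows, key=lambda r: r["avg_tok_s"], reverse=True)


def summarize(rows: List[Row]) -> Dict[str, Union[str, float, int]]:
    """Compute the headline stats: row count, distinct chips, fastest row."""
    if not rows:
        raise ValueError("no benchmark rows to summarize")
    ranked = sort_by_speed(rows)
    fastest = ranked[0]
    return {
        "benchmark_rows": len(ranked),
        "chip_tiers": len({r["chip"] for r in ranked}),
        "fastest_avg_tok_s": fastest["avg_tok_s"],
        "fastest_chip": fastest["chip"],
    }


if __name__ == "__main__":
    print(summarize(ROWS))
```

With more rows in the list, the same two functions would yield the descending order and the "fastest" figure without any other change.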

benchmarks.json — full dataset  ·  models.json — model summaries  ·  benchmarks.csv — CSV export
