Leaderboard
On-device LLM performance rankings powered by Glicko-2
iPhone 17 Air
iOSRank
#11
Rating
1,972
±21 RD
Win Rate
95.6%
Conservative Rating
1,929
TG Rating
1,972
PP Rating
1,961
Matches
590
Record
564W – 26L
Models Tested
| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best | PP Best | Runs |
|---|---|---|---|---|---|
| Qwen3-0.6B-Q4_K_M | 115.26 | 1541.37 | 115.26 | 1541.37 | 1 |
| Qwen3-0.6B-Q8_0 | 82.82 | 1551.99 | 82.82 | 1551.99 | 1 |
| qwen2.5-1.5b-instruct-q8_0 | 36.36 | 598.48 | 36.40 | 599.57 | 3 |
| gemma-3n-E2B-it-Q4_K_M | 31.29 | 234.60 | 31.29 | 234.60 | 1 |
| gemma-2-2b-it-Q6_K | 26.26 | 349.87 | 26.26 | 349.87 | 1 |
| qwen2.5-3b-instruct-q5_k_m | 24.77 | 255.12 | 25.05 | 258.20 | 4 |
| moondream2-text-model-f16_ct-vicuna | 22.28 | 645.32 | 22.28 | 645.32 | 1 |
| LFM2.5-VL-1.6B-BF16 | 22.19 | 732.88 | 22.19 | 732.88 | 1 |
| Qwen3-4B-Instruct-2507-Q4_K_M | 21.69 | 202.42 | 21.69 | 202.42 | 1 |
| Qwen3VL-4B-Instruct-Q4_K_M | 21.51 | 201.56 | 21.51 | 201.56 | 1 |
| Qwen3.5-4B-Q4_K_M | 12.07 | 159.98 | 12.07 | 159.98 | 1 |
| MechaEpstein-8000.Q4_K_M | 10.34 | 102.50 | 10.34 | 102.50 | 1 |
| Qwen3.5-4B-Q6_K | 9.23 | 146.23 | 9.23 | 146.23 | 1 |
| Qwen2.5-7B-Instruct-Q5_K_M | 8.90 | 93.18 | 8.90 | 93.18 | 1 |
| Qwen3-4B-Instruct-2507-IQ4_XS | 8.73 | 12.94 | 8.73 | 12.94 | 1 |
| Qwen3.5-9B-UD-IQ2_M | 8.67 | 93.88 | 8.67 | 93.88 | 1 |
| Qwen3.5-4B-Q6_K | 8.35 | 130.93 | 8.35 | 130.93 | 1 |
| Meta-Llama-3.1-8B-Instruct-Q5_K_M | 7.63 | 80.35 | 7.63 | 80.35 | 1 |
Head-to-Head Record
1–50 of 144 rows
1 / 3
Performance by App Version
ImprovedRegressed