Leaderboard
On-device LLM performance rankings powered by Glicko-2
iPad Pro 11 inch 4th Gen
iOSRank
#4
Rating
2,003
±18 RD
Win Rate
98.6%
Conservative Rating
1,967
TG Rating
2,003
PP Rating
2,002
Matches
792
Record
781W – 11L
Models Tested
| Model | TG Median (tok/s) | PP Median (tok/s) | TG Best | PP Best | Runs |
|---|---|---|---|---|---|
| Meta-Llama-3-8B-Instruct.Q4_K_M | 122.53 | 1162.42 | 122.53 | 1162.42 | 1 |
| Qwen1.5-0.5B-Chat-Q5_K_M | 81.39 | 214.68 | 81.39 | 214.68 | 1 |
| granite-3.1-3b-a800m-instruct-Q4_K_M | 67.70 | 27.26 | 67.70 | 27.26 | 1 |
| gemma-3-1b-it-Q8_0 | 66.34 | 1593.07 | 66.34 | 1593.07 | 1 |
| llama-3.2-1b-instruct-q8_0 | 53.40 | 679.55 | 60.73 | 1255.98 | 2 |
| qwen2.5-1.5b-instruct-q8_0 | 41.14 | 482.79 | 48.48 | 880.52 | 2 |
| tinyswallow-1.5b-instruct-q5_k_m | 39.09 | 60.29 | 39.09 | 60.29 | 1 |
| gemma-2-2b-it-Q6_K | 35.81 | 509.79 | 36.04 | 521.91 | 2 |
| tinyswallow-1.5b-instruct-q8_0 | 34.08 | 90.82 | 34.08 | 90.82 | 1 |
| gemma-3-4b-it-IQ4_NL | 34.05 | 401.94 | 34.05 | 401.94 | 1 |
| gemma-3-4b-it-Q4_K_M | 30.33 | 343.43 | 30.33 | 343.43 | 1 |
| translategemma-4b-it.i1-Q6_K | 24.96 | 303.12 | 24.96 | 303.12 | 1 |
| Phi-3.5-mini-instruct.Q4_K_M | 21.87 | 163.07 | 31.75 | 308.70 | 2 |
| DeepSeek-R1-Distill-Qwen-1.5B-IQ2_M | 20.25 | 29.90 | 20.25 | 29.90 | 1 |
| DeepSeek-R1-Distill-Qwen-7B-abliterated.Q4_K_S | 19.91 | 176.60 | 19.91 | 176.60 | 1 |
| sarashina2.2-3b-instruct-v0.1-Q8_0 | 17.03 | 41.24 | 17.03 | 41.24 | 1 |
| gemma-3-4b-it.Q8_0 | 13.37 | 33.22 | 13.48 | 33.29 | 2 |
| Qwen3-4B-Q8_0 | 11.25 | 25.74 | 11.25 | 25.74 | 1 |
| DeepSeek-R1-Distill-Qwen-7B-IQ2_M | 9.86 | 89.68 | 16.04 | 175.63 | 4 |
| gemma-2-2b-it.IQ1_M | 9.80 | 15.03 | 9.80 | 15.03 | 1 |
| Llama-3-ELYZA-JP-8B-IQ1_S | 4.70 | 6.47 | 4.70 | 6.47 | 1 |
| DeepSeek-R1-Distill-Llama-8B-Q2_K_L | 1.38 | 5.16 | 1.38 | 5.16 | 1 |
Head-to-Head Record
1–50 of 205 rows
1 / 5
Performance by App Version
ImprovedRegressed