Leaderboard

On-device LLM performance rankings powered by Glicko-2

iPhone 11

iOS

Rank

#100

Rating

1,677

±40 RD

Win Rate

67.3%

Conservative Rating

1,597

TG Rating

1,658

PP Rating

1,770

Matches

165

Record

111W – 54L

Models Tested

ModelTG Median (tok/s)PP Median (tok/s)TG BestPP BestRuns
SmolLM2-135M-Instruct-Q3_K_M99.50220.3999.50220.391
SmolLM2-135M-Instruct-Q4_K_M93.32214.3693.32214.361
SmolLM2-135M-Instruct-Q5_K_M84.77195.0584.77195.051
SmolLM2-135M-Instruct-Q2_K83.34207.7383.34207.731
SmolLM2-135M-Instruct-Q8_062.4691.4862.4691.481
SmolLM2-135M-Instruct-Q6_K56.8986.5756.8986.571
SmolLM2-135M-Instruct-F1649.18182.8449.18182.841
gemma-3-270m-it-F1640.33164.6948.30173.502
Qwen2-500M-Instruct-Q5_K_M28.57291.3128.57291.311
Qwen3-0.6B.Q4_K_M24.75118.4824.75118.481
tinyllama-1.1b-chat-v1.0.Q2_K17.0714.6317.0714.631
llama-3.2-1b-instruct-q8_010.6416.4213.2319.852
google_gemma-3-270m-it-bf169.7628.639.7628.631
HY-MT1.5-1.8B-Q4_K_M9.5051.739.5051.731
SmolLM2-1.7B-Instruct-Q8_08.9412.208.9412.201
qwen2.5-1.5b-instruct-q8_08.2829.3117.4588.755
agentica-org_DeepScaleR-1.5B-Preview-Q5_K_L6.9538.426.9538.421
gemmasutra-mini-2b-v1-iq4_nl-imat4.6711.294.6711.291
Qwen3-4B-presinq-Q3_K_S1.2216.491.2216.491

Head-to-Head Record

Performance by App Version

ImprovedRegressed

Compare With