Leaderboard

On-device LLM performance rankings powered by Glicko-2

iPhone 16

iOS

Rank

#17

Rating

1,931

±14 RD

Win Rate

91.6%

Conservative Rating

1,903

TG Rating

1,924

PP Rating

1,928

Matches

1,391

Record

1274W – 117L

Models Tested

ModelTG Median (tok/s)PP Median (tok/s)TG BestPP BestRuns
google_gemma-3n-E2B-it-Q8_0181.182897.66181.182897.661
gemma-3n-E2B-it-Q8_0166.353208.17166.353208.171
Qwen3-0.6B-Q4_K_M96.291112.7096.291112.701
chatgpt-5-q8_073.58177.5073.58177.501
LFM2.5-1.2B-Instruct-Q4_K_M65.31662.6965.31662.691
DeepSeek-R1-Distill-Llama-8B-Q4_K_M63.50528.4763.86623.273
Qwen3-0.6B-Q8_058.841083.5858.841083.581
gemma-3-1B-it-QAT-Q4_058.05841.0158.05841.011
LFM2.5-1.2B-Thinking-Q5_K_M52.36535.5252.93552.702
Qwen2-VL-2B-Instruct-Q4_K_L37.95420.4037.95420.401
DeepSeek-R1-Distill-Qwen-1.5B-Q4_K_M32.16240.0938.41438.732
DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M31.65395.4531.65395.451
qwen2.5-1.5b-thinking-q8_030.78470.9230.78470.921
qwen2.5-1.5b-instruct-q8_029.75473.1930.90514.213
qwen2.5-1.5b-instruct-q4_k_m29.2844.8529.2844.851
DeepSeek-R1-Distill-Qwen-1.5B-Q8_028.07417.9728.07417.971
DeepSeek-R1-Distill-Qwen-1.5B-uncensored.Q8_028.03464.9428.03464.941
llm-jp-3.1-1.8b-instruct4-Q8_027.45431.5727.45431.571
Qwen3.5-0.8B-Q8_025.52582.9125.52582.911
DeepSeek-R1-Distill-Qwen-1.5B-Q2_K25.4733.0625.4733.061
gemma-3n-E2B-it-Q4_K_M24.06223.5824.06223.581
DeepSeek-R1-Distill-Qwen-1.5B-Q2_K23.0736.0723.0736.071
Llama-3.2-3B-Instruct.Q4_K_M22.37214.1722.37214.171
Llama-3.2-3B-Instruct-Q4_K_L22.24195.7622.24195.761
Qwen3-1.7B.Q4_K_M22.1336.9422.1336.941
LFM2.5-1.2B-Instruct-BF1621.98673.0721.98673.071
llama-3.2-1b-instruct-q8_021.96264.3421.96264.341
gemma-2-2b-it-Q6_K20.95273.3921.55288.605
DeepSeek-R1-Distill-Qwen-1.5B-Q5_K_M20.0430.7020.0430.701
Llama-3.2-1B-Instruct-BF1620.04712.0520.04712.051
qwen2.5-3b-instruct-q5_k_m18.32185.1719.70188.462
Phi-3.5-mini-instruct.Q4_K_M18.32155.7218.32155.721
gemma-3-4b-it-IQ4_NL17.95188.3117.95188.311
gemma-3-4B-it-QAT-Q4_017.86176.6017.86176.601
Qwen3.5-2B-Q8_016.94340.3116.94340.311
Llama-3.2-3B-Instruct-Q6_K14.89169.0918.53214.424
Gemmasutra-Mini-2B-v1-Q6_K14.69137.6417.16253.942
Qwen_Qwen3-4B-Instruct-2507-Q5_K_L13.80150.4113.80150.411
DeepSeek-R1-ReDistill-Qwen-1.5B-v1.0-IQ3_XS13.6013.6814.0814.962
gemma-3n-E4B-it-Q4_K_M12.52112.8612.52112.861
Qwen3-4B-IQ4_NL12.17102.7912.17102.791
gemma-3-4b-it-Q5_K_M8.7313.958.7313.951
Qwen3.5-4B-Uncensored-HauhauCS-Aggressive-Q4_K_M8.73113.859.12115.972
Qwen3.5-4B-IQ4_NL8.58115.169.41122.502
DeepSeek-R1-Distill-Qwen-7B-IQ2_M8.4066.0711.2189.332
gemma-3-4b-it-q4_0_s8.1916.128.1916.121
gemma-3-4b-it-Q8_08.1617.218.1617.211
Qwen3-4B.Q6_K7.5011.207.5011.201
DeepSeek-R1-Distill-Qwen-7B-Q4_K_S7.1358.907.1358.901
Qwen3-1.7B.fp166.4519.676.4519.671

150 of 53 rows

1 / 2

Head-to-Head Record

Performance by App Version

ImprovedRegressed

Compare With