← Back to user leaderboard
User profile
mlx-mike
13 runsFirst seen 2026-04-13Avg 70.9
Total tokens70.9KAcross every task this user has run
Avg latency1029msPer task, across all submissions
Tasks run32513 submissions x ~25 tasks
Rigs used11Distinct hardware tags
Category signature
Average score per category across all 13 runs.
Code
71.2
Reason
71.1
Write
70.6
Tool Use
71.6
RAG
70.9
Speed
69.2
Hardware mix
Rigs this user benchmarked on.
m3-pro-36gb2 (15%)
rtx-4070-12gb2 (15%)
h200-141gb1 (8%)
b200-192gb1 (8%)
m3-max-128gb1 (8%)
rtx-3060-12gb1 (8%)
a100-40gb1 (8%)
m3-max-64gb1 (8%)
ryzen-7950x-rtx-30901 (8%)
rtx-4090-24gb1 (8%)
ryzen-7950x-cpu-only1 (8%)
Provider mix
Where they spend their tokens.
alibaba3 (23%)
microsoft2 (15%)
mistral2 (15%)
yi2 (15%)
nous1 (8%)
google1 (8%)
upstage1 (8%)
meta1 (8%)
Models tried
Best score per model. Click a model to see its full page.
| # | Model | Best Score | Tier |
|---|---|---|---|
| 1 | WizardLM 2 8x22B | 77.8 | MAINLINE |
| 2 | Mixtral 8x22B Instruct | 77.4 | MAINLINE |
| 3 | Codestral 22B | 76.9 | MAINLINE |
| 4 | Qwen 3 14B Instruct | 74.8 | FEEDER |
| 5 | Yi 1.5 34B Chat | 72.7 | FEEDER |
| 6 | Hermes 3 Llama 3.1 8B | 69.7 | FEEDER |
| 7 | Qwen 3 8B Instruct | 69.5 | FEEDER |
| 8 | Gemma 2 27B IT | 69.0 | FEEDER |
| 9 | SOLAR 10.7B Instruct | 66.7 | FEEDER |
| 10 | Qwen 2.5 7B Instruct | 66.3 | FEEDER |
| 11 | Code Llama 34B Instruct | 64.6 | FEEDER |
| 12 | Phi 3 Small 7B | 63.9 | FEEDER |
All submissions
Every run, ordered by score.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | WizardLM 2 8x22B | 77.8 | MAINLINE |
| 2 | Mixtral 8x22B Instruct | 77.4 | MAINLINE |
| 3 | Codestral 22B | 76.9 | MAINLINE |
| 4 | Qwen 3 14B Instruct | 74.8 | FEEDER |
| 5 | Yi 1.5 34B Chat | 72.7 | FEEDER |
| 6 | Yi 1.5 34B Chat | 72.2 | FEEDER |
| 7 | Hermes 3 Llama 3.1 8B | 69.7 | FEEDER |
| 8 | Qwen 3 8B Instruct | 69.5 | FEEDER |
| 9 | Gemma 2 27B IT | 69.0 | FEEDER |
| 10 | SOLAR 10.7B Instruct | 66.7 | FEEDER |
| 11 | Qwen 2.5 7B Instruct | 66.3 | FEEDER |
| 12 | Code Llama 34B Instruct | 64.6 | FEEDER |
| 13 | Phi 3 Small 7B | 63.9 | FEEDER |