← Back to user leaderboard
User profile
gpu-shopper
12 runsFirst seen 2026-04-10Avg 64.3
Total tokens67.4KAcross every task this user has run
Avg latency758msPer task, across all submissions
Tasks run30012 submissions x ~25 tasks
Rigs used10Distinct hardware tags
Category signature
Average score per category across all 12 runs.
Code
64.5
Reason
62.6
Write
65.8
Tool Use
65.1
RAG
63.9
Speed
64.5
Hardware mix
Rigs this user benchmarked on.
macbook-pro-m3-pro-18gb3 (25%)
h200-141gb1 (8%)
b200-192gb1 (8%)
m3-max-64gb1 (8%)
m3-pro-36gb1 (8%)
ryzen-7950x-cpu-only1 (8%)
a100-40gb1 (8%)
m1-air-8gb1 (8%)
rtx-4070-12gb1 (8%)
rtx-3060-12gb1 (8%)
Provider mix
Where they spend their tokens.
alibaba2 (17%)
microsoft2 (17%)
cohere2 (17%)
cognitivecomputations1 (8%)
zyphra1 (8%)
upstage1 (8%)
google1 (8%)
community1 (8%)
bigcode1 (8%)
Models tried
Best score per model. Click a model to see its full page.
| # | Model | Best Score | Tier |
|---|---|---|---|
| 1 | Qwen 2.5 72B Instruct | 85.1 | MAINLINE |
| 2 | WizardLM 2 8x22B | 79.5 | MAINLINE |
| 3 | Aya 23 35B | 74.1 | FEEDER |
| 4 | Dolphin 3.0 Llama 3.1 8B | 68.5 | FEEDER |
| 5 | Phi 4 14B | 67.9 | FEEDER |
| 6 | Zamba 2 7B Instruct | 66.7 | FEEDER |
| 7 | SOLAR 10.7B Instruct | 66.6 | FEEDER |
| 8 | Command R | 65.5 | FEEDER |
| 9 | Gemma 3 4B IT | 58.8 | TAP |
| 10 | LLaVA 1.6 Mistral 7B | 56.0 | TAP |
| 11 | StarCoder2 3B | 49.5 | TAP |
| 12 | Qwen 2.5 0.5B Instruct | 33.9 | DRIP |
All submissions
Every run, ordered by score.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | Qwen 2.5 72B Instruct | 85.1 | MAINLINE |
| 2 | WizardLM 2 8x22B | 79.5 | MAINLINE |
| 3 | Aya 23 35B | 74.1 | FEEDER |
| 4 | Dolphin 3.0 Llama 3.1 8B | 68.5 | FEEDER |
| 5 | Phi 4 14B | 67.9 | FEEDER |
| 6 | Zamba 2 7B Instruct | 66.7 | FEEDER |
| 7 | SOLAR 10.7B Instruct | 66.6 | FEEDER |
| 8 | Command R | 65.5 | FEEDER |
| 9 | Gemma 3 4B IT | 58.8 | TAP |
| 10 | LLaVA 1.6 Mistral 7B | 56.0 | TAP |
| 11 | StarCoder2 3B | 49.5 | TAP |
| 12 | Qwen 2.5 0.5B Instruct | 33.9 | DRIP |