← Back to user leaderboard
User profile
inference-monk
15 runsFirst seen 2026-04-14Avg 68.2
Total tokens78.8KAcross every task this user has run
Avg latency981msPer task, across all submissions
Tasks run37515 submissions x ~25 tasks
Rigs used11Distinct hardware tags
Category signature
Average score per category across all 15 runs.
Code
69.3
Reason
67.7
Write
68.1
Tool Use
66.6
RAG
69.7
Speed
67.4
Hardware mix
Rigs this user benchmarked on.
rtx-4070-12gb4 (27%)
m3-pro-36gb2 (13%)
cloud-api1 (7%)
m3-max-64gb1 (7%)
h200-141gb1 (7%)
m3-ultra-256gb1 (7%)
rtx-4080-16gb-offload1 (7%)
rtx-4080-16gb1 (7%)
rtx-3060-12gb1 (7%)
m2-air-16gb1 (7%)
macbook-pro-m3-pro-18gb1 (7%)
Provider mix
Where they spend their tokens.
deepseek2 (13%)
alibaba2 (13%)
nous2 (13%)
mistral2 (13%)
google2 (13%)
internlm1 (7%)
cognitivecomputations1 (7%)
community1 (7%)
cohere1 (7%)
bigcode1 (7%)
Models tried
Best score per model. Click a model to see its full page.
| # | Model | Best Score | Tier |
|---|---|---|---|
| 1 | DeepSeek Coder V2 236B | 84.7 | MAINLINE |
| 2 | DeepSeek R1 Distill Qwen 32B | 84.0 | MAINLINE |
| 3 | Qwen 3 235B-A22B MoE | 83.0 | MAINLINE |
| 4 | Hermes 3 Llama 3.1 70B | 77.2 | MAINLINE |
| 5 | Mistral Small 24B Instruct | 73.2 | FEEDER |
| 6 | Gemma 3 12B IT | 73.2 | FEEDER |
| 7 | Mistral Nemo 12B Instruct | 71.0 | FEEDER |
| 8 | InternLM 2.5 7B Chat | 66.6 | FEEDER |
| 9 | Dolphin 3.0 Llama 3.1 8B | 65.2 | FEEDER |
| 10 | Hermes 3 Llama 3.1 8B | 64.1 | FEEDER |
| 11 | LLaVA OneVision 7B | 63.7 | FEEDER |
| 12 | Aya 23 8B | 61.3 | FEEDER |
| 13 | StarCoder2 7B | 56.7 | TAP |
| 14 | Qwen 2.5 Coder 1.5B | 55.0 | TAP |
| 15 | Gemma 2 2B IT | 44.0 | TAP |
All submissions
Every run, ordered by score.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | DeepSeek Coder V2 236B | 84.7 | MAINLINE |
| 2 | DeepSeek R1 Distill Qwen 32B | 84.0 | MAINLINE |
| 3 | Qwen 3 235B-A22B MoE | 83.0 | MAINLINE |
| 4 | Hermes 3 Llama 3.1 70B | 77.2 | MAINLINE |
| 5 | Mistral Small 24B Instruct | 73.2 | FEEDER |
| 6 | Gemma 3 12B IT | 73.2 | FEEDER |
| 7 | Mistral Nemo 12B Instruct | 71.0 | FEEDER |
| 8 | InternLM 2.5 7B Chat | 66.6 | FEEDER |
| 9 | Dolphin 3.0 Llama 3.1 8B | 65.2 | FEEDER |
| 10 | Hermes 3 Llama 3.1 8B | 64.1 | FEEDER |
| 11 | LLaVA OneVision 7B | 63.7 | FEEDER |
| 12 | Aya 23 8B | 61.3 | FEEDER |
| 13 | StarCoder2 7B | 56.7 | TAP |
| 14 | Qwen 2.5 Coder 1.5B | 55.0 | TAP |
| 15 | Gemma 2 2B IT | 44.0 | TAP |