← Back to user leaderboard
User profile
tensor-tomas
19 runsFirst seen 2026-04-26Avg 65.5
Total tokens104.5KAcross every task this user has run
Avg latency895msPer task, across all submissions
Tasks run47519 submissions x ~25 tasks
Rigs used16Distinct hardware tags
Category signature
Average score per category across all 19 runs.
Code
65.8
Reason
66.1
Write
65.8
Tool Use
64.5
RAG
66.5
Speed
64.3
Hardware mix
Rigs this user benchmarked on.
m3-pro-36gb2 (11%)
m2-pro-16gb2 (11%)
rtx-4080-16gb2 (11%)
h100-80gb1 (5%)
4x-rtx-30901 (5%)
dgx-h1001 (5%)
a100-80gb1 (5%)
rtx-4090-24gb1 (5%)
b200-192gb1 (5%)
rtx-4070-12gb1 (5%)
a100-40gb1 (5%)
ryzen-7950x-cpu-only1 (5%)
rtx-3080-10gb1 (5%)
macbook-pro-m3-pro-18gb1 (5%)
snapdragon-x-elite-32gb1 (5%)
m2-air-16gb1 (5%)
Provider mix
Where they spend their tokens.
meta3 (16%)
deepseek2 (11%)
cohere2 (11%)
community2 (11%)
yi2 (11%)
huggingface2 (11%)
databricks1 (5%)
zhipu1 (5%)
bigcode1 (5%)
alibaba1 (5%)
google1 (5%)
mistral1 (5%)
Models tried
Best score per model. Click a model to see its full page.
| # | Model | Best Score | Tier |
|---|---|---|---|
| 1 | Llama 3.3 70B Instruct | 83.3 | MAINLINE |
| 2 | DeepSeek V3 671B-A37B | 83.2 | MAINLINE |
| 3 | Command A | 80.3 | MAINLINE |
| 4 | DBRX Instruct 132B-MoE | 78.8 | MAINLINE |
| 5 | DeepSeek R1 Distill Qwen 32B | 78.5 | MAINLINE |
| 6 | GLM 4 Plus | 75.6 | MAINLINE |
| 7 | Llama 4 8B Instruct | 70.6 | FEEDER |
| 8 | Aya 23 35B | 69.7 | FEEDER |
| 9 | StarCoder2 15B | 69.4 | FEEDER |
| 10 | Qwen 2.5 VL 7B | 66.2 | FEEDER |
| 11 | LLaVA OneVision 7B | 66.2 | FEEDER |
| 12 | Yi Coder 9B | 65.9 | FEEDER |
| 13 | Yi 1.5 9B Chat | 65.7 | FEEDER |
| 14 | Llama 3.1 8B Instruct | 63.8 | FEEDER |
| 15 | CodeGemma 7B | 63.6 | FEEDER |
| 16 | Mistral 7B Instruct v0.3 | 62.7 | FEEDER |
| 17 | SmolLM 1.7B | 36.5 | DRIP |
| 18 | TinyLlama 1.1B Chat | 32.0 | DRIP |
All submissions
Every run, ordered by score.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 | Llama 3.3 70B Instruct | 83.3 | MAINLINE |
| 2 | DeepSeek V3 671B-A37B | 83.2 | MAINLINE |
| 3 | Command A | 80.3 | MAINLINE |
| 4 | DBRX Instruct 132B-MoE | 78.8 | MAINLINE |
| 5 | DeepSeek R1 Distill Qwen 32B | 78.5 | MAINLINE |
| 6 | GLM 4 Plus | 75.6 | MAINLINE |
| 7 | Llama 4 8B Instruct | 70.6 | FEEDER |
| 8 | Aya 23 35B | 69.7 | FEEDER |
| 9 | StarCoder2 15B | 69.4 | FEEDER |
| 10 | Qwen 2.5 VL 7B | 66.2 | FEEDER |
| 11 | LLaVA OneVision 7B | 66.2 | FEEDER |
| 12 | Yi Coder 9B | 65.9 | FEEDER |
| 13 | Yi 1.5 9B Chat | 65.7 | FEEDER |
| 14 | Llama 3.1 8B Instruct | 63.8 | FEEDER |
| 15 | CodeGemma 7B | 63.6 | FEEDER |
| 16 | Mistral 7B Instruct v0.3 | 62.7 | FEEDER |
| 17 | SmolLM 1.7B | 36.5 | DRIP |
| 18 | SmolLM 1.7B | 33.4 | DRIP |
| 19 | TinyLlama 1.1B Chat | 32.0 | DRIP |