User profile

bench-rat

12 runsFirst seen 2026-04-12Avg 67.6

Best PipelineScore

84.5MAINLINE

on DeepSeek Coder V2 236B

Best run beats 96% of all 428 submissionsBest rig 2x-rtx-4090 ranks #12 of 63 rigs

Total tokens63.6KAcross every task this user has run

Avg latency1455msPer task, across all submissions

Tasks run30012 submissions x ~34 tasks

Rigs used11Distinct hardware tags

Category signature

Average score per category across all 12 runs.

Code

68.8

Reason

67.9

Tool Use

68.7

RAG

66.9

Speed

63.3

Hardware mix

Rigs this user benchmarked on.

2x-rtx-40902 (17%)

m3-ultra-256gb1 (8%)

a100-80gb1 (8%)

rtx-3090-24gb1 (8%)

rtx-4070-12gb1 (8%)

m3-max-64gb1 (8%)

rtx-4080-16gb1 (8%)

macbook-pro-m3-pro-18gb1 (8%)

ryzen-7950x-cpu-only1 (8%)

m2-pro-16gb1 (8%)

rtx-3060-12gb1 (8%)

Provider mix

Where they spend their tokens.

meta3 (25%)

alibaba2 (17%)

deepseek1 (8%)

microsoft1 (8%)

zhipu1 (8%)

cognitivecomputations1 (8%)

google1 (8%)

yi1 (8%)

huggingface1 (8%)

Models tried

Best score per model. Click a model to see its full page.

#	Model	Provider	Best Score	Tier	Achieved
1	DeepSeek Coder V2 236B	deepseek	84.5	MAINLINE	2026-05-24
2	Llama 3.1 70B Instruct	meta	79.1	MAINLINE	2026-05-03
3	Qwen 2.5 VL 72B	alibaba	77.7	MAINLINE	2026-04-12
4	Qwen 3 32B Instruct	alibaba	77.0	MAINLINE	2026-04-18
5	WizardLM 2 8x22B	microsoft	74.5	FEEDER	2026-04-26
6	GLM 4 9B Chat	zhipu	70.0	FEEDER	2026-05-22
7	Code Llama 34B Instruct	meta	69.5	FEEDER	2026-05-20
8	Llama 3.1 8B Instruct	meta	66.7	FEEDER	2026-05-05
9	Dolphin 2.9 Llama 3 8B	cognitivecomputations	63.6	FEEDER	2026-05-21
10	CodeGemma 7B	google	60.9	FEEDER	2026-04-23
11	Yi 1.5 6B Chat	yi	58.1	TAP	2026-04-14
12	SmolLM 1.7B	huggingface	29.0	DRIP	2026-04-15

All submissions

Every run, ordered by score.

#	Model	Hardware	Score	Tier	Code	Reason	Tool Use	RAG	Speed	Tokens	Avg ms	Date
1	DeepSeek Coder V2 236B	2x-rtx-4090	84.5	MAINLINE	86.5	84.7	90.6	90.1	68.0	5.5K	3037	2026-05-24
2	Llama 3.1 70B Instruct	m3-ultra-256gb	79.1	MAINLINE	83.7	79.7	79.5	74.1	66.1	5.4K	2932	2026-05-03
3	Qwen 2.5 VL 72B	a100-80gb	77.7	MAINLINE	77.5	80.6	78.4	78.5	76.1	5.6K	889	2026-04-12
4	Qwen 3 32B Instruct	rtx-3090-24gb	77.0	MAINLINE	78.3	72.6	79.1	77.2	74.0	6.1K	1546	2026-04-18
5	WizardLM 2 8x22B	2x-rtx-4090	74.5	FEEDER	76.9	74.1	82.2	74.7	62.5	5.3K	3039	2026-04-26
6	GLM 4 9B Chat	rtx-4070-12gb	70.0	FEEDER	71.1	68.2	69.6	71.1	69.6	5.1K	854	2026-05-22
7	Code Llama 34B Instruct	m3-max-64gb	69.5	FEEDER	67.6	71.4	68.2	66.6	67.0	5.2K	1572	2026-05-20
8	Llama 3.1 8B Instruct	rtx-4080-16gb	66.7	FEEDER	69.7	68.0	69.5	64.1	59.6	5.1K	854	2026-05-05
9	Dolphin 2.9 Llama 3 8B	macbook-pro-m3-pro-18gb	63.6	FEEDER	60.2	66.5	61.2	65.2	66.9	4.8K	769	2026-05-21
10	CodeGemma 7B	ryzen-7950x-cpu-only	60.9	FEEDER	64.1	63.5	63.2	54.6	57.3	5.0K	796	2026-04-23
11	Yi 1.5 6B Chat	m2-pro-16gb	58.1	TAP	57.4	54.6	58.5	63.9	61.9	5.2K	826	2026-04-14
12	SmolLM 1.7B	rtx-3060-12gb	29.0	DRIP	32.3	30.9	24.5	22.7	30.5	5.2K	344	2026-04-15