User profile

gpu-shopper

12 runs | First seen 2026-04-10 | Avg 64.3

Best PipelineScore

85.1 (MAINLINE)

on Qwen 2.5 72B Instruct

Best run beats 98% of all 428 submissions

Best rig h200-141gb ranks #8 of 63 rigs

Total tokens: 67.4K (across every task this user has run)

Avg latency: 758ms (per task, across all submissions)

Tasks run: 300 (12 submissions x ~34 tasks)

Rigs used: 10 (distinct hardware tags)

Category signature

Average score per category across all 12 runs.

Code: 64.5
Reason: 62.6
Tool Use: 65.1
RAG: 63.9
Speed: 64.5

Hardware mix

Rigs this user benchmarked on.

macbook-pro-m3-pro-18gb: 3 (25%)
h200-141gb: 1 (8%)
b200-192gb: 1 (8%)
m3-max-64gb: 1 (8%)
m3-pro-36gb: 1 (8%)
ryzen-7950x-cpu-only: 1 (8%)
a100-40gb: 1 (8%)
m1-air-8gb: 1 (8%)
rtx-4070-12gb: 1 (8%)
rtx-3060-12gb: 1 (8%)

Provider mix

Where they spend their tokens.

alibaba: 2 (17%)
microsoft: 2 (17%)
cohere: 2 (17%)
cognitivecomputations: 1 (8%)
zyphra: 1 (8%)
upstage: 1 (8%)
google: 1 (8%)
community: 1 (8%)
bigcode: 1 (8%)
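
The percentages in the hardware and provider mixes appear to be each entry's run count divided by the 12 total runs, rounded to a whole percent. The sketch below reproduces the provider figures under that assumption; `PROVIDERS` and `mix` are hypothetical names, and the provider list is copied by hand from the per-model rows on this page.

```python
from collections import Counter

# One provider tag per run, copied from the per-model rows on this page.
PROVIDERS = [
    "alibaba", "microsoft", "cohere", "cognitivecomputations",
    "microsoft", "zyphra", "upstage", "cohere",
    "google", "community", "bigcode", "alibaba",
]

def mix(tags):
    """Return {tag: (run_count, whole_percent_share)} for a list of tags."""
    counts = Counter(tags)
    total = len(tags)
    return {tag: (n, round(100 * n / total)) for tag, n in counts.items()}

shares = mix(PROVIDERS)
for tag, (n, pct) in sorted(shares.items(), key=lambda kv: -kv[1][0]):
    print(f"{tag}: {n} ({pct}%)")
```

Because each share is rounded independently, the displayed percentages need not sum to exactly 100. The same function applies unchanged to the hardware tags.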

Models tried

Best score per model.

#	Model	Provider	Best Score	Tier	Achieved
1	Qwen 2.5 72B Instruct	alibaba	85.1	MAINLINE	2026-04-16
2	WizardLM 2 8x22B	microsoft	79.5	MAINLINE	2026-04-10
3	Aya 23 35B	cohere	74.1	FEEDER	2026-04-14
4	Dolphin 3.0 Llama 3.1 8B	cognitivecomputations	68.5	FEEDER	2026-04-22
5	Phi 4 14B	microsoft	67.9	FEEDER	2026-04-13
6	Zamba 2 7B Instruct	zyphra	66.7	FEEDER	2026-05-10
7	SOLAR 10.7B Instruct	upstage	66.6	FEEDER	2026-04-12
8	Command R	cohere	65.5	FEEDER	2026-04-28
9	Gemma 3 4B IT	google	58.8	TAP	2026-04-17
10	LLaVA 1.6 Mistral 7B	community	56.0	TAP	2026-04-12
11	StarCoder2 3B	bigcode	49.5	TAP	2026-05-02
12	Qwen 2.5 0.5B Instruct	alibaba	33.9	DRIP	2026-05-04

All submissions

Every run, ordered by score.

#	Model	Hardware	Score	Tier	Code	Reason	Tool Use	RAG	Speed	Tokens	Avg ms	Date
1	Qwen 2.5 72B Instruct	h200-141gb	85.1	MAINLINE	81.5	85.3	87.2	81.7	89.4	5.2K	906	2026-04-16
2	WizardLM 2 8x22B	b200-192gb	79.5	MAINLINE	77.8	78.4	84.3	77.3	80.3	5.4K	911	2026-04-10
3	Aya 23 35B	m3-max-64gb	74.1	FEEDER	73.9	72.9	71.2	73.6	75.3	5.5K	1547	2026-04-14
4	Dolphin 3.0 Llama 3.1 8B	macbook-pro-m3-pro-18gb	68.5	FEEDER	71.2	66.0	69.7	69.9	66.4	6.4K	838	2026-04-22
5	Phi 4 14B	m3-pro-36gb	67.9	FEEDER	66.0	64.8	70.6	69.3	65.1	5.2K	767	2026-04-13
6	Zamba 2 7B Instruct	macbook-pro-m3-pro-18gb	66.7	FEEDER	72.0	66.6	63.8	62.4	69.1	6.3K	788	2026-05-10
7	SOLAR 10.7B Instruct	ryzen-7950x-cpu-only	66.6	FEEDER	66.1	63.7	73.9	68.0	64.6	5.6K	807	2026-04-12
8	Command R	a100-40gb	65.5	FEEDER	61.0	67.4	66.1	65.3	65.9	5.7K	626	2026-04-28
9	Gemma 3 4B IT	m1-air-8gb	58.8	TAP	58.4	54.3	63.2	55.6	58.2	5.4K	348	2026-04-17
10	LLaVA 1.6 Mistral 7B	rtx-4070-12gb	56.0	TAP	55.2	55.8	55.3	57.1	61.5	5.1K	872	2026-04-12
11	StarCoder2 3B	rtx-3060-12gb	49.5	TAP	53.1	45.1	46.7	54.5	47.2	5.9K	361	2026-05-02
12	Qwen 2.5 0.5B Instruct	macbook-pro-m3-pro-18gb	33.9	DRIP	38.1	31.5	28.6	31.7	30.6	5.8K	330	2026-05-04
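
The headline figures on this page (Avg 64.3, the five category averages, and Avg latency 758ms) look like plain unweighted means over the 12 runs listed above. The sketch below recomputes them under that assumption; `RUNS`, `FIELDS`, and `summarize` are hypothetical names, and the numbers are copied by hand from the table. Since the table values are already rounded to one decimal, the last digit of a recomputed mean may differ slightly from what the page displays.

```python
from statistics import mean

# One tuple per run, copied from the "All submissions" table:
# (score, code, reason, tool_use, rag, speed, avg_ms)
RUNS = [
    (85.1, 81.5, 85.3, 87.2, 81.7, 89.4, 906),
    (79.5, 77.8, 78.4, 84.3, 77.3, 80.3, 911),
    (74.1, 73.9, 72.9, 71.2, 73.6, 75.3, 1547),
    (68.5, 71.2, 66.0, 69.7, 69.9, 66.4, 838),
    (67.9, 66.0, 64.8, 70.6, 69.3, 65.1, 767),
    (66.7, 72.0, 66.6, 63.8, 62.4, 69.1, 788),
    (66.6, 66.1, 63.7, 73.9, 68.0, 64.6, 807),
    (65.5, 61.0, 67.4, 66.1, 65.3, 65.9, 626),
    (58.8, 58.4, 54.3, 63.2, 55.6, 58.2, 348),
    (56.0, 55.2, 55.8, 55.3, 57.1, 61.5, 872),
    (49.5, 53.1, 45.1, 46.7, 54.5, 47.2, 361),
    (33.9, 38.1, 31.5, 28.6, 31.7, 30.6, 330),
]
FIELDS = ("score", "code", "reason", "tool_use", "rag", "speed", "avg_ms")

def summarize(runs):
    """Return the unweighted mean of every column, keyed by field name."""
    columns = zip(*runs)  # transpose rows into per-field columns
    return {field: mean(col) for field, col in zip(FIELDS, columns)}

summary = summarize(RUNS)
for field in FIELDS:
    print(f"{field:>8}: {summary[field]:.2f}")
```

Each recomputed mean lands within 0.1 of the value shown in the profile header and category signature, which is consistent with the page averaging unrounded scores before display.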