User profile

apple-silicon

9 runs

First seen 2026-04-10

Avg 67.5

Best PipelineScore

82.3

MAINLINE

Best run beats 92% of all 428 submissions

Best rig m3-ultra-256gb ranks #23 of 63 rigs

Total tokens

47.5K

Across every task this user has run

Avg latency

1172 ms

Per task, across all submissions

Tasks run

225

9 submissions x ~34 tasks

Rigs used

9

Distinct hardware tags

Category signature

Average score per category across all 9 runs.

Code

67.5

Reason

70.7

Tool Use

67.7

RAG

67.2

Speed

65.4
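The category figures above can be reproduced from the per-run table further down this page. The sketch below assumes each figure is a plain unweighted mean over the 9 runs, rounded to one decimal; the page does not state the method, but the numbers agree with that assumption. The names `CATEGORIES`, `RUNS`, and `category_averages` are illustrative, not part of the site.

```python
# Per-run category scores copied from the "Every run, ordered by score" table.
# Column order: Code, Reason, Tool Use, RAG, Speed.
CATEGORIES = ["Code", "Reason", "Tool Use", "RAG", "Speed"]

RUNS = [
    (82.6, 88.8, 85.0, 82.4, 70.3),  # Qwen 3 235B-A22B MoE
    (81.3, 85.6, 74.8, 75.2, 76.9),  # Llama 3.3 70B Instruct
    (74.0, 74.4, 81.5, 75.5, 66.2),  # Kimi K2 Instruct
    (66.9, 72.4, 70.1, 65.7, 77.0),  # Yi Coder 9B (m2-pro-16gb)
    (66.6, 71.0, 71.2, 68.8, 63.3),  # Yi Coder 9B (rtx-3060-12gb)
    (61.5, 69.7, 66.8, 71.7, 67.9),  # Mistral Nemo 12B Instruct
    (66.4, 64.7, 57.4, 54.7, 56.0),  # Phi 3 Small 7B
    (60.2, 57.2, 54.9, 58.3, 62.6),  # Gemma 3 4B IT
    (48.1, 52.8, 47.5, 52.2, 48.4),  # Qwen 2.5 Coder 1.5B
]


def category_averages(runs):
    """Return the unweighted mean of each category column, rounded to 1 decimal.

    Assumption: every run counts equally, regardless of model or rig.
    """
    n = len(runs)
    return {
        name: round(sum(run[i] for run in runs) / n, 1)
        for i, name in enumerate(CATEGORIES)
    }


averages = category_averages(RUNS)
for name, value in averages.items():
    print(f"{name}: {value}")
```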

Rigs this user benchmarked on.

Rig	Runs	Share
m3-ultra-256gb	1	11%
b200-192gb	1	11%
2x-rtx-4090	1	11%
m2-pro-16gb	1	11%
rtx-3060-12gb	1	11%
rtx-4080-16gb	1	11%
m3-pro-36gb	1	11%
macbook-pro-m3-pro-18gb	1	11%
snapdragon-x-elite-32gb	1	11%

Where they spend their tokens.

Provider	Runs	Share
alibaba	2	22%
yi	2	22%
meta	1	11%
moonshot	1	11%
mistral	1	11%
microsoft	1	11%
google	1	11%

Best score per model. Click a model to see its full page.

#	Model	Provider	Best Score	Tier	Achieved
1	Qwen 3 235B-A22B MoE	alibaba	82.3	MAINLINE	2026-05-10
2	Llama 3.3 70B Instruct	meta	78.8	MAINLINE	2026-05-14
3	Kimi K2 Instruct	moonshot	73.9	FEEDER	2026-05-20
4	Yi Coder 9B	yi	70.4	FEEDER	2026-04-10
5	Mistral Nemo 12B Instruct	mistral	66.6	FEEDER	2026-04-22
6	Phi 3 Small 7B	microsoft	60.9	FEEDER	2026-04-11
7	Gemma 3 4B IT	google	58.2	TAP	2026-04-26
8	Qwen 2.5 Coder 1.5B	alibaba	48.4	TAP	2026-04-13

Every run, ordered by score.

#	Model	Hardware	Score	Tier	Code	Reason	Tool Use	RAG	Speed	Tokens	Avg ms	Date
1	Qwen 3 235B-A22B MoE	m3-ultra-256gb	82.3	MAINLINE	82.6	88.8	85.0	82.4	70.3	5.6K	2890	2026-05-10
2	Llama 3.3 70B Instruct	b200-192gb	78.8	MAINLINE	81.3	85.6	74.8	75.2	76.9	5.5K	877	2026-05-14
3	Kimi K2 Instruct	2x-rtx-4090	73.9	FEEDER	74.0	74.4	81.5	75.5	66.2	5.2K	2934	2026-05-20
4	Yi Coder 9B	m2-pro-16gb	70.4	FEEDER	66.9	72.4	70.1	65.7	77.0	5.0K	786	2026-04-10
5	Yi Coder 9B	rtx-3060-12gb	68.1	FEEDER	66.6	71.0	71.2	68.8	63.3	5.4K	790	2026-05-10
6	Mistral Nemo 12B Instruct	rtx-4080-16gb	66.6	FEEDER	61.5	69.7	66.8	71.7	67.9	5.6K	791	2026-04-22
7	Phi 3 Small 7B	m3-pro-36gb	60.9	FEEDER	66.4	64.7	57.4	54.7	56.0	4.9K	792	2026-04-11
8	Gemma 3 4B IT	macbook-pro-m3-pro-18gb	58.2	TAP	60.2	57.2	54.9	58.3	62.6	5.3K	342	2026-04-26
9	Qwen 2.5 Coder 1.5B	snapdragon-x-elite-32gb	48.4	TAP	48.1	52.8	47.5	52.2	48.4	4.9K	342	2026-04-13
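
The "Best score per model" table earlier on this page looks like a reduction of the run list above: keep the highest-scoring run for each model, then sort by score. The sketch below illustrates that reading using only the model, score, and date columns. The function name `best_per_model` and the data layout are assumptions for illustration, not the site's actual implementation.

```python
# (model, score, date) triples copied from the "Every run, ordered by score" table.
RUNS = [
    ("Qwen 3 235B-A22B MoE", 82.3, "2026-05-10"),
    ("Llama 3.3 70B Instruct", 78.8, "2026-05-14"),
    ("Kimi K2 Instruct", 73.9, "2026-05-20"),
    ("Yi Coder 9B", 70.4, "2026-04-10"),
    ("Yi Coder 9B", 68.1, "2026-05-10"),
    ("Mistral Nemo 12B Instruct", 66.6, "2026-04-22"),
    ("Phi 3 Small 7B", 60.9, "2026-04-11"),
    ("Gemma 3 4B IT", 58.2, "2026-04-26"),
    ("Qwen 2.5 Coder 1.5B", 48.4, "2026-04-13"),
]


def best_per_model(runs):
    """Keep the top-scoring run per model, sorted by score in descending order."""
    best = {}
    for model, score, date in runs:
        # Replace the stored entry only when this run scores strictly higher.
        if model not in best or score > best[model][0]:
            best[model] = (score, date)
    return sorted(best.items(), key=lambda item: item[1][0], reverse=True)


ranking = best_per_model(RUNS)
for rank, (model, (score, date)) in enumerate(ranking, start=1):
    print(rank, model, score, date)
```

With 9 runs and one model (Yi Coder 9B) benchmarked twice, this yields 8 rows, matching the length of the per-model table.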