User profile

gguf-pilgrim

12 runs · First seen 2026-04-16 · Avg 69.1

Best PipelineScore: 80.0 (MAINLINE) on Qwen 2.5 Coder 32B

Best run beats 85% of all 428 submissions.

Best rig m3-max-128gb ranks #18 of 63 rigs.

Total tokens: 67.1K (across every task this user has run)

Avg latency: 1221 ms (per task, across all submissions)

Tasks run: 300 (12 submissions x ~34 tasks)

Rigs used: 9 (distinct hardware tags)

Category signature

Average score per category across all 12 runs.

Category	Avg Score
Code	68.3
Reason	70.5
Tool Use	69.2
RAG	70.6
Speed	66.5
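As a rough check, the category averages can be recomputed from the per-run category columns in the "All submissions" table. The sketch below assumes each average is a plain arithmetic mean over the 12 runs; the names `RUNS`, `CATEGORIES`, and `category_averages` are illustrative and not part of the site. Because the table values are already rounded to one decimal, a recomputed average can differ from the published one in the last digit.

```python
# Per-run category scores copied from the "All submissions" table,
# in the order (Code, Reason, Tool Use, RAG, Speed).
RUNS = [
    (81.6, 79.0, 77.4, 81.9, 74.9),  # Qwen 2.5 Coder 32B
    (80.1, 81.7, 84.0, 77.6, 71.5),  # DeepSeek V3 671B-A37B
    (74.0, 79.9, 79.4, 80.6, 76.8),  # Hermes 3 Llama 3.1 70B
    (79.1, 80.5, 74.8, 78.3, 67.1),  # Gemma 3 27B IT
    (76.9, 74.2, 73.9, 80.2, 73.7),  # Devstral Small 24B
    (74.2, 71.3, 71.0, 77.2, 75.7),  # Mixtral 8x22B Instruct
    (72.5, 75.9, 76.3, 74.9, 76.6),  # Gemma 2 27B IT
    (67.0, 70.7, 71.1, 72.3, 62.0),  # Command R
    (61.7, 63.5, 61.5, 66.9, 66.1),  # Llama 3.1 8B Instruct
    (57.0, 67.2, 62.8, 66.1, 61.9),  # InternLM 2.5 7B Chat
    (46.3, 51.4, 48.7, 49.1, 49.5),  # Qwen 2.5 1.5B Instruct
    (49.2, 51.3, 49.4, 41.6, 42.4),  # StarCoder2 3B
]

CATEGORIES = ("Code", "Reason", "Tool Use", "RAG", "Speed")


def category_averages(runs):
    """Return the unrounded mean score per category (assumed method)."""
    n = len(runs)
    return {
        name: sum(run[i] for run in runs) / n
        for i, name in enumerate(CATEGORIES)
    }


if __name__ == "__main__":
    for name, avg in category_averages(RUNS).items():
        print(f"{name}\t{avg:.2f}")
```
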

Hardware mix

Rigs this user benchmarked on.

Rig	Runs	Share
cloud-api	2	17%
rtx-4080-16gb-offload	2	17%
macbook-pro-m3-pro-18gb	2	17%
m3-max-128gb	1	8%
2x-rtx-4090	1	8%
rtx-3090-24gb	1	8%
rtx-4090-24gb	1	8%
rtx-4080-16gb	1	8%
m3-air-16gb	1	8%

Provider mix

Where they spend their tokens.

Provider	Runs	Share
alibaba	2	17%
google	2	17%
mistral	2	17%
deepseek	1	8%
nous	1	8%
cohere	1	8%
meta	1	8%
internlm	1	8%
bigcode	1	8%
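The shares in the hardware and provider mixes look like run counts divided by the 12 total runs, rounded to a whole percent. A minimal sketch under that assumption follows; `PROVIDERS` and `mix_shares` are hypothetical names, and the provider list is taken from the submissions in score order. Since each share is rounded separately, the percentages need not add up to exactly 100.

```python
from collections import Counter

# Provider of each of the 12 runs, in the order of the submissions table.
PROVIDERS = [
    "alibaba", "deepseek", "nous", "google", "mistral", "mistral",
    "google", "cohere", "meta", "internlm", "alibaba", "bigcode",
]


def mix_shares(tags):
    """Return (tag, run count, rounded percent share), most frequent first."""
    counts = Counter(tags)
    total = len(tags)
    return [
        (tag, n, round(100 * n / total))
        for tag, n in counts.most_common()
    ]


if __name__ == "__main__":
    for tag, n, pct in mix_shares(PROVIDERS):
        print(f"{tag}\t{n}\t{pct}%")
```

The same function applies unchanged to the hardware tags.
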

Models tried

Best score per model.

#	Model	Provider	Best Score	Tier	Achieved
1	Qwen 2.5 Coder 32B	alibaba	80.0	MAINLINE	2026-05-12
2	DeepSeek V3 671B-A37B	deepseek	79.6	MAINLINE	2026-04-20
3	Hermes 3 Llama 3.1 70B	nous	78.1	MAINLINE	2026-05-15
4	Gemma 3 27B IT	google	76.9	MAINLINE	2026-05-02
5	Devstral Small 24B	mistral	75.6	MAINLINE	2026-05-08
6	Mixtral 8x22B Instruct	mistral	74.7	FEEDER	2026-04-23
7	Gemma 2 27B IT	google	74.7	FEEDER	2026-04-21
8	Command R	cohere	67.9	FEEDER	2026-05-16
9	Llama 3.1 8B Instruct	meta	63.6	FEEDER	2026-04-27
10	InternLM 2.5 7B Chat	internlm	62.7	FEEDER	2026-05-06
11	Qwen 2.5 1.5B Instruct	alibaba	48.1	TAP	2026-05-08
12	StarCoder2 3B	bigcode	47.0	TAP	2026-04-16

All submissions

Every run, ordered by score.

#	Model	Hardware	Score	Tier	Code	Reason	Tool Use	RAG	Speed	Tokens	Avg ms	Date
1	Qwen 2.5 Coder 32B	m3-max-128gb	80.0	MAINLINE	81.6	79.0	77.4	81.9	74.9	5.4K	1506	2026-05-12
2	DeepSeek V3 671B-A37B	2x-rtx-4090	79.6	MAINLINE	80.1	81.7	84.0	77.6	71.5	5.6K	3039	2026-04-20
3	Hermes 3 Llama 3.1 70B	cloud-api	78.1	MAINLINE	74.0	79.9	79.4	80.6	76.8	6.0K	869	2026-05-15
4	Gemma 3 27B IT	rtx-3090-24gb	76.9	MAINLINE	79.1	80.5	74.8	78.3	67.1	5.7K	1506	2026-05-02
5	Devstral Small 24B	rtx-4080-16gb-offload	75.6	MAINLINE	76.9	74.2	73.9	80.2	73.7	5.2K	1575	2026-05-08
6	Mixtral 8x22B Instruct	cloud-api	74.7	FEEDER	74.2	71.3	71.0	77.2	75.7	5.5K	945	2026-04-23
7	Gemma 2 27B IT	rtx-4090-24gb	74.7	FEEDER	72.5	75.9	76.3	74.9	76.6	5.5K	1431	2026-04-21
8	Command R	rtx-4080-16gb-offload	67.9	FEEDER	67.0	70.7	71.1	72.3	62.0	5.8K	1524	2026-05-16
9	Llama 3.1 8B Instruct	macbook-pro-m3-pro-18gb	63.6	FEEDER	61.7	63.5	61.5	66.9	66.1	5.8K	791	2026-04-27
10	InternLM 2.5 7B Chat	rtx-4080-16gb	62.7	FEEDER	57.0	67.2	62.8	66.1	61.9	5.1K	750	2026-05-06
11	Qwen 2.5 1.5B Instruct	macbook-pro-m3-pro-18gb	48.1	TAP	46.3	51.4	48.7	49.1	49.5	5.6K	348	2026-05-08
12	StarCoder2 3B	m3-air-16gb	47.0	TAP	49.2	51.3	49.4	41.6	42.4	5.9K	364	2026-04-16
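The headline figures at the top of the profile (average score, total tokens, average latency) appear to follow from this table. The sketch below assumes simple means and sums over the 12 rows; `SUBMISSIONS` and `headline_stats` are illustrative names, and token counts are the rounded "K" values shown above, so the total is approximate.

```python
from statistics import mean

# (score, tokens in thousands, avg latency in ms) per run, from the table above.
SUBMISSIONS = [
    (80.0, 5.4, 1506),
    (79.6, 5.6, 3039),
    (78.1, 6.0, 869),
    (76.9, 5.7, 1506),
    (75.6, 5.2, 1575),
    (74.7, 5.5, 945),
    (74.7, 5.5, 1431),
    (67.9, 5.8, 1524),
    (63.6, 5.8, 791),
    (62.7, 5.1, 750),
    (48.1, 5.6, 348),
    (47.0, 5.9, 364),
]


def headline_stats(rows):
    """Aggregate per-run rows into profile-level summary numbers."""
    scores, tokens_k, latencies = zip(*rows)
    return {
        "runs": len(rows),
        "avg_score": mean(scores),
        "total_tokens_k": sum(tokens_k),
        "avg_latency_ms": mean(latencies),
    }


if __name__ == "__main__":
    stats = headline_stats(SUBMISSIONS)
    print(f"{stats['runs']} runs, avg score {stats['avg_score']:.1f}")
    print(f"total tokens {stats['total_tokens_k']:.1f}K")
    print(f"avg latency {stats['avg_latency_ms']:.0f} ms")
```
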