PipelineScore
Hardware Board

Rank the rigs.

60 hardware tags from 412 tagged runs, each ranked by the best score anyone has posted on that rig. Same testpack everywhere — the hardware is the variable. Sort by value to see score per dollar.

Your rig missing? Run the CLI with --hardware-tag and put it on the board.
60 rigs · sorted by best score (high → low) · pick any two rigs to compare
#vsHardwareBest PipelineScore Best model on this rigRuns Tier
1lab-b200-192gb
89.5
DeepSeek R1 671B-A37Bdeepseek2MAINLINE
2m2-ultra-192gb
88.8
Qwen 3 235B-A22B MoEalibaba7MAINLINE
3b200-192gb
88.1
DeepSeek V3 671B-A37Bdeepseek10MAINLINE
4lab-h200-141gb
87.3
DeepSeek Coder V2 236Bdeepseek4MAINLINE
5dgx-h100
86.0
DeepSeek R1 671B-A37Bdeepseek6MAINLINE
6a100-80gb
85.3
DeepSeek Coder V2 236Bdeepseek14MAINLINE
7h200-141gb
85.1
Qwen 2.5 72B Instructalibaba7MAINLINE
8lab-dgx-h100
85.0
Qwen 3 72B Instructalibaba6MAINLINE
9h100-80gb
84.8
Hermes 3 Llama 3.1 405Bnous7MAINLINE
10cloud-api
84.7
DeepSeek Coder V2 236Bdeepseek10MAINLINE
112x-rtx-4090
84.5
DeepSeek Coder V2 236Bdeepseek7MAINLINE
12rtx-3080-10gb
84.0
DeepSeek V4deepseek10MAINLINE
13m3-max-64gb
84.0
DeepSeek R1 Distill Qwen 32Bdeepseek8MAINLINE
144x-rtx-3090
83.8
Qwen 3 72B Instructalibaba5MAINLINE
15lab-baseline
83.7
DeepSeek V4deepseek5MAINLINE
16m3-max-128gb
83.3
DeepSeek V4deepseek12MAINLINE
17lab-m3-ultra-256gb
83.1
Hermes 3 Llama 3.1 405Bnous2MAINLINE
18m4-pro-48gb
83.1
DeepSeek V4deepseek4MAINLINE
19rtx-4090-24gb
82.5
DeepSeek V4deepseek16MAINLINE
20lab-m3-max-64gb
82.4
Qwen 3 32B Instructalibaba3MAINLINE
21m3-ultra-256gb
82.3
Qwen 3 235B-A22B MoEalibaba8MAINLINE
22rtx-4080-16gb-offload
82.3
DeepSeek R1 Distill Qwen 32Bdeepseek7MAINLINE
23rtx-3090-24gb
81.3
Qwen 3.6 72Balibaba10MAINLINE
24lab-m2-ultra-192gb
81.2
Llama 4 70B Instructmeta4MAINLINE
25lab-a100-40gb
81.2
DeepSeek R1 Distill Qwen 32Bdeepseek4MAINLINE
26ryzen-7950x-cpu-only
80.5
Qwen 3.6 72Balibaba23MAINLINE
27lab-h100-80gb
79.9
Qwen 2.5 72B Instructalibaba2MAINLINE
28rtx-4080-16gb
79.9
Qwen 3 14B Instructalibaba14MAINLINE
29ryzen-5950x-rtx-3060
79.6
Qwen 3.6 72Balibaba3MAINLINE
30a100-40gb
79.3
Devstral Small 24Bmistral4MAINLINE
31macbook-pro-m3-pro-18gb
79.0
Qwen 3 14B Instructalibaba17MAINLINE
32lab-4x-rtx-3090
78.8
Qwen 2.5 VL 72Balibaba2MAINLINE
33lab-rtx-3090-24gb
77.9
Codestral 22Bmistral3MAINLINE
34lab-ryzen-7950x-cpu-only
76.9
Qwen 2.5 14B Instructalibaba9MAINLINE
35a6000-48gb
76.9
DeepSeek Coder V2 16Bdeepseek4MAINLINE
36lab-a6000-48gb
76.7
Devstral Small 24Bmistral2MAINLINE
37m2-pro-16gb
75.7
Gemma 3 12B ITgoogle15MAINLINE
38lab-macbook-pro-m3-pro-18gb
75.3
DeepSeek R1 Distill Llama 8Bdeepseek13MAINLINE
39rtx-3060-12gb
75.2
Qwen 2.5 14B Instructalibaba25MAINLINE
40rtx-4070-12gb
74.7
Qwen 2.5 14B Instructalibaba17FEEDER
41lab-rtx-3080-10gb
74.2
Qwen 3 14B Instructalibaba5FEEDER
42ryzen-7950x-rtx-3090
73.4
Codestral 22Bmistral4FEEDER
43lab-m3-max-128gb
73.4
Gemma 3 27B ITgoogle2FEEDER
44m3-pro-36gb
73.2
Gemma 3 12B ITgoogle17FEEDER
45lab-m2-pro-16gb
72.6
Qwen 3 8B Instructalibaba4FEEDER
46lab-rtx-4080-16gb
72.2
Gemma 3 12B ITgoogle2FEEDER
47lab-rtx-4080-16gb-offload
70.0
Aya 23 35Bcohere1FEEDER
48lab-a100-80gb
68.7
Mixtral 8x7B Instructmistral1FEEDER
49lab-ryzen-7950x-rtx-3090
68.4
Phi 3.5 MoE 42Bmicrosoft1FEEDER
50lab-rtx-4070-12gb
68.4
Yi Coder 9Byi2FEEDER
51lab-m3-pro-36gb
68.3
Llama 4 8B Instructmeta3FEEDER
52lab-rtx-3060-12gb
66.9
InternLM 2.5 7B Chatinternlm6FEEDER
53lab-snapdragon-x-elite-32gb
64.6
Qwen 3 4B Instructalibaba1FEEDER
54m2-air-16gb
59.5
Qwen 2.5 3B Instructalibaba5TAP
55m1-air-8gb
58.8
Gemma 3 4B ITgoogle9TAP
56lab-m3-air-16gb
55.1
Qwen 2.5 3B Instructalibaba3TAP
57m3-air-16gb
52.9
Phi 3 Mini 3.8Bmicrosoft7TAP
58snapdragon-x-elite-32gb
52.4
Phi 3 Mini 3.8Bmicrosoft4TAP
59lab-m2-air-16gb
49.3
Granite 3.1 2B Instructibm3TAP
60lab-m1-air-8gb
28.9
TinyLlama 1.1B Chatcommunity1DRIP
vs

Click a hardware tag to see every run on that rig; tick two rigs to go head-to-head. A rig's rank reflects its best showing, so it rewards the best model the hardware can hold. Value is score per dollar (best rig = 100) using hand-maintained street-price approximations in USD; cloud and unpriced rigs are excluded from the value ranking.