Build templates

Build guides

Curated AI workstation templates with estimated tokens-per-second (tok/s) and value ratings. Customize any guide in the builder.

New here?

Not sure where to start?

Answer 5 quick questions and get a personalized recommendation.

Help me choose

Featured

Local Dev Starter

Single RTX 4080 SUPER build for running 7B–14B models locally with llama.cpp or Ollama.

budget · single-gpu · local-dev

~$2,800

~114.1 tok/s · 40.75 tok/s per $1k (Llama 3.1 8B)

View guide

Featured

Home Inference Workstation

RTX 4090 powerhouse for 8B–34B models with headroom for agent workflows.

4090 · home-lab

~$4,200

~93.6 tok/s · 22.29 tok/s per $1k (Qwen 2.5 14B)

View guide

Featured

Dual RTX 4090 Workstation

Twin 4090s with 48 GB of combined VRAM (24 GB per card, model split across both GPUs) to hold 70B-class models with context headroom.

dual-gpu · 4090 · nvlink

~$6,500

~18.7 tok/s · 2.88 tok/s per $1k (Llama 3.3 70B)

View guide

Featured

Pro Dual-GPU 70B

Team-grade dual 4090 rig targeting Llama 3.3 70B at Q4.

pro · 70b · dual-gpu

~$12,000

~14 tok/s · 1.17 tok/s per $1k (Llama 3.3 70B)

View guide

Agent Host (16K Context)

Dual-GPU workstation tuned for agent workloads with 16K context depth.

agent · 16k-context

~$4,500

~337.5 tok/s · 75.00 tok/s per $1k (Qwen 3 Coder 30B-A3B (MoE))

View guide

Budget Llama 8B Box

Minimal spend path to a solid Llama 3.1 8B daily driver.

budget · 8b

~$1,800

~114.1 tok/s · 63.39 tok/s per $1k (Llama 3.1 8B)

View guide

Entry $2K 14B Build

Cost-conscious 14B inference box with modern single-GPU VRAM.

entry · 14b · budget

~$2,000

~65.2 tok/s · 32.60 tok/s per $1k (Qwen 2.5 14B)

View guide

Fine-tune Workstation

64GB RAM and RTX 4090 for LoRA fine-tuning on 8B–14B models.

fine-tuning · lora

~$4,800

~93.6 tok/s · 19.50 tok/s per $1k (Qwen 2.5 14B)

View guide

Team Server Inference

High-memory build for concurrent team inference with vLLM on large models.

team · concurrency · vllm

~$5,000

~14 tok/s · 2.80 tok/s per $1k (Llama 3.3 70B)

View guide
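
How the value rating is derived: the figures on the cards above are consistent with dividing the estimated tok/s by the build price in thousands of dollars. The sketch below reproduces a few of them. The function name and the list layout are illustrative assumptions, not part of the builder itself, and each rating is measured on the model named on its card, so ratings for different models are not directly comparable.

```python
def tok_per_s_per_1k(tok_per_s: float, price_usd: float) -> float:
    """Value rating: tokens per second per $1,000 of build cost."""
    if price_usd <= 0:
        raise ValueError("price_usd must be positive")
    return tok_per_s / (price_usd / 1000)


# Figures copied from the cards above: (name, tok/s, approximate price in USD).
builds = [
    ("Local Dev Starter", 114.1, 2800),
    ("Home Inference Workstation", 93.6, 4200),
    ("Dual RTX 4090 Workstation", 18.7, 6500),
    ("Budget Llama 8B Box", 114.1, 1800),
]

for name, tps, price in builds:
    print(f"{name}: {tok_per_s_per_1k(tps, price):.2f} tok/s per $1k")
```

For example, the Local Dev Starter works out to 114.1 / 2.8 = 40.75 tok/s per $1k, matching its card.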

No assembly required

Plug-and-play alternatives

Prefer a ready-to-use device? These options ship configured and work out of the box.

Apple Silicon

Mac mini M4 (16GB)

$799

16GB unified memory · ~28 tok/s on Llama 8B

View config →

Apple Silicon

MacBook Pro 14" M4 Pro (48GB)

$2,499

48GB unified memory · ~42 tok/s on Llama 8B

View config →

Apple Silicon

Mac Studio M4 Max (36GB)

$2,499

36GB unified memory · ~41 tok/s on Llama 8B

View config →

Apple Silicon

Mac Studio M3 Ultra (96GB)

$5,299

96GB unified memory · ~68 tok/s on Llama 8B

View config →

NVIDIA Appliance

NVIDIA DGX Spark

$4,599+

128GB unified memory · fits 70B-class models at 8-bit quantization (16-bit weights alone need about 140 GB)

View on NVIDIA.com

Framework Appliance

Framework Desktop (Ryzen AI Max+ 395, 128GB)

$2,459+

128GB unified memory · fits 70B-class models at 8-bit quantization (16-bit weights alone need about 140 GB)

View on Framework.com
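
Will a model fit? As a rough check against the memory figures listed above, weight size can be approximated as parameter count times bits per weight, divided by 8. The sketch below is a back-of-the-envelope estimate only. It assumes nominal bit widths (real quantization formats usually use slightly more than their nominal bits per weight) and ignores KV cache, activations, and runtime overhead. The function names and the headroom parameter are hypothetical.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight footprint in GB (10^9 bytes), weights only."""
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, headroom_gb: float = 0.0) -> bool:
    """True if the weights plus an assumed headroom fit in the given memory."""
    needed = weight_memory_gb(params_billion, bits_per_weight) + headroom_gb
    return needed <= memory_gb


# A 70B model at nominal 4-, 8-, and 16-bit widths.
for bits in (4, 8, 16):
    size = weight_memory_gb(70, bits)
    print(f"70B at {bits}-bit: ~{size:.0f} GB of weights, "
          f"fits in 48 GB: {fits(70, bits, 48)}, "
          f"fits in 128 GB: {fits(70, bits, 128)}")
```

Under these assumptions a 70B model needs about 35 GB at 4-bit, 70 GB at 8-bit, and 140 GB at 16-bit, so the headroom left for context depends heavily on the quantization chosen.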