Question 1

What is LocalLLM Compare?

Accepted Answer

LocalLLM Compare is a hardware-first reference for choosing open source local LLMs by GPU fit, VRAM requirement, quantization, and generation speed.

Question 2

How do I choose a local LLM for my GPU?

Accepted Answer

Start with your available VRAM, then compare Q4_K_M model estimates and tokens/sec benchmarks on the matching GPU page.

Question 3

Does LocalLLM Compare track prompt processing speed?

Accepted Answer

No. The benchmark tables currently track generation speed, also called decode tokens per second.

Question 4

Can I contribute benchmark data?

Accepted Answer

Yes. The site is data-driven, and benchmark corrections or new GPU/model measurements can be contributed on GitHub.

Local LLM Hardware Guide

Choose your GPU