Support

Everything Hugging Fit does, and how to use it.

Features, install instructions, and frequently asked questions. The extension itself is local-only — no account or API key required.

Capabilities

Built for people who run models locally.

Eight layers of tooling on every Hugging Face page you visit. No login, no tracking, no API key.

01

VRAM fit, per quant.

Set your card once. Every quantization on the model page is auto-classified fits, tight, or won't fit — with the math shown below.

Q4_K_Mfits
Q5_K_Mfits
Q6_Ktight
Q8_0over
02

Runtime & format compat.

Apps (vLLM, llama.cpp, Ollama, LM Studio, Jan, MLX LM, Draw Things, TGI, SGLang) and libraries (Transformers, Diffusers, GGUF, MLX, Sentence-Transformers, PEFT, Safetensors) — flagged early on every model and listing card.

vLLM llama.cpp Ollama LM Studio MLX LM Jan
03

Smart listings.

Browsing /models? Cards get tagged with size + fit + format + last-updated. Dim or hide non-fits. Tick your runtimes — we redirect to HF's own URL filter (?apps=…&library=…) so the server pre-narrows the results.

Llama-3.1-8B16.1 GB
Mistral-7B14.4 GB
Qwen3-72B144 GB
Phi-3.5-mini7.5 GB
04

One-click downloads.

Per-quant ↓ on every GGUF row. Whole-repo ↓ for safetensors / AWQ / GPTQ / FP8 / NVFP4 / diffusion checkpoints. Sharded files reassembled into one tidy folder. No CLI, no python, no auth flow.

Q4_K_M↓ 4.6 GB
Q5_K_M↓ 5.7 GB
Q6_K↓ 6.6 GB
Q8_0↓ 8.5 GB
05

Profile mini-panel.

Land on /<org>? A quick-nav block lets you jump to their models with your runtime filter pre-applied. Datasets, Spaces, Collections, Papers, Posts — all one click, no scrolling.

Models · filtered Datasets Spaces Collections
06

Theme HF itself.

Toggle "Theme HF page" and Hugging Face goes pink (Aurora), emerald (Forest), brutalist (Mono), or stays cream (Daylight). Hijacks Tailwind's dark mode and retints the dark surfaces with your palette. The site you stare at all day, in your colors.

07

UI cleanup, your way.

Hide HF clutter you never use: promo banners, the inference widget, "Use this model", trending carousels, the footer. Or paste your own selectors.

Promo banners
Inference widget
"Use this model" / Deploy
Trending carousel
Footer
.cookie-banner { display: none !important; }
08

Hover dossier — pin, drag, download.

Hover any model link anywhere on Hugging Face — a dossier appears with parameters, license, base model, formats, and a fit verdict. Pin it so it doesn't auto-hide. Drag it anywhere on the page. Hit ↓ download repo without ever clicking through.

openai/gpt-oss-20b · 📌
21B params · MoE · 14.6 GB Q4_K_M
parameters21B
fits @ 24GByes
licenseapache-2.0
GGUFsafetensorsmoe
How it works

Three steps, then you forget it's there.

  1. Install the extension.

    Manifest v3 · works on Chrome, Edge, Brave, Arc, Vivaldi. ~40 KB.

  2. Set your VRAM target.

    Pick from 8 / 12 / 16 / 24 / 48 / 80 — or type a custom value. Tune headroom and KV-cache dtype.

  3. Browse Hugging Face normally.

    Fit verdicts, runtime badges, and dossiers appear inline. Cards that won't fit dim or disappear.

Questions

Things people ask.

Does this send any of my data anywhere?

No. The extension only reads the Hugging Face DOM you're already looking at and computes everything in your browser. There is no analytics, no telemetry, no account, no API key.

How is the VRAM math done?

Per quant: file size + KV-cache estimate (configurable fp16 / fp8) + headroom for activations. Diffusion repos use a 3 GB minimum headroom regardless. Fits / tight / over thresholds map to colour bands.

Will it work on every model page?

It works on text models with quants on the page, on diffusion repos, and on listing pages. If a page genuinely lacks information (e.g. unlabelled custom forks), Hugging Fit says so rather than guessing.

Can I theme the floating panel too?

The popup theme drives the floating dossier, listing-card badges, and profile mini-panel — pick once, it travels. Toggle Theme HF page and the same palette retints Hugging Face itself by hijacking its Tailwind dark class and retinting the dark surfaces.

Where do downloads go?

Downloads/huggingface/<org>_<repo>/<file>. Sharded GGUFs (…-00001-of-00005.gguf) all land in the same folder. Anything > 5 GB or > 3 files asks for confirmation first — no surprise 200 GB pulls.

Does it talk to HF's own URL filters?

Yes. Tick vLLM, Ollama, llama.cpp, plus tasks / licenses / 4-bit / MoE — turn on Auto-apply to HF URL, and visiting /models redirects to ?apps=…&library=…&pipeline_tag=…&license=…&other=…. HF filters server-side, you scroll less.

Firefox? Safari?

Manifest v3 + chrome.storage means Chromium browsers work today. A Firefox port is on the list once a usable WebExtensions polyfill ships.

Get it

Stop downloading models that won't run.

Five seconds to install. Free. No tracking. Your VRAM thanks you.

Chrome Web Store listing pending review