Glimp

Point a vision-language model at a folder of scans. Glimp renames and sorts them by category, caption, date, and orientation — entirely on-device.

macOS 14+ Apple Silicon 100% on-device v0.1 beta

Download for Mac View source

✺ what it does

📁

Folder in, folder out

Drop a folder of scans. Glimp inspects each image and proposes a new name, category, caption, and rotation. Approve in a grid view.

🧠

VLM on Apple Silicon

Runs MLX-quantized vision-language models (Qwen3-VL-4B by default) via mlx-swift-examples. Nothing leaves the machine.

🏷️

13 categories, exemplar-tuned

photo · portrait · group · travel · event · card · letter · document · id · art · screenshot · receipt · other. Prompt iterated against a labeled eval set.

↻

Auto-rotation

Detects the upright orientation and writes EXIF rotation. 85% accuracy on a 56-image FastFoto-heavy test set.

🔬

Built-in eval harness

Ships with eval-batch and a seeded eval-search so prompt and model swaps are testable, not vibes.

🔒

Private by construction

No cloud calls, no analytics on image content. Your scans stay in their folder; the model lives in ~/Library.

✺ install

Download Glimp.zip and unzip. Drag Glimp.app into /Applications.
First launch: right-click → Open (the build is unsigned in this beta, so Gatekeeper needs the override once).
On first run Glimp downloads the default model (~3 GB) into ~/Library/Caches/huggingface. Subsequent runs are offline.
Point it at a folder. Approve the proposed renames. Done.

✺ writeups

From 9% to 59%: prompt iteration on a tiny VLM eval set

Four MLX-quantized VLMs, four prompts, 56 hand-labeled scans. What moved the number, what didn't, and one model that was completely dead.

2026-04-29 · ~10 min read

Auto-search: letting the eval harness write the next prompt

A ~600-line Swift tool that proposes (prompt, decode-params) cells, scores each, and emits a Pareto front. So I stop hand-rolling v6.

2026-04-30 · ~6 min read