Caveman
Caveman

Save 65% of your AI costs.

≈35% kept65% cut

Starred by engineers at

Why use many token when few token do trick.

Five layers. Every token earns its place.

support-agent.ts
1const client =new OpenAI({
2 baseURL: "…/gateway/openai/v1",
3 "x-cave-agent": "support"
4});
caveman://gateway12,480 meteredlive
POST /openai/v1/chat/completionsbyte-safe
prompt tokens4,210
cached prefix reused2,890
billed to you1,320
model-visible bytes unchanged69% billed
01coming soon

Caveman Gateway

Compress any LLM traffic.

  • Swap one base URL — your agents never change
  • Truthful spend, metered token-for-token
  • Byte-safe: record mode never touches what the model sees
02

Cavemem

Agents that stop forgetting.

  • One persistent recall layer, served over MCP
  • Local SQLite with FTS5 and a vector index
  • Pull back what matters instead of re-sending it
npm install -g cavemem
caveman-code · v0.19.1claude-opus-4-8
autopilotmsgs1.1k/30klayers0/4
03

Caveman Code

A terminal agent on a token budget.

  • Four compression layers across 20+ providers
  • Plan first, then ship — one autonomous loop
  • Same models, roughly half the tokens
npm install -g @juliusbrussee/caveman-code
cave plan · inferred12,480 traces
ranked moves · per day+$0/day
    S0zero app change
    S1SDK cooperation
    S2eval-gated routing

    inferred rate · verified stays $0 until live

    04

    Cave Architect

    Telemetry becomes a ranked plan.

    • Measured spend turns into ordered moves
    • Each move carries its own dollars-per-day
    • Split by how much app change it costs you
    rollout · resolve-ticketlive
    1replay
    2shadow
    3canary
    4active
    eval gate0.997 ≥ 0.98 · pass

    auto-rollback armed · nothing counts until the gate passes

    05

    Eval-Gated Rollout

    Savings you actually earned.

    • Clears replay, shadow and canary before live
    • Gated on evals — nothing counts until it passes
    • Auto-reverts the moment quality slips

    Truthful spend, down to the byte. See exactly where every token goes, priced to the cent.

    Don't code? Caveman help anyway.

    Lives in your chat box: ChatGPT, Claude, Gemini.

    ChatGPT with the Caveman extension — a compressed answer and the control panel docked beside it.
    Caveman off
    Caveman on

    Drag to compare

    Install browser extension

    The proof is public.

    Every number here is live and cited — held to the same honesty we hold our own metering to.

    74kGitHub stars, live#1Hacker News & GitHub Trending
    Top 220of every public repo on GitHub

    Fewer words. Same work.

    The open stack is public today — read the source, send a patch. The managed Cloud is in private development; leave your email and we'll reach out the moment a spot opens.

    No spam · one email when your invite is ready