chopratejas/headroom — 8,097 Stars
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.
Watch Episode
About This Repo#
chopratejas/headroom — 8,097 ⭐
Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server. - chopratejas/headroom
Narration#
Your agent burns tokens on noise. Logs, tool outputs, RAG chunks—most of it useless. Headroom compresses context before it hits the LLM. Ninety-five percent fewer tokens. Same answers. Library, proxy, MCP server. Six compression algorithms including AST-aware. CacheAligner restructures prompts so KV caches save you money. Eight thousand stars. Pays for itself.