A concise explanation of how Headroom reduces LLM token usage.
Headroom sits between an app/agent and the LLM provider. It rewrites large inputs into smaller, task-preserving forms before the request is sent. The original content is kept locally and can be retrieved later by hash.
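The flow described above can be pictured as a content-addressed store sitting in front of the provider. The sketch below is a minimal illustration only, not Headroom's actual code or API: `LocalStore`, `shrink`, the `max_chars` threshold, and the marker format are all hypothetical names chosen for this example, and plain truncation stands in for whatever task-preserving rewriting Headroom really applies.

```python
import hashlib


class LocalStore:
    """Hypothetical in-memory store that keeps originals keyed by content hash."""

    def __init__(self):
        self._items = {}

    def put(self, text: str) -> str:
        # Derive a short key from the content itself (first 16 hex chars of SHA-256).
        key = hashlib.sha256(text.encode("utf-8")).hexdigest()[:16]
        self._items[key] = text
        return key

    def get(self, key: str) -> str:
        return self._items[key]


def shrink(text: str, store: LocalStore, max_chars: int = 200) -> str:
    """Return small inputs unchanged; replace large ones with a preview plus a hash reference."""
    if len(text) <= max_chars:
        return text
    key = store.put(text)  # the full original stays local
    preview = text[:max_chars]  # assumption: truncation as a stand-in for smarter rewriting
    omitted = len(text) - max_chars
    return f"{preview}\n[truncated {omitted} chars; original ref={key}]"


store = LocalStore()
big = "log line\n" * 1000      # a large tool output, for example
small = shrink(big, store)     # this shorter form is what would be sent to the LLM
```

Only `small` would go into the request. If the full content is needed later, the reference embedded in the marker can be passed to `store.get` to recover the original unchanged.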
Headroom saves tokens through three main ideas: