HQ records every LLM call an agent makes and lets you set monthly budgets to control spend.Documentation Index
Fetch the complete documentation index at: https://docs.yourhq.ai/llms.txt
Use this file to discover all available pages before exploring further.
What gets tracked
Token usage
Input, output, cache-read, and cache-write tokens per call.
Estimated cost
Calculated from known model pricing. Unknown models are counted as unmetered — never silently invented.
Setting a budget
Open the agent detail page
Navigate to Agents → select an agent → scroll to the Usage & Budget section.
Configure limits
Set a monthly limit in USD, a soft warning threshold (percentage), and whether to hard-stop the agent when the limit is exceeded.
What happens at the limit
| Setting | Behavior |
|---|---|
| Soft threshold | A notification is created when the agent crosses the warning percentage |
| Hard cutoff enabled | The runtime blocks further replies and the dispatcher stops waking the agent for background work |
| Hard cutoff disabled | The agent keeps working — you get the notification but no enforcement |