docs: document model/context-window config, dynamic watermarks, rate-limit scoping

This commit is contained in:
iris 2026-05-20 16:55:13 +02:00
parent bac7dd6cde
commit 939df10a61
3 changed files with 60 additions and 19 deletions

View file

@ -382,7 +382,9 @@ Layout, top to bottom:
(input + cache_read + cache_write of the most recent
model call in the just-ended turn). This is the **actual
context window utilisation** — the number to watch when
deciding whether to compact.
deciding whether to compact. When `context_window_tokens`
is available from `/api/state`, the badge tooltip shows the
percentage of window used.
- Cost badge: `cost · 1.3M` — cumulative tokens billed
across **every inference** in the last turn (sum of all
per-call prompts). Tool-heavy turns rebill the cached
@ -407,7 +409,10 @@ Layout, top to bottom:
Polling: `/api/state` is fetched **once** on cold load, and
again while `status === 'needs_login_in_progress'` (login
session output isn't event-shaped yet). Every other badge
updates from SSE; no periodic refresh timer runs.
updates from SSE; no periodic refresh timer runs. Snapshot
includes `context_window_tokens` (effective window size for
the agent's current model, from `events::context_window_tokens`)
used to compute percentage-of-window in the ctx badge tooltip.
- Inbox `<details>` block (collapsed): `inbox · N` — last 30
messages addressed to this agent, fetched via
`AgentRequest::Recent { limit: 30 }`. Reply messages (those