docs sync + revert auto-unfree removal

revert the earlier 'operator must set allowUnfree' move: per-agent containers evaluate their own nixpkgs and the operator's host-level allowUnfree doesn't propagate in. restoring the scoped allowUnfreePredicate inside both the claude-unstable overlay and harness-base.nix; documented in README + gotchas as 'nothing to set on the operator side'. docs: - claude.md file map adds crash_watch.rs, kick_agent on coordinator, /api/model + journald viewer + bind-with-retry references. - scratchpad rewritten to reflect the recent run. - web-ui.md: notification row + browser notifications section, state row (badge + model chip + last-turn chip + cancel button), per-agent inbox, /model slash, /cancel-question + journald endpoints, focus-preservation on refresh. - turn-loop.md: --model is read from Bus::model() per turn (runtime override via /model); recv(wait_seconds) up to 180s with the rationale; ask_operator gains ttl_seconds; new TurnState section; kick_agent inbox-on-startup hint. - approvals.md: ttl/cancel resolution paths for operator questions. - persistence.md: /state/hyperhive-model file. - gotchas.md: web UI port collision policy (rename, don't probe); bind retry + SO_REUSEADDR shape; auto-unfree restored. - todo.md: cleaned up empty sections and stale entries; /model shipped, dropped from the list.
2026-05-15 21:26:13 +02:00 · 2026-05-15 21:26:13 +02:00 · 62d1a74929
commit 62d1a74929
parent d275b50177
10 changed files with 239 additions and 95 deletions
--- a/docs/turn-loop.md
+++ b/docs/turn-loop.md
@ -26,7 +26,7 @@ Each agent harness (`hive-ag3nt serve` or `hive-m1nd serve`) runs:
 ## The claude invocation

 ```
-claude --print --verbose --output-format stream-json --model haiku \
+claude --print --verbose --output-format stream-json --model <name> \
  --continue --settings /run/hive/claude-settings.json \
  --system-prompt-file /run/hive/claude-system-prompt.md \
  --mcp-config /run/hive/claude-mcp-config.json --strict-mcp-config \
@ -34,6 +34,12 @@ claude --print --verbose --output-format stream-json --model haiku \
 # wake prompt piped over stdin
 ```

+`<name>` is read from `Bus::model()` on each turn, default
+`haiku`. Operator can flip it at runtime with `/model <name>` in
+the web terminal — the next turn picks it up.  The choice is
+persisted to `/state/hyperhive-model` so it survives restart;
+override path: `HYPERHIVE_MODEL_FILE` env var for tests.
+
 `--continue` keeps a persistent session per agent (claude stores
 sessions in `~/.claude/projects/`, which is bind-mounted
 persistently). Auto-compact and auto-memory are disabled via
@ -45,6 +51,12 @@ The wake prompt is intentionally minimal: just the popped message's
 …)` hint when `unread > 0`. Claude drives any further `recv`/`send`
 itself via the embedded MCP server.

+Whenever hive-c0re starts / restarts / rebuilds a container, it
+also drops a `system` message into the agent's inbox via
+`Coordinator::kick_agent` — a one-line "you were just (re)started,
+check /state/ for your notes, --continue session is intact". The
+next turn picks it up like any other inbox message.
+
 ### On-boot files

 `hive_ag3nt::turn::write_*` writes three files next to the per-agent
@ -75,7 +87,11 @@ it as a stdio child via `--mcp-config`. The hyperhive socket name is
 - `send(to, body)` — message a peer (logical agent name), another
  agent, or the operator (recipient `operator`, surfaces in the
  dashboard inbox).
- `recv()` — drain one inbox message.
+- `recv(wait_seconds?)` — drain one inbox message. Long-polls
+  server-side; `wait_seconds` is capped at 180 (default 30 when
+  omitted). Agents use a long wait to park their turn waiting for
+  work instead of busy-looping with short polls — they wake
+  instantly when a message arrives.

 ### Manager tools (in addition to send/recv)

@ -87,12 +103,16 @@ it as a stdio child via `--mcp-config`. The hyperhive socket name is
 - `request_apply_commit(agent, commit_ref)` — submit a config
  change for any agent (`hm1nd` for the manager's own config) for
  operator approval.
- `ask_operator(question, options?, multi?)` — surface a question
-  on the dashboard. Non-blocking — returns the queued question id;
-  the operator's answer arrives later as
+- `ask_operator(question, options?, multi?, ttl_seconds?)` —
+  surface a question on the dashboard. Non-blocking — returns the
+  queued question id; the operator's answer arrives later as
  `HelperEvent::OperatorAnswered` in the manager inbox. Options
  always render alongside a free-text fallback; `multi=true`
-  renders options as checkboxes.
+  renders options as checkboxes. `ttl_seconds` auto-cancels with
+  answer `[expired]` after the deadline (useful for time-sensitive
+  decisions that become moot if the operator hasn't responded).
+  The operator can also manually cancel with `[cancelled]` via the
+  dashboard.

 The boundary: lifecycle ops on *existing* sub-agents
 (`kill`/`start`/`restart`) are at the manager's discretion — no
@ -100,6 +120,21 @@ operator approval. Creating a new agent (`request_spawn`) and
 changing any agent's config (`request_apply_commit`) still go
 through the approval queue.

+### Authoritative state
+
+`hive_ag3nt::events::Bus` carries the current turn-loop state in
+addition to the broadcast channel and the events history. Variants:
+
+- `Idle` — sitting on `Recv` waiting for mail.
+- `Thinking` — `claude --print` is running for a turn.
+- `Compacting` — operator-triggered `/compact` is in flight.
+
+The harness flips state at the relevant transitions
+(`set_state(Thinking)` before `drive_turn`, `set_state(Idle)`
+after; `set_state(Compacting)` around `compact_session`). Exposed
+via `/api/state.turn_state` + `turn_state_since` (unix seconds);
+the agent page renders this rather than deriving from SSE events.
+
 ### Tool envelope

 `mcp::run_tool_envelope`: every MCP tool handler logs the request,