hyperhive

Author	SHA1	Message	Date
müde	5c6c607e25	agent badges: split into ctx (last-inference) + cost (cumulative) the existing ctx badge was misnamed: it summed `result.usage`, which is the cumulative tokens billed across every inference in the turn. for tool-heavy turns that easily exceeds the model's context window (a 600k cached prefix × 15 sub-calls = 9M cache_read), making it useless as a "should i compact?" signal. now two separate badges: ctx · N last inference's prompt size = actual context window in use right now. parsed from each `assistant` event's `.message.usage`; the harness tracks the most recent one across the stream and snapshots it when the `result` event lands. cost · M cumulative tokens billed across the whole turn (the previous behaviour, now correctly labelled). both update via a single `TokenUsageChanged { ctx, cost }` SSE event at turn-end. turn_stats grows four columns (`last_input_tokens`, `last_output_tokens`, `last_cache_read_input_tokens`, `last_cache_creation_input_tokens`) so the cold-load seed can paint both badges on page load. migrations run try-and-ignore ALTERs so existing agent dbs catch up; pre-migration rows have last-inference zeros and yield no `ctx` seed (badge stays empty until next turn) rather than a misleading 0.	2026-05-18 18:48:35 +02:00
müde	8f5752980f	turn_stats: per-turn analytics sink new sqlite table at /state/hyperhive-turn-stats.sqlite on each agent's state dir. one row per claude turn captures identity (model, wake_from, result_kind), timing (started/ended_at, duration_ms), cost (input/output/cache_read/cache_creation token counts), behaviour (tool_call_count + per-tool breakdown JSON), and post-turn snapshot metrics (open_threads_count, open_reminders_count). wire additions: - AgentRequest/ManagerRequest::CountPendingReminders + Broker::count_pending_reminders_for(agent) - Bus::observe_stream + take_tool_calls — pumps the existing stdout stream-json, picks out tool_use blocks, accumulates per turn. bin loops fold the breakdown into each row. - TurnStats::open_default + TurnStatRow + record() — best-effort inserts; failures log + don't block the harness. both ag3nt and m1nd bins capture started_at + duration via Instant::elapsed, fetch open-thread + reminder counts from hive-c0re via the existing socket (post-turn, best-effort), and record one row at turn_end. record_kind splits ok / failed / prompt_too_long; failures carry the error message in note. todo entries for host-side vacuum sweep + reading the table back into agent/dashboard badges.	2026-05-17 23:00:41 +02:00
müde	39d8359c10	agent ui: event-driven status / model / token_usage / turn_state new LiveEvent variants on the per-agent bus — status_changed / model_changed / token_usage_changed / turn_state_changed — replace the per-agent web UI's /api/state polling for the badge row. emit sites: - Bus::set_model → model_changed - Bus::record_usage → token_usage_changed - Bus::set_state → turn_state_changed - turn::wait_for_login → status_changed("online") on creds detect - post_login_start / post_login_cancel → status_changed("needs_login_") per-agent endpoints (post_set_model / post_compact / post_new_session / post_cancel_turn / post_login_) now all return 200; client drops the post-submit refetch except on login transitions, which still need /api/state to render the OAuth form + session stream. client adds dispatch on the four new event kinds, threads `currentLabel` through so the composer re-enables on a live status flip, and no longer fires refreshState() from turn_end or postModel — the events carry the same signal faster. closes the per-agent half of the dashboard event-channel refactor; TODO entry dropped.	2026-05-17 22:49:55 +02:00
müde	b60774a66c	events: LiveEvent::Note becomes struct variant so serde can actually serialize it	2026-05-17 13:14:09 +02:00
damocles	4f56954422	extract TokenUsage::from_stream_event helper to keep run_claude under clippy line limit	2026-05-17 02:59:51 +02:00
damocles	ce740483c6	show token usage on per-agent web ui after each turn	2026-05-17 02:59:51 +02:00
müde	411cf86632	nix fmt + rustfmt sweep	2026-05-17 01:40:28 +02:00
müde	3e040d5b16	agent: forward unhandled turn failures to manager run_claude now keeps a 20-line stderr ring buffer and bails with it inline (was just 'exit <status>'). agent serve loop, on Failed (not PromptTooLong — that's already absorbed by drive_turn's compaction retry), sends the error body to manager via the normal hyperhive send. swallows transport errors — failure is already in journald and the events sqlite. manager-only harness (hive-m1nd) is unchanged so it doesn't try to notify itself.	2026-05-16 16:04:35 +02:00
müde	7ec658851a	back out bypassPermissions: claude refuses it under root uid claude-code rejects --dangerously-skip-permissions / defaultMode= bypassPermissions when running as root, which all hyperhive containers do. revert to the previous explicit allow-list plumbing (per-flavor list spliced into permissions.allow + --tools enable list), keep TodoWrite out of the built-in allow set, and keep the deny list (TodoWrite, WebFetch, WebSearch, Task) as belt-and-braces in case anything sneaks past the allow gate.	2026-05-16 15:58:41 +02:00
müde	36c7f3d1c7	mirror claude stderr to tracing so journald captures it bus-only note made post-mortems require the web UI / events sqlite; now stderr lines also land in 'journalctl -M <container> -b' alongside the existing LiveEvent::Note for the dashboard.	2026-05-16 15:30:03 +02:00
damocles	a6d1464071	refactor: per-agent state paths (/agents/{label}/state), centralize in paths.rs	2026-05-16 15:18:32 +02:00
müde	8e7405db13	bypass-mode perms + deny list, drop allow-list plumbing claude-settings.json now sets permissions.defaultMode=bypassPermissions with a small deny list (WebFetch, WebSearch, Task, TodoWrite). The per-flavor allow list and --tools / --allowedTools CLI flags are gone — anything not denied auto-approves. mcp.rs loses ALLOWED_BUILTIN_TOOLS, builtin_tools_arg, allow_list, allowed_mcp_tools. The extraMcpServers allowedTools field is parsed for back-compat but no longer wired anywhere; restrict via permissions.deny instead.	2026-05-16 15:17:30 +02:00
müde	50ef806266	operator pronouns: configurable free-text, threaded into prompts new NixOS module option services.hive-c0re.operatorPronouns (free text, default 'she/her', example 'they/them'). hive-c0re takes it as a CLI flag (--operator-pronouns, lib.escapeShellArg'd in the systemd unit), stores it on Coordinator, threads it into the meta flake's mkAgent so each agent's systemd service gets HIVE_OPERATOR_PRONOUNS set. the harness reads the env at boot and substitutes {operator_pronouns} into the agent / manager system prompt alongside {label}. nix string is escaped against backslash + double-quote so non-ascii / quoted values round-trip safely. prompt addendum: both agent.md and manager.md mention the operator's pronouns up front so claude uses them naturally in third-person reference. propagates on next ↻ R3BU1LD (meta lock bump, no per-agent approval).	2026-05-16 02:05:22 +02:00
müde	d94712bde8	turn: unify run_turn / compact_session via TurnFiles new TurnFiles bundle (mcp_config + settings + system_prompt + flavor) materialised once per harness boot, passed to drive_turn and compact_session alike. operator-initiated /compact now uses the exact same session shape as a normal turn — same MCP surface, same allowed tools, same role prompt — only the stdin payload differs (/compact vs the wake-up body). web_ui's AppState carries the TurnFiles instead of (label + socket + flavor + ad-hoc file writes per click). bin/hive-ag3nt and bin/hive-m1nd prepare TurnFiles before spawning the web UI and pass them to both surfaces. web_ui::Flavor folds into a type alias for mcp::Flavor — no two-stage enum mapping. removes ClaudeMode + the run_claude variant fork (system prompt was Option, mcp args were skipped on Compact). dead 'mode' plumbing gone.	2026-05-16 00:57:58 +02:00
müde	02139efbb1	turn: spawn claude with cwd = /state every claude invocation now runs with current_dir set to /state — relative paths in tool calls (Read notes.md, Bash ls, Write blob) land in the agent's durable bind-mounted dir instead of the harness's systemd cwd. /state is RW + survives destroy/recreate so this matches where the agent is told to keep notes anyway. dev/test setups without the bind silently fall back to the parent cwd.	2026-05-16 00:46:19 +02:00
müde	034b4fde10	force fresh session: ↻ new session button + /new-session bus carries a one-shot AtomicBool armed by POST /api/new-session (or the /new-session slash command). next turn drops --continue, starting a fresh claude session; the flag clears automatically so subsequent turns resume normal behavior. /compact still always uses --continue — compacting a non-existent session is a no-op anyway. per-agent page grows an ↻ new session button next to the cancel-turn one (always visible, amber, confirms before posting since dropping --continue context isn't reversible). slash-command surface picks up /new-session for parity with the button. note row emitted on the live feed both at arm- time and again when the turn actually consumes the flag, so the operator can confirm it landed.	2026-05-16 00:44:45 +02:00
müde	6db38cf70c	model: runtime override via /model slash; fixes for port + bind - runtime model override: Bus::{model,set_model} + POST /api/model (form-encoded {model: name}). turn.rs reads bus.model() per turn so a flip lands on the next claude invocation. /api/state grows a model field; agent page shows a 'model · <name>' chip in the state row. '/model <name>' slash command POSTs to the endpoint and refreshes state. - port regression fix: agent_web_port no longer probes forward for existing agents (the previous fix shifted ports for any agent without a port file, including legacy ones whose container was already bound to the bare hashed port — dashboard rendered the new port, container was still on the old one, conn errors). new rule: port file exists → use it; absent + applied flake present → legacy, persist port_hash without probing; absent + no applied flake → fresh spawn, probe forward. - SO_REUSEADDR on both the dashboard and per-agent web UI binds via tokio::net::TcpSocket. operator hit 12 retries failing on manager :8000 — REUSEADDR handles the TIME_WAIT case cleanly without a new dep; retry still covers the genuine process-still-alive overlap. todo: drops the model-override entry (shipped); adds two new items — model persistence (optional, future), and custom per-agent MCP tools (groundwork for moving bitburner-agent into hyperhive).	2026-05-15 20:59:45 +02:00
müde	d8807b8e8c	claude: pass --settings as a file path (avoid argv length limit)	2026-05-15 18:12:07 +02:00
müde	cf8f1e64b1	claude settings: extract to prompts/claude-settings.json (formatted)	2026-05-15 18:09:05 +02:00
müde	71bf8bf47e	claude: pin effortLevel=medium in inline settings	2026-05-15 18:07:53 +02:00
müde	68fe66c0ef	claude: static role/tools moved to --system-prompt-file	2026-05-15 17:44:15 +02:00
müde	e9b213690e	dedupe: lift drive_turn/emit_turn_end/wait_for_login into hive_ag3nt::turn	2026-05-15 16:27:51 +02:00
müde	70af56e050	turn loop: --continue, disable claude auto-compact, /compact on overflow	2026-05-15 15:40:51 +02:00
müde	09787659ab	manager: same agent loop, ManagerServer MCP surface	2026-05-15 15:13:26 +02:00

24 commits