hyperhive

Author	SHA1	Message	Date
müde	8f5752980f	turn_stats: per-turn analytics sink new sqlite table at /state/hyperhive-turn-stats.sqlite on each agent's state dir. one row per claude turn captures identity (model, wake_from, result_kind), timing (started/ended_at, duration_ms), cost (input/output/cache_read/cache_creation token counts), behaviour (tool_call_count + per-tool breakdown JSON), and post-turn snapshot metrics (open_threads_count, open_reminders_count). wire additions: - AgentRequest/ManagerRequest::CountPendingReminders + Broker::count_pending_reminders_for(agent) - Bus::observe_stream + take_tool_calls — pumps the existing stdout stream-json, picks out tool_use blocks, accumulates per turn. bin loops fold the breakdown into each row. - TurnStats::open_default + TurnStatRow + record() — best-effort inserts; failures log + don't block the harness. both ag3nt and m1nd bins capture started_at + duration via Instant::elapsed, fetch open-thread + reminder counts from hive-c0re via the existing socket (post-turn, best-effort), and record one row at turn_end. record_kind splits ok / failed / prompt_too_long; failures carry the error message in note. todo entries for host-side vacuum sweep + reading the table back into agent/dashboard badges.	2026-05-17 23:00:41 +02:00
damocles	dc1ce1f236	open_threads: new get_open_threads MCP tool on agent + manager surfaces	2026-05-17 22:52:08 +02:00
müde	39d8359c10	agent ui: event-driven status / model / token_usage / turn_state new LiveEvent variants on the per-agent bus — status_changed / model_changed / token_usage_changed / turn_state_changed — replace the per-agent web UI's /api/state polling for the badge row. emit sites: - Bus::set_model → model_changed - Bus::record_usage → token_usage_changed - Bus::set_state → turn_state_changed - turn::wait_for_login → status_changed("online") on creds detect - post_login_start / post_login_cancel → status_changed("needs_login_") per-agent endpoints (post_set_model / post_compact / post_new_session / post_cancel_turn / post_login_) now all return 200; client drops the post-submit refetch except on login transitions, which still need /api/state to render the OAuth form + session stream. client adds dispatch on the four new event kinds, threads `currentLabel` through so the composer re-enables on a live status flip, and no longer fires refreshState() from turn_end or postModel — the events carry the same signal faster. closes the per-agent half of the dashboard event-channel refactor; TODO entry dropped.	2026-05-17 22:49:55 +02:00
müde	b444dac6e8	agent ui: consolidate status into state-row badges drop the "● harness alive — turn loop running" paragraph; the new #alive-badge chip in the state row carries the same signal across all statuses (loading / online / needs-login / offline) with colour coding. token-usage chip renamed + restyled as #ctx-badge — primary number is total context-window tokens used, mirroring claude code's "N tokens" indicator. every state-row badge now has hover detail: state-badge gets per-state tooltips + age suffix, model-chip explains the /model command, last-turn shows the raw ms duration, ctx-badge breaks out input / cache_read / cache_write / output. new todo entry for the per-turn stats sink (start/end/model/ tokens/tool-call-count) the harness should be writing.	2026-05-17 22:36:02 +02:00
damocles	15f141801b	limits: raise message body cap 1k → 4k (catches ~95% of conversational overflow)	2026-05-17 22:15:25 +02:00
damocles	82b0877c47	ask: rename ask_operator → ask + optional 'to' for agent-to-agent Q&A	2026-05-17 13:20:32 +02:00
müde	b60774a66c	events: LiveEvent::Note becomes struct variant so serde can actually serialize it	2026-05-17 13:14:09 +02:00
müde	aa24080f7b	agent: /send returns 200 (terminal + turn-end refresh already cover the visual update)	2026-05-17 12:41:37 +02:00
müde	1340a654e7	sse: seq plumbing + subscribe-first dedupe dance	2026-05-17 12:26:00 +02:00
müde	f27108aecf	agent: route terminal scroll+backfill+SSE through hive-fr0nt::TERMINAL_JS	2026-05-17 11:53:50 +02:00
müde	0b9e7cbcf6	css: extract terminal pane styles to hive-fr0nt::TERMINAL_CSS	2026-05-17 11:50:39 +02:00
müde	e283e39949	css: route palette + body typography through hive-fr0nt::BASE_CSS	2026-05-17 11:47:45 +02:00
damocles	1770b51845	manager mcp: expose 'remind' tool sharing storage helper with agent surface	2026-05-17 11:43:14 +02:00
damocles	0e6bac8388	limits: unified 1 KiB cap on send/ask + reminder auto-file on overflow	2026-05-17 11:36:12 +02:00
damocles	07b7988915	agent mcp: add 'remind' to --allowedTools so claude doesn't have to ask	2026-05-17 11:20:01 +02:00
damocles	f2484b5e78	agent mcp: expose 'remind' tool for self-scheduled wakes	2026-05-17 10:54:36 +02:00
damocles	4f56954422	extract TokenUsage::from_stream_event helper to keep run_claude under clippy line limit	2026-05-17 02:59:51 +02:00
damocles	ce740483c6	show token usage on per-agent web ui after each turn	2026-05-17 02:59:51 +02:00
damocles	ca86bcf4bd	add claudePluginsAutoUpdate NixOS option, default false	2026-05-17 02:59:51 +02:00
müde	411cf86632	nix fmt + rustfmt sweep	2026-05-17 01:40:28 +02:00
müde	597351ca4e	harness: declarative claude plugin marketplaces new `hyperhive.claudeMarketplaces` option (list of strings — URL, path, or github:owner/repo). harness boot adds each via `claude plugin marketplace add` before updating + installing the configured plugins, so specs like `foo@some-marketplace` resolve on a fresh container. idempotent: 'already exists' stderr is treated as success.	2026-05-17 01:36:18 +02:00
müde	4a06615c5c	fix /state paths: sub-agents use /agents/<name>/state, not /state sub-agent containers post-refactor bind their state at /agents/<name>/state (manager keeps the legacy /state — see lifecycle.rs:751). agent.md still said /state/forge-token; corrected to /agents/{label}/state/forge-token (template-substituted at boot). tea-login systemd unit now walks both candidates so the same harness module works for the manager and sub-agents.	2026-05-16 23:37:49 +02:00
müde	9fc7cae132	prompts: tell agents + manager about the code forge; todo: shared docs repo system prompts now describe the hyperhive Forgejo at localhost:3000, the per-agent user, the pre-configured tea CLI, and the REST API fallback with /state/forge-token. todo gains the shared docs/skills RO-repo follow-up (org-shared + per-agent read membership).	2026-05-16 23:36:05 +02:00
damocles	824acee134	include agent label in turn failure notification body	2026-05-16 20:45:19 +02:00
damocles	1023acf69f	add get_logs tool to manager mcp surface	2026-05-16 20:45:19 +02:00
damocles	fca480b86e	add turn lock to prevent /compact racing with in-flight turns	2026-05-16 20:45:19 +02:00
damocles	25508d7399	fix manager loop: pending wake + move sleep into Empty arm only	2026-05-16 20:45:19 +02:00
müde	f2a0dc4107	re-apply TodoWrite removal + deny list (lost in subsequent merge)	2026-05-16 19:47:55 +02:00
damocles	f3739d2b8e	update plugin marketplaces before install at harness boot	2026-05-16 18:51:06 +02:00
damocles	dc53615686	fix stale /state refs in agent and manager prompts	2026-05-16 18:50:15 +02:00
müde	772fdd8320	forward plugin install failures to manager from sub-agents install_configured now takes an optional notify recipient. on a non-zero or spawn-failed 'claude plugin install', sub-agents send the spec + stderr to manager via the hyperhive socket; manager passes None so it doesn't message itself. boot still proceeds either way — notification is best-effort.	2026-05-16 17:24:04 +02:00
müde	3e040d5b16	agent: forward unhandled turn failures to manager run_claude now keeps a 20-line stderr ring buffer and bails with it inline (was just 'exit <status>'). agent serve loop, on Failed (not PromptTooLong — that's already absorbed by drive_turn's compaction retry), sends the error body to manager via the normal hyperhive send. swallows transport errors — failure is already in journald and the events sqlite. manager-only harness (hive-m1nd) is unchanged so it doesn't try to notify itself.	2026-05-16 16:04:35 +02:00
müde	7ec658851a	back out bypassPermissions: claude refuses it under root uid claude-code rejects --dangerously-skip-permissions / defaultMode= bypassPermissions when running as root, which all hyperhive containers do. revert to the previous explicit allow-list plumbing (per-flavor list spliced into permissions.allow + --tools enable list), keep TodoWrite out of the built-in allow set, and keep the deny list (TodoWrite, WebFetch, WebSearch, Task) as belt-and-braces in case anything sneaks past the allow gate.	2026-05-16 15:58:41 +02:00
müde	36c7f3d1c7	mirror claude stderr to tracing so journald captures it bus-only note made post-mortems require the web UI / events sqlite; now stderr lines also land in 'journalctl -M <container> -b' alongside the existing LiveEvent::Note for the dashboard.	2026-05-16 15:30:03 +02:00
müde	7d33da3727	retry hive socket up to 5x over 60s, surface retry count to claude socket client now retries connect/IO failures with 2-4-8-16-30s backoffs (60s total budget). transparent for non-tool callers via request(); tool handlers go through request_retried() which also returns the retry count, then annotate_retries() appends a one-line note to the tool result so claude knows the slow round-trip was a c0re flicker, not a content failure — avoids burning tokens on an LLM-level retry.	2026-05-16 15:28:18 +02:00
damocles	4a8a668348	feat: add optional description to request_apply_commit and request_spawn	2026-05-16 15:18:32 +02:00
damocles	a6d1464071	refactor: per-agent state paths (/agents/{label}/state), centralize in paths.rs	2026-05-16 15:18:32 +02:00
damocles	a82009cf8c	docs: update agent prompt to reference /agents/{label}/state and /agents/{label}/claude	2026-05-16 15:18:19 +02:00
müde	6dd17864ac	auto-install claude plugins at harness boot new hyperhive.claudePlugins NixOS option (list of strings) rendered to /etc/hyperhive/claude-plugins.json. both hive-ag3nt and hive-m1nd shell out 'claude plugin install <spec>' for each entry once at startup before the turn loop opens. failures log a warning but don't abort boot.	2026-05-16 15:17:34 +02:00
müde	8e7405db13	bypass-mode perms + deny list, drop allow-list plumbing claude-settings.json now sets permissions.defaultMode=bypassPermissions with a small deny list (WebFetch, WebSearch, Task, TodoWrite). The per-flavor allow list and --tools / --allowedTools CLI flags are gone — anything not denied auto-approves. mcp.rs loses ALLOWED_BUILTIN_TOOLS, builtin_tools_arg, allow_list, allowed_mcp_tools. The extraMcpServers allowedTools field is parsed for back-compat but no longer wired anywhere; restrict via permissions.deny instead.	2026-05-16 15:17:30 +02:00
damocles	3d2a7ffec7	fix: auto-wake after turn if pending messages exist, don't block on recv	2026-05-16 13:50:11 +02:00
damocles	d99e0812d0	fix: move sleep to only occur when recv returns empty, avoid message delivery delay	2026-05-16 13:48:04 +02:00
damocles	ab0df71068	docs: update system prompts to document /shared directory	2026-05-16 13:43:05 +02:00
damocles	abcf7a0c41	implement broadcast messaging: send to '*' reaches all agents with hint	2026-05-16 13:16:13 +02:00
damocles	286da8980e	Revert "mcp: wire extra server allowedTools into --allowedTools arg" This reverts commit e0b18ff3c2ec5a7f771ab9a1a247ff4a24a8c475.	2026-05-16 12:49:59 +02:00
damocles	caa495aeda	mcp: wire extra server allowedTools into --allowedTools arg	2026-05-16 12:49:59 +02:00
müde	67e4242b9f	per-agent send allow-list via hyperhive.allowedRecipients new NixOS option in harness-base.nix: hyperhive.allowedRecipients = [ 'alice' 'manager' ]; # whitelist hyperhive.allowedRecipients = [ ]; # default = unrestricted module writes the list as JSON to /etc/hyperhive/send-allow .json at activation. AgentServer::send reads the file before issuing the broker request; if the list is non-empty and `to` isn't on it, the tool returns a claude-readable refusal string without touching the broker. the manager is always implicitly permitted regardless of the list — otherwise a misconfigured allow-list could strand a sub-agent without an escalation path. enforcement is in the in-container MCP server (not on the host's per-agent socket) because the agent's nix config is the trust boundary anyway — the operator audits agent.nix at deploy time, the activation-time /etc/hyperhive/send-allow .json is r/o under /nix/store, so the agent can't tamper at runtime without going through a new approval. agent prompt mentions the option + tells claude to route through the manager when refused. retires the matching TODO under Permissions / policy.	2026-05-16 03:59:28 +02:00
müde	06af23c8a4	recv: None = peek, positive value = opt-in long-poll old behavior: omitted wait_seconds fell through to the 30s RECV_LONG_POLL_DEFAULT — claude calling 'is there anything in my inbox right now?' between actions blocked the turn for half a minute. flip the semantics: None (or 0) returns immediately, positive value parks up to MAX (180s, unchanged). cleaner 'peek vs wait' distinction; tool descriptions + agent/manager prompts updated to point at the new shape. harness's own serve loops in hive-ag3nt + hive-m1nd relied on the old default for their inbox poll. they now explicitly pass wait_seconds: Some(180) to opt into the full park — same effective behavior as before, just spelled out. retires the matching TODO under Turn loop.	2026-05-16 03:22:42 +02:00
müde	90df2106bf	agent socket: external wake-up path for in-container MCP servers new AgentRequest::Wake { from, body } drops a message into this agent's inbox via the per-agent socket. matrix-style MCP servers can use it when they receive an external event (matrix message, webhook, scrape result) to nudge claude into running a turn. broker.send wakes whatever Recv is currently long-polling, the harness picks the message up, formats a wake prompt with the caller's chosen from label ('matrix: new dm', 'webhook: deploy succeeded', etc.). new `hive-ag3nt wake --from <label> --body <text>` subcommand on the harness binary so MCP servers can shell out instead of implementing the line-JSON protocol themselves; body=='-' reads from stdin for multi-line / quoting-friendly payloads. identity = socket: anything that can connect to /run/hive/mcp .sock is implicitly trusted to inject. that's fine because the bind-mount is the agent's own container; no new auth surface opens up. docs/turn-loop.md gets a new 'Waking the agent from inside the container' section pointing at both paths (CLI + raw JSON).	2026-05-16 03:15:58 +02:00
müde	3db33b0fe5	agent flake.nix: forward inputs as flakeInputs module arg new boilerplate wraps agent.nix as a sub-module + passes every flake input (minus self) through to it via _module.args.flake Inputs. manager edits the inputs block of flake.nix to pull in out-of-tree flakes (MCP servers etc.) and references them in agent.nix as flakeInputs.<name>.packages.${pkgs.system}.default — the new input's pinned sha lands in the agent's own flake .lock (already tracked + part of the proposal flow), and transitively rolls up into meta's lock. migrate's MODULE_FLAKE_MARKER swaps to _module.args.flakeInputs so existing agents on the old 'nixosModules.default = import ./agent.nix' template get re-rendered onto the new shape on next hive-c0re start. manager_server's flake.nix tamper-check goes away — the build path's failed/<id> annotated tag already provides the safety net when a manager edit breaks the flake; enforcing 'no flake.nix edits at all' was overly strict (blocks the inputs- addition pattern that's the whole point of this change). manager prompt updated with a worked example for adding an MCP-server flake input + wiring it through agent.nix.	2026-05-16 02:23:43 +02:00

1 2 3

141 commits