hyperhive

Author	SHA1	Message	Date
damocles	77b249076f	events: HIVE_DEFAULT_MODEL takes priority over persisted model (closes #319)	2026-05-23 00:38:17 +02:00
damocles	b0f6bd8ece	fix: self-calibrate context window from API result event the stream-json result event carries modelUsage.<model>.contextWindow which is the actual per-inference active window the model enforces. for claude-sonnet-4-6 this is 200k even though the full prompt cache can hold millions of tokens via accumulated cache reads. with the nix-configured sonnet = 1000000 the proactive compact watermark sat at 750k and was never reached. agents grew context until prompt_too_long at ~170k — reactive compact, no checkpoint turn. changes: - bus gains api_context_window field seeded from modelUsage.*.contextWindow in each turn's result event. authoritative; falls back to env var, then 200k. - new effective_context_window(bus) helper used by both watermark functions - compact_watermark (75%) and auto_reset_watermark (50%) call effective_context_window - context_tokens() docstring clarified: all three token fields (input + cache_read + cache_creation) count against the per-inference contextWindow limit. the large cache_read values seen in the result event are cumulative across all inferences in a turn, not per-inference. - /api/state context_window_tokens now reflects the calibrated window closes #129	2026-05-22 22:20:07 +02:00
damocles	908cadb151	docs: add missing #[must_use], # Errors, # Panics across public api	2026-05-22 19:41:27 +02:00
damocles	30d82148e0	clippy: apply auto-fixable warnings across workspace (closes #265 partial)	2026-05-22 18:55:57 +02:00
damocles	7e2f13cad8	model/context: defaults in nix module, no heuristic in rust	2026-05-20 15:49:03 +02:00
damocles	770cbaccf9	model/context: per-model ctx window overrides + expose window size in /api/state	2026-05-20 15:49:03 +02:00
damocles	9064cd3c57	model/context: configurable default model + model-derived context window	2026-05-20 15:49:03 +02:00
iris	804875d670	surface rate_limited status as red badge on per-agent page and dashboard - add rate_limited: Arc<AtomicBool> to Bus; set/cleared by emit_status - write/remove sentinel file hyperhive-rate-limited in state dir so host-side dashboard can detect it without a live socket call - api_state returns status=rate_limited when flag is set (cold-load accurate) - ALIVE_LABELS gains rate_limited entry (⊘ red chip) on per-agent page - ContainerView gains rate_limited: bool read from sentinel file - dashboard container row shows ⊘ rate limited badge (red) ahead of needs_login Closes #24	2026-05-20 15:16:00 +02:00
damocles	44c903f265	auto session-reset when context large and cache is cold	2026-05-20 14:49:26 +02:00
damocles	f9f1346eae	clippy: zero pedantic warnings across the tree	2026-05-18 22:09:34 +02:00
müde	5c6c607e25	agent badges: split into ctx (last-inference) + cost (cumulative) the existing ctx badge was misnamed: it summed `result.usage`, which is the cumulative tokens billed across every inference in the turn. for tool-heavy turns that easily exceeds the model's context window (a 600k cached prefix × 15 sub-calls = 9M cache_read), making it useless as a "should i compact?" signal. now two separate badges: ctx · N last inference's prompt size = actual context window in use right now. parsed from each `assistant` event's `.message.usage`; the harness tracks the most recent one across the stream and snapshots it when the `result` event lands. cost · M cumulative tokens billed across the whole turn (the previous behaviour, now correctly labelled). both update via a single `TokenUsageChanged { ctx, cost }` SSE event at turn-end. turn_stats grows four columns (`last_input_tokens`, `last_output_tokens`, `last_cache_read_input_tokens`, `last_cache_creation_input_tokens`) so the cold-load seed can paint both badges on page load. migrations run try-and-ignore ALTERs so existing agent dbs catch up; pre-migration rows have last-inference zeros and yield no `ctx` seed (badge stays empty until next turn) rather than a misleading 0.	2026-05-18 18:48:35 +02:00
müde	f827187341	agent ctx-badge: seed Bus::last_usage from latest turn_stats row on startup	2026-05-18 18:00:48 +02:00
müde	8f5752980f	turn_stats: per-turn analytics sink new sqlite table at /state/hyperhive-turn-stats.sqlite on each agent's state dir. one row per claude turn captures identity (model, wake_from, result_kind), timing (started/ended_at, duration_ms), cost (input/output/cache_read/cache_creation token counts), behaviour (tool_call_count + per-tool breakdown JSON), and post-turn snapshot metrics (open_threads_count, open_reminders_count). wire additions: - AgentRequest/ManagerRequest::CountPendingReminders + Broker::count_pending_reminders_for(agent) - Bus::observe_stream + take_tool_calls — pumps the existing stdout stream-json, picks out tool_use blocks, accumulates per turn. bin loops fold the breakdown into each row. - TurnStats::open_default + TurnStatRow + record() — best-effort inserts; failures log + don't block the harness. both ag3nt and m1nd bins capture started_at + duration via Instant::elapsed, fetch open-thread + reminder counts from hive-c0re via the existing socket (post-turn, best-effort), and record one row at turn_end. record_kind splits ok / failed / prompt_too_long; failures carry the error message in note. todo entries for host-side vacuum sweep + reading the table back into agent/dashboard badges.	2026-05-17 23:00:41 +02:00
müde	39d8359c10	agent ui: event-driven status / model / token_usage / turn_state new LiveEvent variants on the per-agent bus — status_changed / model_changed / token_usage_changed / turn_state_changed — replace the per-agent web UI's /api/state polling for the badge row. emit sites: - Bus::set_model → model_changed - Bus::record_usage → token_usage_changed - Bus::set_state → turn_state_changed - turn::wait_for_login → status_changed("online") on creds detect - post_login_start / post_login_cancel → status_changed("needs_login_") per-agent endpoints (post_set_model / post_compact / post_new_session / post_cancel_turn / post_login_) now all return 200; client drops the post-submit refetch except on login transitions, which still need /api/state to render the OAuth form + session stream. client adds dispatch on the four new event kinds, threads `currentLabel` through so the composer re-enables on a live status flip, and no longer fires refreshState() from turn_end or postModel — the events carry the same signal faster. closes the per-agent half of the dashboard event-channel refactor; TODO entry dropped.	2026-05-17 22:49:55 +02:00
müde	b60774a66c	events: LiveEvent::Note becomes struct variant so serde can actually serialize it	2026-05-17 13:14:09 +02:00
müde	1340a654e7	sse: seq plumbing + subscribe-first dedupe dance	2026-05-17 12:26:00 +02:00
damocles	4f56954422	extract TokenUsage::from_stream_event helper to keep run_claude under clippy line limit	2026-05-17 02:59:51 +02:00
damocles	ce740483c6	show token usage on per-agent web ui after each turn	2026-05-17 02:59:51 +02:00
müde	411cf86632	nix fmt + rustfmt sweep	2026-05-17 01:40:28 +02:00
damocles	a6d1464071	refactor: per-agent state paths (/agents/{label}/state), centralize in paths.rs	2026-05-16 15:18:32 +02:00
müde	034b4fde10	force fresh session: ↻ new session button + /new-session bus carries a one-shot AtomicBool armed by POST /api/new-session (or the /new-session slash command). next turn drops --continue, starting a fresh claude session; the flag clears automatically so subsequent turns resume normal behavior. /compact still always uses --continue; compacting a non-existent session is a no-op anyway. per-agent page grows an ↻ new session button next to the cancel-turn one (always visible, amber, confirms before posting since dropping --continue context isn't reversible). slash-command surface picks up /new-session for parity with the button. note row emitted on the live feed both at arm-time and again when the turn actually consumes the flag, so the operator can confirm it landed.	2026-05-16 00:44:45 +02:00
müde	8b9f7d21b7	model persisted to /state; stop auto-allowing claude-code unfree model persistence: /model <name> now writes to /state/hyperhive-model (in-container), Bus::new reads it on init. operator override survives harness restart and container rebuild; gone on --purge like every other piece of agent state. path overridable via HYPERHIVE_MODEL_FILE for tests. failure to persist is a warn, not fatal — runtime override still applies, just won't survive a restart. unfree opt-in: drop the auto-allowUnfreePredicate from harness-base.nix and the claude-unstable overlay. operator now has to set nixpkgs.config.allowUnfree (or a predicate listing claude-code) in their own host config. silent unfree bypass was sketchy; this is honest. readme + gotchas updated to spell out the snippet. todo: drops model-persistence + container-crash + journald (all shipped); adds per-agent send allow-list (constrain who an agent can message).	2026-05-15 21:05:40 +02:00
müde	6db38cf70c	model: runtime override via /model slash; fixes for port + bind - runtime model override: Bus::{model,set_model} + POST /api/model (form-encoded {model: name}). turn.rs reads bus.model() per turn so a flip lands on the next claude invocation. /api/state grows a model field; agent page shows a 'model · <name>' chip in the state row. '/model <name>' slash command POSTs to the endpoint and refreshes state. - port regression fix: agent_web_port no longer probes forward for existing agents (the previous fix shifted ports for any agent without a port file, including legacy ones whose container was already bound to the bare hashed port — dashboard rendered the new port, container was still on the old one, conn errors). new rule: port file exists → use it; absent + applied flake present → legacy, persist port_hash without probing; absent + no applied flake → fresh spawn, probe forward. - SO_REUSEADDR on both the dashboard and per-agent web UI binds via tokio::net::TcpSocket. operator hit 12 retries failing on manager :8000 — REUSEADDR handles the TIME_WAIT case cleanly without a new dep; retry still covers the genuine process-still-alive overlap. todo: drops the model-override entry (shipped); adds two new items — model persistence (optional, future), and custom per-agent MCP tools (groundwork for moving bitburner-agent into hyperhive).	2026-05-15 20:59:45 +02:00
müde	637085644d	server-side TurnState in the harness, exposed via /api/state new TurnState { Idle, Thinking, Compacting } on hive_ag3nt::events::Bus with set_state + state_snapshot. the turn loops in hive-ag3nt and hive-m1nd flip Thinking before drive_turn and Idle after; the web_ui's /api/compact handler flips Compacting around compact_session. per-agent /api/state grows turn_state + turn_state_since (unix seconds). frontend prefers the server-reported state over the client-derived one — setStateAbs takes the absolute since-time so the 'last turn' chip reads the actual server-side duration instead of the client's perceived gap between SSE events. SSE turn_start / turn_end still drive state instantly between renders; /api/state re-anchors on each turn_end refresh. new compacting state gets its own purple badge with pulse animation (mirrors thinking's amber). napping will slot in the same way once the nap tool lands.	2026-05-15 20:46:38 +02:00
müde	7b4adea325	dashboard: lifecycle_action helper collapses start/stop/restart/rebuild five POST handlers (post_kill / post_restart / post_start / post_rebuild) were all repeating the same boilerplate: strip prefix, set_transient, call lifecycle::X, clear_transient, match the result. extract one helper that takes the transient kind, error-message verb, the work body, and an optional 'on success' tail (used by kill to also unregister + emit HelperEvent::Killed). each handler shrinks to a single lifecycle_action(..) call. zero behavior change.	2026-05-15 20:12:03 +02:00
müde	89ccc5e6c5	events.sqlite vacuum moves host-side retention is a host concern — agents have no business doing their own cleanup, and a misbehaving harness could skip it. drop spawn_events_vacuum from both hive-ag3nt and hive-m1nd, drop the matching Bus::vacuum + EventStore::vacuum methods. new hive_c0re::events_vacuum module sweeps every existing agents/<name>/state/hyperhive-events.sqlite on the same hourly cadence as the broker vacuum. same two-stage delete (older than 7 days, trim to 2000 newest). called from main alongside broker vacuum. also: server-side state badge entered into todo.md (today's badge is derived client-side from sse, fine for idle/thinking but a state machine that grows compacting/napping wants authoritative status from the harness).	2026-05-15 20:10:34 +02:00
müde	de09503b59	events: persist to sqlite, survive harness restart hive_ag3nt::events::Bus replaces its in-memory VecDeque with a sqlite-backed store at /state/hyperhive-events.sqlite (overridable via HYPERHIVE_EVENTS_DB). emit() inserts a row; history() reads back the most recent 2000 events. survives harness restart now: operator reload mid-investigation no longer wipes the trail. vacuum runs hourly (immediate first sweep): drop rows older than 7 days, then trim to 2000 newest. two-stage so a quiet agent keeps a useful tail and a chatty one stays bounded. wired into both hive-ag3nt and hive-m1nd via spawn_events_vacuum. if the db open fails (e.g. no /state mount in dev), Bus runs in no-store mode: events still broadcast, just nothing persisted.	2026-05-15 19:42:57 +02:00
müde	d943bddd9e	agent ui: input lives in terminal section, banner shimmer on activity agent page restructure: - send form moves into the terminal panel as a prompt-style row beneath the live tail (status line stays above so it still reads as a header). - live panel + prompt share a single bordered 'terminal-wrap' box. - harness-alive / login-state status lines drop their decorative ascii bookends; just a leading dot/glyph remains. - banner gradient is now a real css gradient with a shimmer animation toggled by an .active class. turn_start adds it, turn_end removes it. dashboard side mirrors this: each broker sse event nudges a 4s shimmer window. - dashboard container rows drop their static ▓█▓▒░ / ▒░▒░░ glyph prefixes; the role chips already disambiguate m1nd vs ag3nt. - empty-state placeholders drop the ▓ bookends. terminal pre-fill: hive-ag3nt::events::Bus grows a 500-event ring buffer; new GET /events/history endpoint returns it. The agent JS fetches history before opening the SSE stream so opening the page mid-turn shows the last N events instead of a blank panel. The replay walks turn_start/turn_end pairs to seed the banner-active state correctly if a turn was still open.	2026-05-15 18:54:19 +02:00
müde	ace13cd785	agent ui: terminal-themed live panel; pretty tool calls; collapsed results - tool_use renders per-tool (Read /path, Bash $ cmd, send → operator: ...) - tool_result with >120 chars collapses into <details>; short ones inline - session_init / result / rate_limit dropped from the panel - thinking content shown inline if present, fallback indicator otherwise - TurnStart carries unread count → header badge "· 3 unread" - per-tool [status] line dropped from envelope; lives in wake prompt + UI - send form moved below the live panel - live panel themed as a terminal (crust bg, inset shadow, monospace)	2026-05-15 18:20:58 +02:00
müde	9eab28a716	agent ui: live event panel via SSE + stream-json	2026-05-15 15:01:26 +02:00

30 commits
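
note on b0f6bd8ece: the fallback chain and the two watermarks described in that commit can be sketched roughly as below. this is a minimal illustration, not the code from the repository. the `Bus` and `TokenUsage` structs here are hypothetical stand-ins, as are the field name `env_context_window` and the constant `FALLBACK_CONTEXT_WINDOW`; only the function names, the percentages, the fallback order, and the 200k default come from the commit message.

```rust
/// Hypothetical stand-in for the parts of the harness bus this sketch needs.
struct Bus {
    /// Seeded from modelUsage.<model>.contextWindow in a turn's result event.
    /// `None` until the first result event has been seen.
    api_context_window: Option<u64>,
    /// Window configured through the environment (assumed field name).
    env_context_window: Option<u64>,
}

/// Last-resort default named in the commit message (200k tokens).
const FALLBACK_CONTEXT_WINDOW: u64 = 200_000;

/// The API-reported window is authoritative; otherwise fall back to the
/// configured value, then to the 200k default.
fn effective_context_window(bus: &Bus) -> u64 {
    bus.api_context_window
        .or(bus.env_context_window)
        .unwrap_or(FALLBACK_CONTEXT_WINDOW)
}

/// Proactive compact threshold: 75% of the effective window.
fn compact_watermark(bus: &Bus) -> u64 {
    effective_context_window(bus) * 3 / 4
}

/// Auto session-reset threshold: 50% of the effective window.
fn auto_reset_watermark(bus: &Bus) -> u64 {
    effective_context_window(bus) / 2
}

/// Hypothetical per-inference usage record.
struct TokenUsage {
    input: u64,
    cache_read: u64,
    cache_creation: u64,
}

/// All three token fields count against the per-inference window.
fn context_tokens(usage: &TokenUsage) -> u64 {
    usage.input + usage.cache_read + usage.cache_creation
}
```

with a configured window of 1000000 and no calibration, `compact_watermark` comes out at 750000, which matches the unreachable watermark the commit describes. once the API reports 200000 it drops to 150000, below the ~170k point where prompt_too_long was being hit.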