hyperhive

damocles b0f6bd8ece fix: self-calibrate context window from API result event the stream-json result event carries modelUsage.<model>.contextWindow which is the actual per-inference active window the model enforces. for claude-sonnet-4-6 this is 200k even though the full prompt cache can hold millions of tokens via accumulated cache reads. with the nix-configured sonnet = 1000000 the proactive compact watermark sat at 750k and was never reached. agents grew context until prompt_too_long at ~170k — reactive compact, no checkpoint turn. changes: - bus gains api_context_window field seeded from modelUsage.*.contextWindow in each turn's result event. authoritative; falls back to env var, then 200k. - new effective_context_window(bus) helper used by both watermark functions - compact_watermark (75%) and auto_reset_watermark (50%) call effective_context_window - context_tokens() docstring clarified: all three token fields (input + cache_read + cache_creation) count against the per-inference contextWindow limit. the large cache_read values seen in the result event are cumulative across all inferences in a turn, not per-inference. - /api/state context_window_tokens now reflects the calibrated window closes #129		2026-05-22 22:20:07 +02:00
..
assets	web: clickable links in terminal rows and dashboard messages (issue #233 )	2026-05-22 01:17:38 +02:00
prompts	apply_commit handles first-time spawns, request_spawn deprecated	2026-05-22 10:52:02 +02:00
src	fix: self-calibrate context window from API result event	2026-05-22 22:20:07 +02:00
Cargo.toml	feat: poll forge notifications in agent harness	2026-05-20 17:59:56 +02:00