forked from tiann/hapi
-
Notifications
You must be signed in to change notification settings - Fork 0
Step 2.75: Replay harness v0 + CI gate #24
Copy link
Copy link
Open
Labels
architectureArchitectural / substrate workArchitectural / substrate workfleet-overseerFleet attention-arbitration architectureFleet attention-arbitration architecturemvpPart of the Overseer MVP acceptance bar (Steps 1-4)Part of the Overseer MVP acceptance bar (Steps 1-4)
Metadata
Metadata
Assignees
Labels
architectureArchitectural / substrate workArchitectural / substrate workfleet-overseerFleet attention-arbitration architectureFleet attention-arbitration architecturemvpPart of the Overseer MVP acceptance bar (Steps 1-4)Part of the Overseer MVP acceptance bar (Steps 1-4)
Goal
Build the fleet replay harness — captured-stream loader, run-once promotion/prioritization entry point, golden-scenario assertions, the one-boss invariant test stub — and gate it in CI so Overseer logic changes can be told apart from regressions.
Spec
docs/plans/2026-06-03-overseer-build-sequence.mdStep 2.75 (primary)docs/plans/2026-06-03-overseer-prioritization.md§6 (replay / evaluation harness, golden test cases, KPIs)docs/adr/0001-worker-facing-attribution-one-boss.md§"Invariant test" (the one-boss invariant stub)docs/plans/2026-06-03-overseer-contracts.md§7 (transcript retention — fixtures must not be production transcripts)Acceptance
progressevents surface nothing; samededupe_keycollapses; root-causeblocked_bychain surfaces upstream not symptoms; stale-item aging; etc. Initial target: at least 10 of the listed scenarios.dispatchedevent, the corresponding worker-facingmessagesrow carries no Overseer-attribution metadata and the rendered instruction contains no generated attribution boilerplate. Passes vacuously now (no dispatches yet) but the assertion shape is wired so Step 4: Disagreement-capable Overseer + voice dispatch with confirm #26 activates real coverage automatically.test/fixtures/overseer-replay/and are NOT production transcripts.Out of scope
Dependencies
Suggested PR breakdown
1 PR: replay harness v0; golden scenarios; one-boss invariant test stub; CI gate.
Risks