Unify the duplicated rule chains in `DefaultPolicyEngine.evaluate()` and `.explain()`

## Summary

Refactor `DefaultPolicyEngine` so `evaluate()` (decide) and `explain()` (describe)
are driven by one rule-chain definition, eliminating the two hand-maintained
~150-line copies of the same logic.

## Why this matters

The two methods must agree forever: an explanation that diverges from the actual
decision is worse than no explanation, because operators will trust it. Today any
rule added to `evaluate()` must be manually mirrored in `explain()` — a classic
drift bug waiting to happen in the security-critical module. The DSL engine already
demonstrates the separated pattern (`policy_dsl.py` vs `policy_dsl_explain.py`)
that keeps each file in budget, but the default engine duplicates *logic*, not just
location.

## Current evidence

- `policy.py`: `def evaluate(` at line 180 and `def explain(` at line 466, each walking the same conditions (safety class rules, roles, justification, rate limits) independently; the file is 652 lines (ISSUE 12).
- `policy_reasons.py` holds the stable reason codes both paths must emit consistently.

## External context

Not required for this issue.

## Proposed implementation

1. Extract each rule as a small object/function returning a structured outcome
   `(passed, reason_code, description, would_deny)`.
2. `evaluate()` short-circuits on first deny; `explain()` runs all rules and reports
   each — same chain, two traversal modes.
3. Property test: for randomized principals/capabilities/contexts, `explain()`'s
   predicted decision equals `evaluate()`'s actual decision (hypothesis-style or
   parametrized grid; deterministic seeds).
4. Sequence before ISSUE 12's `policy.py` split so the split moves the unified chain.

## AI-agent execution notes

- Inspect first: `policy.py` (both methods, rate-limit interaction), `policy_reasons.py`, `tests/test_policy.py`.
- Critical edge case: `evaluate()` *records* rate-limit usage; `explain()` must never mutate limiter state. The unified chain needs an explicit read-only mode.
- Preserve every reason code and message string exactly (error-message contract).
- This is security-grade code: behavior-preserving refactor, verified by the full existing policy test matrix.

## Acceptance criteria

- One rule-chain definition drives both methods; the duplicated blocks are gone.
- An evaluate/explain agreement test passes across a broad input grid.
- `explain()` provably does not mutate rate-limiter state (test).
- All existing policy tests pass unchanged.

## Test plan

Existing `tests/test_policy.py` matrix + new agreement and no-mutation tests.
Run `make ci`.

## Documentation plan

Update `docs/agent-context/architecture.md` policy section; CHANGELOG `Changed`
(internal).

## Migration and compatibility notes

No public API change. Not expected to require migration.

## Risks and tradeoffs

Refactoring security-critical code carries regression risk — mitigate with the
agreement property test landing *before* the refactor. Slight indirection cost in
readability, repaid by impossibility of drift.

## Suggested labels

architecture, refactor, security


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify the duplicated rule chains in `DefaultPolicyEngine.evaluate()` and `.explain()` #219

Summary

Why this matters

Current evidence

External context

Proposed implementation

AI-agent execution notes

Acceptance criteria

Test plan

Documentation plan

Migration and compatibility notes

Risks and tradeoffs

Suggested labels

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Unify the duplicated rule chains in DefaultPolicyEngine.evaluate() and .explain() #219

Description

Summary

Why this matters

Current evidence

External context

Proposed implementation

AI-agent execution notes

Acceptance criteria

Test plan

Documentation plan

Migration and compatibility notes

Risks and tradeoffs

Suggested labels

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions

Unify the duplicated rule chains in `DefaultPolicyEngine.evaluate()` and `.explain()` #219