From 82a3a610657ce1ca416c48fc8debfb1720812ecc Mon Sep 17 00:00:00 2001
From: Diogo Andre Santos <diogo.ansantos@nos.pt>
Date: Mon, 15 Jun 2026 16:37:34 +0100
Subject: [PATCH 1/3] docs: add Puppetmaster integration pattern (#416)

Add docs/integration_puppetmaster.md showing how contextweaver consumes
Puppetmaster-style job artifacts, worker summaries, logs, and follow-up
reads without dumping raw artifacts into model context.

Covers:
- Artifact summary ingestion via ingest_tool_result_sync with firewall
- Drilldown via ArtifactRef handles (gateway + standalone paths)
- Route/answer phase budgeting over job history
- Follow-up prompt treatment as routable tool_call candidates
- Sensitivity guidance for logs containing credentials/PII
- Explicit boundaries: context consumer, not job orchestrator

Also updates mkdocs nav and CHANGELOG.md.
---
 CHANGELOG.md                     |   7 +
 docs/integration_puppetmaster.md | 214 +++++++++++++++++++++++++++++++
 mkdocs.yml                       |   1 +
 3 files changed, 222 insertions(+)
 create mode 100644 docs/integration_puppetmaster.md

diff --git a/CHANGELOG.md b/CHANGELOG.md
index 2fd06b2..2fc1982 100644
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -9,6 +9,13 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 
 ### Added
 
+- **Puppetmaster integration pattern (#416).** New `docs/integration_puppetmaster.md`
+  shows how contextweaver consumes Puppetmaster-style job artifacts, worker
+  summaries, logs, and follow-up reads without dumping raw artifacts into model
+  context. Covers artifact summary ingestion, drilldown via handles/selectors,
+  route/answer phase budgeting over job history, and explicit boundaries (in:
+  context consumer; out: job supervisor / worker orchestrator).
+
 - **Gateway resource & prompt runtime (#669 / #670).** New
   `PrimitiveGatewayRuntime` (+ the `PrimitiveUpstream` protocol) extends the
   gateway's bounded-choice routing and context-firewall treatment from tools to
diff --git a/docs/integration_puppetmaster.md b/docs/integration_puppetmaster.md
new file mode 100644
index 0000000..d1fa849
--- /dev/null
+++ b/docs/integration_puppetmaster.md
@@ -0,0 +1,214 @@
+# Puppetmaster Integration
+
+> Consume [Puppetmaster](https://github.com/dgenio/puppetmaster)-style job artifacts,
+> worker summaries, logs, and follow-up reads through contextweaver so the LLM
+> never sees raw multi-KB artifacts unless it explicitly asks for them.
+
+## Why
+
+Puppetmaster orchestrates long-running, multi-step jobs. Each job produces:
+
+- **Worker summaries** (small, structured)
+- **Execution logs** (often multi-KB)
+- **Intermediate artifacts** (files, JSON blobs, traces)
+- **Follow-up prompts** (conditional next steps)
+
+Dumping all of this into the model context on every turn causes the same three
+failures contextweaver exists to prevent:
+
+1. **Token bloat.** A single worker log can consume 30 % of a 32 K context window.
+2. **Quality drop.** Needle-in-haystack accuracy degrades as the prompt grows.
+3. **Sensitivity leakage.** Logs may contain credentials, internal URLs, or PII.
+
+contextweaver treats Puppetmaster outputs as firewalled tool results: raw bytes
+go to the artifact store; the LLM sees compact summaries, typed handles, and
+extracted structured fields. When the model needs the full log or artifact, it
+references the handle and `tool_view` returns the precise slice.
+
+## Scope
+
+- **In scope:** ingesting job artifacts and worker summaries, drilldown via
+  handles/selectors, per-phase budgeting over job history, bounded follow-up
+  context compilation.
+- **Out of scope:** job supervision, worker orchestration, or replacing
+  Puppetmaster's scheduler. contextweaver is a context consumer, not a job
+  controller.
+
+## Architecture
+
+```text
+┌─────────────────────────────────────────────────────────────────┐
+│ Puppetmaster runtime                                            │
+│   • job scheduler + worker pool                                 │
+│   • produces: summaries, logs, artifacts, follow-ups            │
+└───────────────────────────────┬─────────────────────────────────┘
+                                │  (your adapter)
+                                ▼
+┌─────────────────────────────────────────────────────────────────┐
+│ contextweaver                                                   │
+│   ┌─────────────────────────────────────────────────────────┐   │
+│   │ ContextManager                                          │   │
+│   │   ├─ ingest_tool_result_sync(summary, raw_log)        │   │
+│   │   │      → firewall stores raw log in artifact_store  │   │
+│   │   │      → ContextItem carries summary + ArtifactRef  │   │
+│   │   ├─ build(phase=Phase.route)                         │   │
+│   │   │      → choose which job/history is relevant       │   │
+│   │   └─ build(phase=Phase.answer)                        │   │
+│   │          → budgeted prompt over job history           │   │
+│   └─────────────────────────────────────────────────────────┘   │
+│                         │                                       │
+│                         ▼                                       │
+│   ┌─────────────────────────────────────────────────────────┐   │
+│   │ ArtifactStore                                          │   │
+│   │   • raw logs / artifacts by handle                      │   │
+│   │   • retrieved only via drilldown (tool_view)            │   │
+│   └─────────────────────────────────────────────────────────┘   │
+└─────────────────────────────────────────────────────────────────┘
+```
+
+## Ingesting a job result
+
+A Puppetmaster job completion is just another tool result. Ingest it via the
+firewall so large payloads are stored out-of-band automatically:
+
+```python
+from contextweaver import ContextManager
+from contextweaver.config import ContextBudget, ContextPolicy
+
+mgr = ContextManager(
+    budget=ContextBudget(route=500, call=800, interpret=600, answer=1200),
+    policy=ContextPolicy(),
+)
+
+job_summary = "Worker 'train-model' completed. Accuracy: 0.94. Epochs: 50."
+raw_log = """2024-06-01T12:00:01Z [INFO] Starting epoch 1...
+2024-06-01T12:05:23Z [INFO] epoch 1 loss=0.42
+...
+2024-06-01T14:32:11Z [INFO] epoch 50 loss=0.03 accuracy=0.94
+2024-06-01T14:32:12Z [INFO] Saving checkpoint to /runs/epoch-50.pt
+"""
+
+mgr.ingest_tool_result_sync(
+    tool_call_id="job:train-model:42",
+    raw_output={"summary": job_summary, "log": raw_log},
+)
+
+# The firewall stores the raw log in the artifact store.
+# The event log contains a compact ContextItem with a handle.
+```
+
+The firewall intercepts the log if it exceeds the configured threshold (default
+2 048 bytes). The LLM sees the summary; the full log lives in the artifact store
+under a typed `ArtifactRef` handle.
+
+## Budgeting across job history
+
+Treat job history like any other turn sequence. Phase budgets keep the prompt
+bounded even when dozens of prior jobs exist:
+
+```python
+from contextweaver.types import Phase
+
+# Route phase: select relevant tools/jobs for the current goal.
+route_pack = mgr.build_sync(phase=Phase.route, query="check latest training run")
+
+# Answer phase: compose a response over the accumulated history.
+answer_pack = mgr.build_sync(phase=Phase.answer, query="summarise training progress")
+```
+
+| Phase | What it covers | Budget posture |
+|---|---|---|
+| `route` | Pick which job, worker, or follow-up tool is relevant | Small |
+| `call` | Generate arguments for a selected tool | Medium |
+| `interpret` | Digest a fresh job result with prior context | Medium |
+| `answer` | Compose a final summary across multiple jobs | Largest |
+
+Use tight `route` budgets so the model can't "read" the entire job history just
+to pick a tool. Let `answer` carry the larger budget because that is where the
+model synthesises findings.
+
+## Drilldown and follow-up reads
+
+When the summary is not enough, the model can request the full artifact via the
+handle. In a gateway setup this is `tool_view`; in a standalone script you read
+directly from the store:
+
+```python
+# Gateway path (MCP proxy / gateway)
+from contextweaver.adapters.mcp_gateway import tool_view
+
+# The model sends back the artifact handle from the summary.
+full_log = tool_view(mgr, handle="artifact:job:train-model:42")
+```
+
+```python
+# Standalone path
+handle = "artifact:job:train-model:42"
+assert mgr.artifact_store.exists(handle)
+full_bytes = mgr.artifact_store.get(handle)
+```
+
+Both paths are gated by the artifact store; the LLM never receives raw bytes
+unless a deliberate drilldown happens.
+
+## Handling follow-up prompts
+
+Puppetmaster can emit structured follow-up prompts ("retrain with lr=0.001",
+"evaluate on validation set B"). Treat these as `ContextItem` candidates with
+`ItemKind.tool_call` so the router can include them in the shortlist:
+
+```python
+from contextweaver.types import ContextItem, ItemKind
+
+follow_up = ContextItem(
+    id="suggestion:epoch-50",
+    kind=ItemKind.tool_call,
+    text="Suggested follow-up: evaluate on validation set B",
+    metadata={
+        "source": "puppetmaster",
+        "job_id": "train-model:42",
+        "suggested_action": "evaluate",
+        "args": {"dataset": "validation-B"},
+    },
+)
+mgr.ingest(follow_up)
+```
+
+During `Phase.route`, the follow-up competes for budget with other candidates.
+If scored highly, it surfaces in the prompt as a routed suggestion the model can
+choose to act on or ignore.
+
+## Sensitivity
+
+Job logs often contain internal hostnames, tokens, or file paths. The firewall
+does not erase them — it stores the raw log unchanged in the artifact store.
+Apply a `SensitivityClassifier` or `RedactionHook` if the *summary* itself must
+also be scrubbed before reaching the LLM:
+
+```python
+from contextweaver.context.sensitivity import MaskRedactionHook
+from contextweaver.config import ContextPolicy
+
+policy = ContextPolicy(
+    sensitivity_floor=Sensitivity.internal,
+    redaction_hooks=["mask"],
+)
+mgr = ContextManager(budget=budget, policy=policy)
+```
+
+See [`context_firewall.md`](context_firewall.md) and
+[`security_model.md`](security_model.md) for the full firewall and sensitivity
+guides.
+
+## Summary
+
+| Puppetmaster output | contextweaver treatment |
+|---|---|
+| Worker summary | Ingest as `tool_result` summary (LLM-visible) |
+| Execution log | Firewall → artifact store; handle in summary |
+| Intermediate artifact | Firewall → artifact store; drilldown on demand |
+| Follow-up prompt | Ingest as `tool_call` candidate for routing |
+| Multi-job history | Per-phase budget (`route` / `answer`) keeps prompt bounded |
+
+Keep Puppetmaster as the orchestrator of record. contextweaver only decides
+what the LLM sees this turn and how much of it fits the budget.
diff --git a/mkdocs.yml b/mkdocs.yml
index 4b6db88..a0eeaec 100644
--- a/mkdocs.yml
+++ b/mkdocs.yml
@@ -111,6 +111,7 @@ nav:
       - Agent Skills: integration_agent_skills.md
       - External Memory: integration_memory.md
       - Persistent & Remote Stores: persistent_stores.md
+      - Puppetmaster: integration_puppetmaster.md
   - Architecture: architecture.md
   - Benchmarks: benchmarks.md
   - Adopter Benchmark Report: benchmark_report.md

From d64249ffa8d5bce84e9b5c32e36979380092d0bd Mon Sep 17 00:00:00 2001
From: Diogo Andre Santos <diogo.ansantos@nos.pt>
Date: Mon, 15 Jun 2026 16:58:42 +0100
Subject: [PATCH 2/3] docs: fix API accuracy in Puppetmaster integration guide

Address review feedback on docs/integration_puppetmaster.md:
- Correct ingest_tool_result_sync signature in architecture diagram
- Use json.dumps + StructuredFirewall for structured raw_output
- Fix default firewall_threshold (2000, not 2048)
- Replace nonexistent tool_view with dispatch_meta_tool
- Fix artifact handle format to artifact:result:{tool_call_id}
- Fix sensitivity snippet imports, budget definition, and action
---
 docs/integration_puppetmaster.md | 55 +++++++++++++++++++++++---------
 1 file changed, 40 insertions(+), 15 deletions(-)

diff --git a/docs/integration_puppetmaster.md b/docs/integration_puppetmaster.md
index d1fa849..e96dc1f 100644
--- a/docs/integration_puppetmaster.md
+++ b/docs/integration_puppetmaster.md
@@ -48,9 +48,10 @@ references the handle and `tool_view` returns the precise slice.
 │ contextweaver                                                   │
 │   ┌─────────────────────────────────────────────────────────┐   │
 │   │ ContextManager                                          │   │
-│   │   ├─ ingest_tool_result_sync(summary, raw_log)        │   │
+│   │   ├─ ingest_tool_result_sync(tool_call_id, raw_output) │   │
 │   │   │      → firewall stores raw log in artifact_store  │   │
-│   │   │      → ContextItem carries summary + ArtifactRef  │   │
+│   │   │      → summary auto-generated; ContextItem carries │   │
+│   │   │         summary + ArtifactRef                      │   │
 │   │   ├─ build(phase=Phase.route)                         │   │
 │   │   │      → choose which job/history is relevant       │   │
 │   │   └─ build(phase=Phase.answer)                        │   │
@@ -73,12 +74,17 @@ firewall so large payloads are stored out-of-band automatically:
 
 ```python
 from contextweaver import ContextManager
+from contextweaver.types import Sensitivity
 from contextweaver.config import ContextBudget, ContextPolicy
 
-mgr = ContextManager(
-    budget=ContextBudget(route=500, call=800, interpret=600, answer=1200),
-    policy=ContextPolicy(),
+budget = ContextBudget(route=500, call=800, interpret=600, answer=1200)
+
+policy = ContextPolicy(
+    sensitivity_floor=Sensitivity.internal,
+    sensitivity_action="redact",
+    redaction_hooks=["mask"],
 )
+mgr = ContextManager(budget=budget, policy=policy)
 
 job_summary = "Worker 'train-model' completed. Accuracy: 0.94. Epochs: 50."
 raw_log = """2024-06-01T12:00:01Z [INFO] Starting epoch 1...
@@ -88,18 +94,24 @@ raw_log = """2024-06-01T12:00:01Z [INFO] Starting epoch 1...
 2024-06-01T14:32:12Z [INFO] Saving checkpoint to /runs/epoch-50.pt
 """
 
+payload = json.dumps({"summary": job_summary, "log": raw_log})
+
 mgr.ingest_tool_result_sync(
     tool_call_id="job:train-model:42",
-    raw_output={"summary": job_summary, "log": raw_log},
+    raw_output=payload,
+    firewall=StructuredFirewall(keep=["summary"]),
 )
 
-# The firewall stores the raw log in the artifact store.
+# The firewall stores the raw payload in the artifact store.
 # The event log contains a compact ContextItem with a handle.
+# With StructuredFirewall(keep=["summary"]), the LLM sees the summary
+# inline while the full payload (including the log) stays behind the handle.
 ```
 
-The firewall intercepts the log if it exceeds the configured threshold (default
-2 048 bytes). The LLM sees the summary; the full log lives in the artifact store
-under a typed `ArtifactRef` handle.
+The firewall intercepts the payload if it exceeds the configured threshold (default
+2 000 bytes). When `StructuredFirewall(keep=[...])` is used, the LLM sees only the
+projected field(s) inline; the full payload remains in the artifact store under a
+typed `ArtifactRef` handle.
 
 ## Budgeting across job history
 
@@ -135,15 +147,24 @@ directly from the store:
 
 ```python
 # Gateway path (MCP proxy / gateway)
-from contextweaver.adapters.mcp_gateway import tool_view
+from contextweaver.adapters.mcp_gateway import dispatch_meta_tool
 
 # The model sends back the artifact handle from the summary.
-full_log = tool_view(mgr, handle="artifact:job:train-model:42")
+# Handles produced by ingest_tool_result_sync follow the form:
+#   artifact:result:{tool_call_id}
+handle = "artifact:result:job:train-model:42"
+result = dispatch_meta_tool(
+    name="tool_view",
+    runtime=proxy_runtime,   # your ProxyRuntime instance
+    arguments={"handle": handle, "selector": None},
+)
 ```
 
 ```python
 # Standalone path
-handle = "artifact:job:train-model:42"
+# Handles produced by ingest_tool_result_sync follow the form:
+#   artifact:result:{tool_call_id}
+handle = "artifact:result:job:train-model:42"
 assert mgr.artifact_store.exists(handle)
 full_bytes = mgr.artifact_store.get(handle)
 ```
@@ -186,11 +207,15 @@ Apply a `SensitivityClassifier` or `RedactionHook` if the *summary* itself must
 also be scrubbed before reaching the LLM:
 
 ```python
-from contextweaver.context.sensitivity import MaskRedactionHook
-from contextweaver.config import ContextPolicy
+from contextweaver import ContextManager
+from contextweaver.types import Sensitivity
+from contextweaver.config import ContextBudget, ContextPolicy
+
+budget = ContextBudget(route=500, call=800, interpret=600, answer=1200)
 
 policy = ContextPolicy(
     sensitivity_floor=Sensitivity.internal,
+    sensitivity_action="redact",
     redaction_hooks=["mask"],
 )
 mgr = ContextManager(budget=budget, policy=policy)

From cb66ee53bcbd8d4e5e6fe18e603100afe295a3cc Mon Sep 17 00:00:00 2001
From: Diogo Andre Santos <diogo.ansantos@nos.pt>
Date: Mon, 15 Jun 2026 17:28:53 +0100
Subject: [PATCH 3/3] docs: add missing imports and fix API call in
 Puppetmaster guide

Address review feedback on docs/integration_puppetmaster.md:
- Add missing import json and StructuredFirewall import
- Fix firewall threshold wording (bytes -> characters)
- Correct dispatch_meta_tool signature (args, not arguments; await; reorder params)
---
 docs/integration_puppetmaster.md | 15 ++++++++-------
 1 file changed, 8 insertions(+), 7 deletions(-)

diff --git a/docs/integration_puppetmaster.md b/docs/integration_puppetmaster.md
index e96dc1f..cb47945 100644
--- a/docs/integration_puppetmaster.md
+++ b/docs/integration_puppetmaster.md
@@ -73,9 +73,10 @@ A Puppetmaster job completion is just another tool result. Ingest it via the
 firewall so large payloads are stored out-of-band automatically:
 
 ```python
-from contextweaver import ContextManager
+from contextweaver import ContextManager, StructuredFirewall
 from contextweaver.types import Sensitivity
 from contextweaver.config import ContextBudget, ContextPolicy
+import json
 
 budget = ContextBudget(route=500, call=800, interpret=600, answer=1200)
 
@@ -109,7 +110,7 @@ mgr.ingest_tool_result_sync(
 ```
 
 The firewall intercepts the payload if it exceeds the configured threshold (default
-2 000 bytes). When `StructuredFirewall(keep=[...])` is used, the LLM sees only the
+2 000 characters). When `StructuredFirewall(keep=[...])` is used, the LLM sees only the
 projected field(s) inline; the full payload remains in the artifact store under a
 typed `ArtifactRef` handle.
 
@@ -146,17 +147,17 @@ handle. In a gateway setup this is `tool_view`; in a standalone script you read
 directly from the store:
 
 ```python
-# Gateway path (MCP proxy / gateway)
+# Gateway path (MCP proxy / gateway) — inside an async function
 from contextweaver.adapters.mcp_gateway import dispatch_meta_tool
 
 # The model sends back the artifact handle from the summary.
 # Handles produced by ingest_tool_result_sync follow the form:
 #   artifact:result:{tool_call_id}
 handle = "artifact:result:job:train-model:42"
-result = dispatch_meta_tool(
-    name="tool_view",
-    runtime=proxy_runtime,   # your ProxyRuntime instance
-    arguments={"handle": handle, "selector": None},
+result = await dispatch_meta_tool(
+    proxy_runtime,            # your ProxyRuntime instance
+    "tool_view",
+    {"handle": handle, "selector": {}},
 )
 ```