Skip to content

Commit 49f20ec

Browse files
coda house cleaning
1 parent 7f6eb0f commit 49f20ec

26 files changed

Lines changed: 3075 additions & 19 deletions
129 Bytes
Binary file not shown.

drafts/codawork-2026/PRESENTATION_SCRIPT.md

Lines changed: 29 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,8 @@
11
# CoDaWork 2026 — Presentation Script (10–12 minutes)
22

3-
**Source:** Copilot collective review, April 4, 2026.
3+
**Source:** Copilot collective review, April 4, 2026. Updated April 5, 2026 (CR results, Backblaze cross-domain, 25-error count, CGS appendix slide).
44
**Classification:** Rough diamond — conference preparation material.
5+
**Alignment check:** All three Conference Core Stack sentences present (calibration study, three diagnostics, 25-error catalogue).
56

67
---
78

@@ -35,7 +36,7 @@ Every property that defines compositional data — the closure constraint, the r
3536

3637
## Slide 4 — The Trigger
3738

38-
"This began with a simple question: What source of error does CoDa bring into a monitoring instrument, and what breaks at the joint? That question triggered a full audit of transforms, metrics, stored energy, dimensionality, and governance. The result was a 17-error catalogue."
39+
"This began with a simple question: What source of error does CoDa bring into a monitoring instrument, and what breaks at the joint? That question triggered a full audit of transforms, metrics, stored energy, dimensionality, and governance. The result was a 25-error calibration catalogue — 17 active error sources, 2 future extensions, and 6 scaling concerns — each with a detection test and a governance action."
3940

4041
---
4142

@@ -51,9 +52,9 @@ Every property that defines compositional data — the closure constraint, the r
5152

5253
---
5354

54-
## Slide 7 — The 17 Error Sources
55+
## Slide 7 — The 25 Error Sources
5556

56-
"The union inherits 12 initial errors, 4 from literature cross-reference, and 1 new concern discovered during review. Each error has a detection test and a governance action. This is the first failure-mode catalogue for a CoDa monitoring instrument."
57+
"The union inherits 25 documented error sources: 17 active entries covering transforms, metrics, stored energy, dimensionality, zeros, and governance; 2 future extensions for signed and complex compositions; and 6 scaling concerns for deep hierarchies. Each has a detection test and a governance action. This is the first failure-mode catalogue for a CoDa monitoring instrument."
5758

5859
---
5960

@@ -69,33 +70,49 @@ Every property that defines compositional data — the closure constraint, the r
6970

7071
---
7172

72-
## Slide 10 — Dimensionality-Aware Time Series
73+
## Slide 10 — The Third Diagnostic: Coherence Residual
74+
75+
"TV and Aitchison measure magnitude of change. But they miss something: cross-branch coupling. When fossil fuel declines, does it couple to renewable growth? Or do they move independently? The coherence residual measures this — it decomposes the composition into a sequential binary partition and measures how much change in one branch predicts change in another.
76+
77+
We computed it on all 74 year-to-year transitions in EMBER. The result: strict subcompositional coherence does not hold. Mean CR is approximately 0.58. More importantly, 31 percent of transitions show what we call invisible structural change — the TV and Aitchison distances say nothing happened, but the coherence residual detects cross-branch reorganisation underneath.
78+
79+
This is a CoDa finding, not just a HUF finding. It provides the first quantitative evidence on the Egozcue–Greenacre question: quasi-coherence is the empirical reality."
80+
81+
---
82+
83+
## Slide 11 — Cross-Domain Validation (Backblaze)
84+
85+
"One domain is an anecdote. Two domains is a pattern. We ran the same three diagnostics on Backblaze hard drive reliability data — 900,000 drives, four manufacturers, SMART health indicators as carriers on the simplex. Different physics entirely. The same pattern families emerged: COUPLING_SIGNAL, STRUCTURAL, COUPLED_EVENT. Same mathematics, different substrate. This is the beginning of domain independence."
86+
87+
---
88+
89+
## Slide 12 — Dimensionality-Aware Time Series
7390

7491
"CoDa transforms assume fixed dimensionality. Monitoring does not. E-11 documents the consequences of dimensionality change. The analyzer detects this and records it in the audit trail."
7592

7693
---
7794

78-
## Slide 11 — The Analyzer (Demonstrator)
95+
## Slide 13 — The Analyzer (Demonstrator)
7996

8097
"This is the CoDa Calibration Demonstrator. It is open loop, stateless, and observational. It produces a full audit trail of events: drift events, metric disagreements, structural shocks, zero events, dimensionality changes. This is the value HUF brings to CoDa."
8198

8299
---
83100

84-
## Slide 12 — The Union Thesis
101+
## Slide 14 — The Union Thesis
85102

86103
"CoDa perfected the geometry of compositions at rest. HUF discovered what happens when compositions move. The union is the instrument that watches them move and signals when to care."
87104

88105
---
89106

90-
## Slide 13 — Deployment Path
107+
## Slide 15 — Deployment Path
91108

92109
"The path is simple: Coimbra, then validation, then Ramsar pilot, then standardization. We begin with calibration, not claims."
93110

94111
---
95112

96-
## Slide 14 — Closing
113+
## Slide 16 — Closing
97114

98-
"This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value. Thank you."
115+
"This is a calibration study of a compositional monitoring instrument, not a finished framework. The instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. A 25-error calibration catalogue with detection tests and governance actions makes the instrument falsifiable. The union is the instrument. The audit trail is the value. Thank you."
99116

100117
---
101118

@@ -109,6 +126,6 @@ Every property that defines compositional data — the closure constraint, the r
109126

110127
"CoDa has mastered the geometry of compositions at rest. But the world we measure — ecosystems, energy systems, supply chains — is not at rest. It moves. And when compositions move, we need a monitoring instrument that can detect events, track drift, identify shocks, and maintain coherence over time.
111128

112-
HUF was built for that purpose. It is a multichannel coherent detector — stateless, open loop, and designed to produce an audit trail of events. When we combine HUF with CoDa, we get something neither can produce alone: a calibrated, dual-metric, dimensionality-aware monitoring instrument.
129+
This instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. We computed the coherence residual on EMBER — strict subcompositional coherence does not hold. Mean CR is 0.58. Thirty-one percent of transitions show invisible structural change. We ran the same diagnostics on Backblaze hard drive data — different physics, same pattern families. Domain independence.
113130

114-
The April entanglement session produced a 17-error catalogue the first failure-mode study of a CoDa monitoring instrument. This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value."
131+
A 25-error calibration catalogue makes the instrument falsifiable. This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value."

drafts/codawork-2026/THE_UNION.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88

99
## The Problem Both Communities Have
1010

11-
*For CoDaWork 2026, this union is being presented as a calibration study with 17 active entanglement errors, not as a finished instrument. The dual-metric protocol (TV distance + Aitchison distance) is exploratory; the metric question is not yet settled.*
11+
*For CoDaWork 2026, this union is being presented as a calibration study of a compositional monitoring instrument, not a finished framework. The instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. A 25-error calibration catalogue (17 active, 2 future extensions, 6 scaling concerns) with detection tests and governance actions makes the instrument falsifiable.*
1212

1313
CoDa has spent 40 years perfecting the mathematics of compositional data — Aitchison geometry, log-ratio transformations, subcompositional coherence, robust estimation, outlier detection. The tools are precise, proven, and elegant. They work beautifully on collected datasets.
1414

@@ -70,7 +70,11 @@ CoDa control charts monitor single processes — one production line, one chemic
7070

7171
### 3. A calibrated entanglement with documented error sources
7272

73-
The entanglement error analysis (companion document) identifies 17 error sources — 12 mapped across HUF-alone, CoDa-alone, and the union, plus 5 discovered during literature cross-reference and collective review. Every error source has a detection test and a governance action. No existing CoDa tool or HUF tool has a published error catalogue of this kind. The union is the first compositional monitoring instrument that ships with its own calibration study.
73+
The entanglement error analysis (companion document) identifies 25 error sources — 17 active entries (E-01 through E-17) mapped across HUF-alone, CoDa-alone, and the union; 2 future extensions (E-18 signed compositions, E-19 complex CoDa); and 6 scaling concerns (ES-01 through ES-06) for deep hierarchies. Every error source has a detection test and a governance action. No existing CoDa tool or HUF tool has a published error catalogue of this kind. The union is the first compositional monitoring instrument that ships with its own calibration study.
74+
75+
### 3b. The third diagnostic: coherence residual
76+
77+
The coherence residual (CR) measures cross-branch coupling in sequential binary partition decompositions. Where TV and Aitchison measure magnitude of change, CR measures whether change in one branch of the SBP predicts change in another. Computed on EMBER (74 transitions, 3 countries, 25 years): strict subcompositional coherence does not hold (mean CR ≈ 0.58). 31% of transitions exhibit invisible structural change — cross-branch reorganisation that TV and Aitchison classify as stable. Cross-domain validation on Backblaze hardware data (900,000+ drives, 4 manufacturers) produces the same pattern families under different physics. The instrument now uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual.
7478

7579
### 5. A calibrated zero-handling protocol (Greenacre complementarity)
7680

@@ -109,8 +113,8 @@ CoDa cannot compare compositions of different dimension. Real systems change dim
109113
│ METRIC LAYER (CoDa + HUF) │
110114
│ │
111115
│ Aitchison distance ─┐ │
112-
│ TV distance ├─ dual-metric diagnostic
113-
Metric disagreement ─┘ │
116+
│ TV distance ├─ three-diagnostic protocol
117+
Coherence residual ─┘ │
114118
│ │
115119
│ Perturbation velocity v(t) = d_A(x(t), x(t−1)) │
116120
│ K_eff = exp(−Σ xᵢ ln xᵢ) (effective carriers) │
@@ -158,10 +162,10 @@ CoDa cannot compare compositions of different dimension. Real systems change dim
158162
## The Path to Deployment
159163

160164
### Phase 1 — Coimbra (June 2026)
161-
Present the union thesis. Show the 17-error calibration study. Demonstrate v3 analyzer with dual metrics on EMBER data. Identify CoDa researchers interested in continuous monitoring. Identify MEWMA-CoDa researchers (Tran, Nguyen group) for direct collaboration.
165+
Present the union thesis. Show the 25-error calibration study. Demonstrate v3 analyzer with three diagnostics on EMBER data. Present coherence residual results and Backblaze cross-domain validation. Identify CoDa researchers interested in continuous monitoring. Identify MEWMA-CoDa researchers (Tran, Nguyen group) for direct collaboration.
162166

163167
### Phase 2 — Validation (2026–2027)
164-
Joint paper with CoDa co-author(s): formal comparison of MEWMA-CoDa and HUF on the same dataset (EMBER Germany, known ground truth). Quantify: does governance reference management improve detection over statistical reference estimation? Does dual-metric diagnostic catch events that single-metric charts miss? Does coherence chain diagnosis outperform neural network diagnosis in interpretability?
168+
Joint paper with CoDa co-author(s): formal comparison of MEWMA-CoDa and HUF on the same dataset (EMBER Germany, known ground truth). Quantify: does governance reference management improve detection over statistical reference estimation? Does the three-diagnostic protocol catch events that single-metric charts miss? Does coherence chain diagnosis outperform neural network diagnosis in interpretability?
165169

166170
### Phase 3 — Ramsar Pilot (2027–2028)
167171
Deploy the union instrument on Ramsar Convention wetland data. 172 countries, 2,500 wetlands, compositional species data already being collected. CoDa provides the transform layer. HUF provides the governance layer. MEWMA-CoDa provides the statistical power. The coherence chain monitors species composition within wetlands AND structural change across the network.

0 commit comments

Comments
 (0)