You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Alignment check:** All three Conference Core Stack sentences present (calibration study, three diagnostics, 25-error catalogue).
5
6
6
7
---
7
8
@@ -35,7 +36,7 @@ Every property that defines compositional data — the closure constraint, the r
35
36
36
37
## Slide 4 — The Trigger
37
38
38
-
"This began with a simple question: What source of error does CoDa bring into a monitoring instrument, and what breaks at the joint? That question triggered a full audit of transforms, metrics, stored energy, dimensionality, and governance. The result was a 17-error catalogue."
39
+
"This began with a simple question: What source of error does CoDa bring into a monitoring instrument, and what breaks at the joint? That question triggered a full audit of transforms, metrics, stored energy, dimensionality, and governance. The result was a 25-error calibration catalogue — 17 active error sources, 2 future extensions, and 6 scaling concerns — each with a detection test and a governance action."
39
40
40
41
---
41
42
@@ -51,9 +52,9 @@ Every property that defines compositional data — the closure constraint, the r
51
52
52
53
---
53
54
54
-
## Slide 7 — The 17 Error Sources
55
+
## Slide 7 — The 25 Error Sources
55
56
56
-
"The union inherits 12 initial errors, 4 from literature cross-reference, and 1 new concern discovered during review. Each error has a detection test and a governance action. This is the first failure-mode catalogue for a CoDa monitoring instrument."
57
+
"The union inherits 25 documented error sources: 17 active entries covering transforms, metrics, stored energy, dimensionality, zeros, and governance; 2 future extensions for signed and complex compositions; and 6 scaling concerns for deep hierarchies. Each has a detection test and a governance action. This is the first failure-mode catalogue for a CoDa monitoring instrument."
57
58
58
59
---
59
60
@@ -69,33 +70,49 @@ Every property that defines compositional data — the closure constraint, the r
69
70
70
71
---
71
72
72
-
## Slide 10 — Dimensionality-Aware Time Series
73
+
## Slide 10 — The Third Diagnostic: Coherence Residual
74
+
75
+
"TV and Aitchison measure magnitude of change. But they miss something: cross-branch coupling. When fossil fuel declines, does it couple to renewable growth? Or do they move independently? The coherence residual measures this — it decomposes the composition into a sequential binary partition and measures how much change in one branch predicts change in another.
76
+
77
+
We computed it on all 74 year-to-year transitions in EMBER. The result: strict subcompositional coherence does not hold. Mean CR is approximately 0.58. More importantly, 31 percent of transitions show what we call invisible structural change — the TV and Aitchison distances say nothing happened, but the coherence residual detects cross-branch reorganisation underneath.
78
+
79
+
This is a CoDa finding, not just a HUF finding. It provides the first quantitative evidence on the Egozcue–Greenacre question: quasi-coherence is the empirical reality."
80
+
81
+
---
82
+
83
+
## Slide 11 — Cross-Domain Validation (Backblaze)
84
+
85
+
"One domain is an anecdote. Two domains is a pattern. We ran the same three diagnostics on Backblaze hard drive reliability data — 900,000 drives, four manufacturers, SMART health indicators as carriers on the simplex. Different physics entirely. The same pattern families emerged: COUPLING_SIGNAL, STRUCTURAL, COUPLED_EVENT. Same mathematics, different substrate. This is the beginning of domain independence."
86
+
87
+
---
88
+
89
+
## Slide 12 — Dimensionality-Aware Time Series
73
90
74
91
"CoDa transforms assume fixed dimensionality. Monitoring does not. E-11 documents the consequences of dimensionality change. The analyzer detects this and records it in the audit trail."
75
92
76
93
---
77
94
78
-
## Slide 11 — The Analyzer (Demonstrator)
95
+
## Slide 13 — The Analyzer (Demonstrator)
79
96
80
97
"This is the CoDa Calibration Demonstrator. It is open loop, stateless, and observational. It produces a full audit trail of events: drift events, metric disagreements, structural shocks, zero events, dimensionality changes. This is the value HUF brings to CoDa."
81
98
82
99
---
83
100
84
-
## Slide 12 — The Union Thesis
101
+
## Slide 14 — The Union Thesis
85
102
86
103
"CoDa perfected the geometry of compositions at rest. HUF discovered what happens when compositions move. The union is the instrument that watches them move and signals when to care."
87
104
88
105
---
89
106
90
-
## Slide 13 — Deployment Path
107
+
## Slide 15 — Deployment Path
91
108
92
109
"The path is simple: Coimbra, then validation, then Ramsar pilot, then standardization. We begin with calibration, not claims."
93
110
94
111
---
95
112
96
-
## Slide 14 — Closing
113
+
## Slide 16 — Closing
97
114
98
-
"This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value. Thank you."
115
+
"This is a calibration study of a compositional monitoring instrument, not a finished framework. The instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. A 25-error calibration catalogue with detection tests and governance actions makes the instrument falsifiable. The union is the instrument. The audit trail is the value. Thank you."
99
116
100
117
---
101
118
@@ -109,6 +126,6 @@ Every property that defines compositional data — the closure constraint, the r
109
126
110
127
"CoDa has mastered the geometry of compositions at rest. But the world we measure — ecosystems, energy systems, supply chains — is not at rest. It moves. And when compositions move, we need a monitoring instrument that can detect events, track drift, identify shocks, and maintain coherence over time.
111
128
112
-
HUF was built for that purpose. It is a multichannel coherent detector — stateless, open loop, and designed to produce an audit trail of events. When we combine HUF with CoDa, we get something neither can produce alone: a calibrated, dual-metric, dimensionality-aware monitoring instrument.
129
+
This instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. We computed the coherence residual on EMBER — strict subcompositional coherence does not hold. Mean CR is 0.58. Thirty-one percent of transitions show invisible structural change. We ran the same diagnostics on Backblaze hard drive data — different physics, same pattern families. Domain independence.
113
130
114
-
The April entanglement session produced a 17-error catalogue — the first failure-mode study of a CoDa monitoring instrument. This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value."
131
+
A 25-error calibration catalogue makes the instrument falsifiable. This is not a new framework. This is a calibration study. The union is the instrument. The audit trail is the value."
Copy file name to clipboardExpand all lines: drafts/codawork-2026/THE_UNION.md
+10-6Lines changed: 10 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@
8
8
9
9
## The Problem Both Communities Have
10
10
11
-
*For CoDaWork 2026, this union is being presented as a calibration study with 17 active entanglement errors, not as a finished instrument. The dual-metric protocol (TV distance + Aitchison distance) is exploratory; the metric question is not yet settled.*
11
+
*For CoDaWork 2026, this union is being presented as a calibration study of a compositional monitoring instrument, not a finished framework. The instrument uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual. A 25-error calibration catalogue (17 active, 2 future extensions, 6 scaling concerns) with detection tests and governance actions makes the instrument falsifiable.*
12
12
13
13
CoDa has spent 40 years perfecting the mathematics of compositional data — Aitchison geometry, log-ratio transformations, subcompositional coherence, robust estimation, outlier detection. The tools are precise, proven, and elegant. They work beautifully on collected datasets.
14
14
@@ -70,7 +70,11 @@ CoDa control charts monitor single processes — one production line, one chemic
70
70
71
71
### 3. A calibrated entanglement with documented error sources
72
72
73
-
The entanglement error analysis (companion document) identifies 17 error sources — 12 mapped across HUF-alone, CoDa-alone, and the union, plus 5 discovered during literature cross-reference and collective review. Every error source has a detection test and a governance action. No existing CoDa tool or HUF tool has a published error catalogue of this kind. The union is the first compositional monitoring instrument that ships with its own calibration study.
73
+
The entanglement error analysis (companion document) identifies 25 error sources — 17 active entries (E-01 through E-17) mapped across HUF-alone, CoDa-alone, and the union; 2 future extensions (E-18 signed compositions, E-19 complex CoDa); and 6 scaling concerns (ES-01 through ES-06) for deep hierarchies. Every error source has a detection test and a governance action. No existing CoDa tool or HUF tool has a published error catalogue of this kind. The union is the first compositional monitoring instrument that ships with its own calibration study.
74
+
75
+
### 3b. The third diagnostic: coherence residual
76
+
77
+
The coherence residual (CR) measures cross-branch coupling in sequential binary partition decompositions. Where TV and Aitchison measure magnitude of change, CR measures whether change in one branch of the SBP predicts change in another. Computed on EMBER (74 transitions, 3 countries, 25 years): strict subcompositional coherence does not hold (mean CR ≈ 0.58). 31% of transitions exhibit invisible structural change — cross-branch reorganisation that TV and Aitchison classify as stable. Cross-domain validation on Backblaze hardware data (900,000+ drives, 4 manufacturers) produces the same pattern families under different physics. The instrument now uses three diagnostics: total variation distance, Aitchison distance, and the coherence residual.
74
78
75
79
### 5. A calibrated zero-handling protocol (Greenacre complementarity)
76
80
@@ -109,8 +113,8 @@ CoDa cannot compare compositions of different dimension. Real systems change dim
@@ -158,10 +162,10 @@ CoDa cannot compare compositions of different dimension. Real systems change dim
158
162
## The Path to Deployment
159
163
160
164
### Phase 1 — Coimbra (June 2026)
161
-
Present the union thesis. Show the 17-error calibration study. Demonstrate v3 analyzer with dual metrics on EMBER data. Identify CoDa researchers interested in continuous monitoring. Identify MEWMA-CoDa researchers (Tran, Nguyen group) for direct collaboration.
165
+
Present the union thesis. Show the 25-error calibration study. Demonstrate v3 analyzer with three diagnostics on EMBER data. Present coherence residual results and Backblaze cross-domain validation. Identify CoDa researchers interested in continuous monitoring. Identify MEWMA-CoDa researchers (Tran, Nguyen group) for direct collaboration.
162
166
163
167
### Phase 2 — Validation (2026–2027)
164
-
Joint paper with CoDa co-author(s): formal comparison of MEWMA-CoDa and HUF on the same dataset (EMBER Germany, known ground truth). Quantify: does governance reference management improve detection over statistical reference estimation? Does dual-metric diagnostic catch events that single-metric charts miss? Does coherence chain diagnosis outperform neural network diagnosis in interpretability?
168
+
Joint paper with CoDa co-author(s): formal comparison of MEWMA-CoDa and HUF on the same dataset (EMBER Germany, known ground truth). Quantify: does governance reference management improve detection over statistical reference estimation? Does the three-diagnostic protocol catch events that single-metric charts miss? Does coherence chain diagnosis outperform neural network diagnosis in interpretability?
165
169
166
170
### Phase 3 — Ramsar Pilot (2027–2028)
167
171
Deploy the union instrument on Ramsar Convention wetland data. 172 countries, 2,500 wetlands, compositional species data already being collected. CoDa provides the transform layer. HUF provides the governance layer. MEWMA-CoDa provides the statistical power. The coherence chain monitors species composition within wetlands AND structural change across the network.
0 commit comments