The Laundering · Thematic Edition · The statistics of identity

The Invariant

De-identification tries to break it. Disaggregation reveals it. Linkage restores it. The StatCan thread, read as one argument about what survives a transformation.

A sister publication, The Invariant, runs on a single sentence: identity emerges from invariance across allowable transformations — a thing is what stays the same when you turn it, stretch it, or rewrite it. This Edition borrows that lens and turns it on our own StatCan thread. Read together, Case 10, 21, and 2226 are not seven separate stories. They are one argument about a single object: a statistical identity is whatever survives the transformations a disclosure-control regime applies to a record. Everything the regime calls protection is an operation on that invariant — an attempt to break it, a reveal of it, or a quiet restoration of it.

§01 — The invariant, defined twice

In the mathematics, an invariant is what is left after you apply the allowable transformations: rotate the letter X and it is still X; the orientation was disposable, the identity was not. The catch is the word allowable — the set of permitted transformations is chosen first, and the invariants merely follow from the choice.

A disclosure-control regime makes exactly that choice. Rounding to a base, suppressing small cells, adding noise, aggregating to a coarse geography — these are the "allowable transformations" applied to your record so the output may be released. Your statistical identity is then defined the same way the mathematician defines it: it is whatever still resolves to you after those operations. The regime picks the transformations and calls the survivor "de-identified data." But the survivor is the invariant, and the invariant is the point.

De-identification is not deletion. It is a chosen transformation — and identity is exactly what a chosen transformation leaves behind.

§02 — Break it

The first operation is the attempt to destroy the invariant. This is the whole promise of disclosure control, and Case 26 · The Resolution Layer states its real status precisely: de-identification is a probabilistic state, not a permanent property. The transformations attenuate the invariant; they do not annihilate it. The agency's own concession — that there is "no strict mathematical guarantee… all information being combined" — is, in this vocabulary, the admission that the invariant survives the transformation at some residual resolution.

Case 10 · measurement laundering is the disclosure-control side of the same coin: the apparatus that performs the breaking is also the apparatus that certifies the breaking succeeded. The transformation and the stamp that says "identity destroyed" are issued by the same hand. The reconstruction figures the case carries — high match rates against rounded tables — are simply the invariant, measured.

§03 — Reveal it

The opposite operation makes the invariant sharper. Case 26's §07 names the bind: a category made visible for a group becomes resolution about its members. Disaggregation — finer ethnicity, gender, geography fields — is, in the Erlangen vocabulary the sister site draws, shrinking the group of allowable transformations. Fewer permitted blurrings means more features survive, which means a higher-resolution invariant. The same move that lets a hidden harm finally be counted also lets the counted be more exactly located.

This is the discipline of Case 26, held without a clean answer: the reveal is real and necessary (the harm was invisible), and it is the very thing that raises the resolution. There is no setting of the dial that grants the visibility without the legibility. Two true sentences that do not cancel.

§04 — Restore it

The third operation is the quiet one. Case 26's central device is two streams: the agency adds probabilistic resolution to an environment that is already linkable. Someone outside — a voter roll, a data broker, a breach — already fixes who you are. The published table does not hand them your name; it shifts their posterior on what is true of you. In the invariance vocabulary, a single transformed view may have lost the invariant, but the invariant is reconstructed by combining that view with an external fixed point. Re-identification is identity emerging from invariance — stated as a risk.

This is why "the table does not name you" and "the table can resolve you" are both true. The first is about a single source in isolation (where reconstruction mostly fails). The second is about the invariant recovered across sources — the join, not the cell. The narrow, survivable claim Case 26 turns on is exactly this distinction.

§05 — Fix it

Before any of that, something has to decide which invariants the state will hold fixed for the whole population. That is the census, and it is three cases. Case 22 · The Final Notice is the legal compulsion to supply the record at all. Case 23 · The Ratchet is legibility as a template — James C. Scott's point that the state must first make a population legible by choosing the categories it will see it through. Choosing the categories is choosing the population's invariants: the fields that will be treated as the fixed truth of who people are.

And Case 24 · The Aggregate is what happens once an invariant is enshrined: when lived experience contradicts the official number, the number is "the data" and the life is "anecdote." The measured invariant overrides the thing it was meant to measure. The same gig worker is 0.3%, 8.2%, or 30% depending on the instrument — three numbers, one reality — yet whichever number is published becomes the fixed point everyone else must argue against.

§06 — Fix it by fiat

The sharpest version of fixing an invariant is to declare one. Case 21 · The Definition is the definitional dodge at policy scale: adopt a working definition, and "what counts" stops being contestable and becomes the fixed reference. Case 25 · The Same Facts is that reference wired to a database — an unstated definition of "a hate incident" that will fill a data-sharing system, so that government, schools, and police all "work from the same facts." A definition imposed by fiat is an invariant imposed by fiat: it survives every later transformation because it was placed, by rule, beyond them.

OperationWhat it does to the invariantCase
Breakattenuate it — never fully destroy it10, 26
Revealraise its resolution (disaggregation)26
Restorereconstruct it by joining to an outside key26
Fixenshrine which features count as the truth22, 23, 24
Fix by fiatdeclare the invariant beyond contest21, 25

§07 — Where this sits

The invariance frame is descriptively neutral — the sister site's political reading makes the same warning we make: naming the dynamic is not a verdict on anyone who uses it. Disaggregation can be the instrument of recognition and of resolution; a definition can protect and pre-decide. What the lens adds is only this: it shows the StatCan thread is, underneath its separate subjects, a single mechanism seen from six angles — operations on the one thing every record carries and no transformation cleanly removes.

And it returns us to the series' running guardrail in a new key. "The data is real" holds for the census head-count; it does not hold for survey estimates, nor for reported-crime trends. In this vocabulary: the head-count is a high-confidence invariant; the estimate is a low-confidence one wearing the authority of a fixed point. The Laundering's whole discipline is refusing to let a chosen invariant be narrated as a discovered one.

A statistical identity is a chosen invariant, narrated as a property of nature. The regime picks the transformations — and calls what survives "anonymous."

§ Circulate · Eight ways to file this

Identity is what survives the transformation.

Pick a hook below. Each one is a different door into the same argument.

▸ Reference asset · The Laundering · Thematic Edition A cross-framework reading: the StatCan thread of The Laundering read through the invariance lens of the sister site, The Invariant. Every factual claim about StatCan lives, sourced, in the linked cases; this Edition adds only the connecting argument.