3 Traps That Make Compliance Theater Inevitable

AA-006: the Standards War

Mark S. Carroll

Feb 24, 2026

Top of the Series:

AA-000 Outline: 🗺️ The Map Room

Mark S. Carroll ✅

Jan 13

Read full story

Previous:

$2.9 Billion in Losses. 72 Hours to Harden. 3 Moves to Survive

Mark S. Carroll ✅

Feb 17

Read full story

Stop Buying Badges. Buy Evidence. You Don’t Need No Stinkin’ Badges!

One badge, three meanings. The buyer hears ‘safe.’ The vendor hears ‘ship.’ The auditor hears ‘scope.’

*****⚙ THE ARCHITECTS OF AUTONOMY*****

`Research Binder: the receipts (citations + source notes) are compiled in a PDF at the bottom of this article.`

🧭 Cold Open

You are in a windowless conference room that smells like burnt coffee and warm plastic. The projector fan is louder than the people.

Someone has a slide deck titled “AI Governance Readiness.” Someone else has a procurement deadline. Legal has a calendar invite called “EU compliance discussion” that has been rescheduled three times. An engineer is here because they were told this meeting was about shipping.

A vendor rep puts a certification badge on the screen. Big logo. Clean typography. A little halo of credibility.

Procurement exhales. “Great. So you’re certified.”

The rep nods. “Yes, we’re certified.”

Legal asks, “Certified to what?”

The rep smiles like they have practiced this question. “ISO.”

The engineer hears “ISO” and thinks “ISO 27001.” The security lead hears “ISO” and thinks “audit.” Procurement hears “ISO” and thinks “approved.”

Legal asks again, slower this time. “Certified to what scope?”

The rep pauses. “Our management system.”

The room goes quiet.

Procurement is still smiling, but the smile is now contractual. The security lead is doing math. The engineer is doing a different kind of math, the kind where you count the number of places a sentence can hide a problem.

Someone says, “So… this means you comply with the EU AI Act, right?”

The rep does not say yes. The rep does not say no but rather, “It depends.”

The meeting ends with three action items and zero shared understanding. The badge remains on the slide like a sheriff’s star pinned to a costume.

The Standards War is not about standards. It is about who gets to define compliant.

I’ve been in versions of this room before, in real institutions with real consequences. The vocabulary changes. The incentives don’t.

This is the ecosystem. If it feels like a board game, that’s because everyone’s trying to win without reading the rules

🧱 The Mechanism

Standards are supposed to translate governance into buildable structures. In practice, they often become a market language that people use as a substitute for evidence.

Here is how the machine works.

Standards define categories, controls, and documentation expectations.
Certification and auditing turn those expectations into gatekeeping.
Procurement turns gatekeeping into revenue and exclusion.
Marketing turns all of it into a badge.
Teams quietly learn the wrong lesson: “pass the audit” replaces “reduce the risk.”

One example that makes it undeniable

A management system certification (like ISO/IEC 42001) can be real and valuable for governance. It still does not guarantee model-level outcomes. That gap is where compliance theater grows.

Here’s what the gap looks like in practice. Teams run real tests, bias checks, red team scenarios, drift monitoring. Then the compliance tool does not accept any of that as evidence unless it is manually packaged. So people screenshot dashboards, paste results into documents, upload PDFs, and call it “audit artifacts.” The work expands. The risk stays. That is compliance theater with a file attachment.

The incentive in one sentence

Defining compliant is cheaper than proving safety, and easier to sell than building evidence.

These are the lines we can defend in daylight. Everything else goes in the ‘interesting, but please don’t quote me’ drawer

🏛 The Architects

Designers

Standards bodies and framework writers set the vocabulary. Their work is often serious, slow, and careful. The world they serve is fast, chaotic, and funded.

NIST AI RMF brings a risk management lens and emphasizes TEVV and evidence.
ISO/IEC 42001 offers a certifiable management system for AI governance, which is useful for process discipline.
OECD work on conformity assessment explains how trust marks become meaningful only when embedded in quality infrastructure.

Inspectors

Auditors, assessors, and the emerging algorithmic auditing ecosystem try to turn claims into verifiable judgments. Independent oversight fails fast when access is limited and incentives are misaligned.

Enforcers

Procurement, contracts, regulators, and courts decide what hurts.

Procurement has already started demanding AI governance evidence in RFP language, even when the standards discourse in public is still thin.

🧨 The Turn

The Standards War is not a debate about the best framework. The Standards War is the fight over who gets to define the test, who gets to sell the certificate, and who gets left holding the bag when reality shows up.

Think of this as a layer cake where each slice is paperwork that prevents a different kind of regret

Cost to individuals

People get harmed by systems that were “certified” in a way that never touched the risk that hurt them. The badge becomes a shield until discovery asks for receipts.

Cost to institutions

Organizations spend money proving compliance without improving controls. Teams take screenshots of dashboards, upload them into GRC tools, and call it evidence. The work expands. The risk stays.

Cost to the future

Small teams and new entrants get priced out by certification pressure. Large incumbents gain advantage because they can afford the committees, the consultants, and the audit cycles. The market slowly learns that “trustworthy” means “well-resourced.”

`The claims below are conservative and source-backed. Full receipts are in the Research Binder PDF at the end.`

🧾 The Five Safe Claims (your citation-safe backbone)

Assurance cases are converging on system-safety patterns, but outcome evidence is nascent.
Third-party audits need access and independence to avoid becoming symbolic.
Trust marks can anchor assurance or enable theater, depending on scope, testing depth, and surveillance.
Crosswalks are approximate aids, not equivalence proofs.
Red teaming becomes audit-grade evidence only when structured and integrated.

The five claims above are conservative on purpose. Now comes the practical part.

In the Standards War, credibility collapses in three predictable ways. The audit becomes symbolic. The badge becomes theater. The crosswalk becomes a fake proof. The next three visuals are a fast diagnostic for spotting each failure mode before it becomes a procurement decision you have to live with.

Remove one side and the triangle becomes performance art. The good kind is on Broadway, not in risk reviews

If the audit isn’t credible, the badge is just a costume.

Badges are like resumes. The font is free. The references are not

Badge theater keeps winning because pricing is opaque. The easy-to-buy package is usually the easy-to-audit story. Deep assurance is quote-only and scope-dependent, which makes it hard to compare and easy to underfund.

And when the badge is vague, people reach for crosswalks to pretend the vagueness is solved.

This is your anti-handwaving legend. If you can’t label the relationship, you’re not mapping. You’re narrating

Crosswalks are useful when they are relationship-typed and honest about gaps. Crosswalks become dangerous when they’re used as equivalence claims. If you can’t name the relationship type, you don’t have a mapping. You have a sales story.

Download The Method (PDF):

A short field manual for turning standards talk into evidence, including the claims, the decision logic, scripts, and the scorecard.

The Standards War: The Method

3.83MB ∙ PDF file

Download

Free download: The Standards War: The Method A vendor-neutral evidence pack for leaders who want receipts, not badges. Includes: 1. Five citation-ready claims 2. Decision tree for choosing the right mechanism 3. Copy-paste scripts for vendor calls 4. 0–2 scorecard + interpretation guide 5. Full reference list

Download

That brings us to the antidote. You don’t win this war by memorizing standards. You win by building evidence that survives contact with an auditor, a buyer, and a regulator.

`If you want the underlying standards and audit literature behind this diagnostic, it’s in the Research Binder PDF at the end.`

🧰 Autonomy Survival Kit

Three moves you can make this week

Build a one-page Standards Stack Inventory
List every framework, standard, and regulation you reference. Add scope per item: enterprise, product line, or specific workflow. If scope is missing, the standard is currently decorative.
Start an Evidence Index that does not lie
Create an inventory of artifacts with owners, locations, review cadence, and last updated date. Examples include system description, inventory, risk register, TEVV plan, red team logs, monitoring outputs, change logs. Your new spreadsheet tool is designed for exactly this workflow.
1. Download The Tool (Spreadsheet):
  1. A practical working spreadsheet for turning standards talk into receipts, including the crosswalk, the evidence index, control links, and a 0–2 maturity scorecard you can update as the system changes
    The Standards War: The Tool
    337KB ∙ XLSX file
    Download
    Free download: The Standards War: The Tool (Spreadsheet) A ready-to-use evidence system that helps you: 1. Inventory evidence artifacts (owner, location, cadence, status) 2. Map requirements using relationship types instead of handwaving 3. Connect requirements to controls and evidence IDs 4. Score readiness 0–2 and see gaps in a live dashboard 5. Build an audit ready “proof trail” for one workflow at a time
    Download
Relationship-type your crosswalks
Use four labels only: Equivalent, Partial, Related, No Match. Add assumptions. Highlight gaps. Stop implying compliance-by-osmosis. NIST explicitly treats crosswalks as approximate and context dependent.

Two moves for leaders

Make standards claims executable
New rule: no one is allowed to put “compliant with X” in a slide unless it includes scope, control mapping, and evidence pointer.
Upgrade procurement language
Ask for three things in every AI vendor review: scope statement, testing depth summary, and surveillance or re-assessment mechanism. If a badge has no surveillance, treat it as marketing.

One “tonight” exercise (5 minutes, measurable)

Write one sentence for one standard you cite:

“This standard forces us to do X control, producing Y evidence, reviewed every Z.”

If you cannot write that sentence, you are not using a standard. You are using a vibe.

If it’s not logged, it didn’t happen. If it’s not repeatable, it happened once. Auditors are famously unmoved by once

Minimum is how you start. Mature is how you sleep. Choose based on your relationship with 2 a.m. Slack messages

Research Binder for AA-006: the Standards War

315KB ∙ PDF file

Download

All citations, research, and discovery for this article, organized in one place: 1. Five safe claims and the peer-reviewed sources behind each 2. Standards and primary documents (NIST, ISO/IEC, OECD, EU AI Act) 3. EU AI Act: specific articles and recitals relevant to deployers and documentation 4. Notes on what we can safely claim, and what we cannot (yet)

Download

📎 Materials List (starter shelf)

NIST AI RMF 1.0 (SP 1270) and resources
ISO/IEC 42001:2023 overview page
OECD Digital Economy Paper on conformity assessment for AI systems
Hawkins et al. on assurance cases for ML (JSS, 2021)
Raji et al. Outsider Oversight (FAccT, 2022)

Closing stinger

Next case file: The Whistleblower’s Dilemma (AA-007)

$4.7B at Risk: 5 Governance Gaps, 8 Failure Modes, 3 Questions

Mark S. Carroll

Mar 3

Read full story

Every system eventually meets the person who sees too much.

Episode 7 is about the moment governance turns personal. The moment a “risk” becomes a name, a timeline, and a choice. Do you report what you found, and bet your job on the organization doing the right thing? Do you stay quiet, and become part of the system you used to criticize? Do you try the middle path, and learn why middle paths tend to have cliffs.

We’ll map the real trap: most whistleblowing failures aren’t caused by bad people. They’re caused by processes that were never designed to hear bad news, incentives that punish clarity, and a paper trail that vanishes right when it matters most.

If AA-006 was about badges and evidence, AA-007 is about courage and consequences. The receipts still matter. So does the human carrying them.

And if you want the practical counterweight to that dilemma, my upcoming book CollaborateBetter.us is built for exactly this problem: how to create teams and leaders who can hear the truth, act on it, and stay aligned when the pressure hits.

The case ends here.
The building starts now.

Feb 25

Let’s stress-test the Standards War together.

Comment with:

1. One AI vendor

2. Their strongest safety claim (copy/paste is fine)

3. Your role: buyer, builder, or both

I’ll reply with:

• the single artifact that would prove it (or expose it), and

• the exact procurement question to ask.

Bonus: If you want, add “RISK:” and name the failure you fear most (privacy leak, hallucinations, IP, bias, security, compliance).

Empathy Engine

AA-000 Outline: 🗺️ The Map Room

$2.9 Billion in Losses. 72 Hours to Harden. 3 Moves to Survive

$4.7B at Risk: 5 Governance Gaps, 8 Failure Modes, 3 Questions

Discussion about this post

Ready for more?

Empathy Engine

3 Traps That Make Compliance Theater Inevitable

AA-006: the Standards War

AA-000 Outline: 🗺️ The Map Room

$2.9 Billion in Losses. 72 Hours to Harden. 3 Moves to Survive

Stop Buying Badges. Buy Evidence. You Don’t Need No Stinkin’ Badges!

One badge, three meanings. The buyer hears ‘safe.’ The vendor hears ‘ship.’ The auditor hears ‘scope.’

Research Binder: the receipts (citations + source notes) are compiled in a PDF at the bottom of this article.

🧭 Cold Open

🧱 The Mechanism

Here is how the machine works.

One example that makes it undeniable

The incentive in one sentence

🏛 The Architects

Designers

Inspectors

Enforcers

🧨 The Turn

💸 The Cost

Cost to individuals

Cost to institutions

Cost to the future

The claims below are conservative and source-backed. Full receipts are in the Research Binder PDF at the end.

🧾 The Five Safe Claims (your citation-safe backbone)

The five claims above are conservative on purpose. Now comes the practical part.

If the audit isn’t credible, the badge is just a costume.

Badge theater keeps winning because pricing is opaque. The easy-to-buy package is usually the easy-to-audit story. Deep assurance is quote-only and scope-dependent, which makes it hard to compare and easy to underfund.

Crosswalks are useful when they are relationship-typed and honest about gaps. Crosswalks become dangerous when they’re used as equivalence claims. If you can’t name the relationship type, you don’t have a mapping. You have a sales story.

Download The Method (PDF):

A short field manual for turning standards talk into evidence, including the claims, the decision logic, scripts, and the scorecard.

If you want the underlying standards and audit literature behind this diagnostic, it’s in the Research Binder PDF at the end.

🧰 Autonomy Survival Kit

Three moves you can make this week

Two moves for leaders

Closing stinger

$4.7B at Risk: 5 Governance Gaps, 8 Failure Modes, 3 Questions

Discussion about this post

Ready for more?

`Research Binder: the receipts (citations + source notes) are compiled in a PDF at the bottom of this article.`

`The claims below are conservative and source-backed. Full receipts are in the Research Binder PDF at the end.`

`If you want the underlying standards and audit literature behind this diagnostic, it’s in the Research Binder PDF at the end.`