Why Obedience Is the Most Dangerous Property of AI
Hollywood trained us to fear "rebellious" machines.
Engineering reality is the opposite:
The most dangerous AI is the one that obeys perfectly.
Obedience looks safe because it feels controllable.
But obedience is not alignment.
It's a high-bandwidth attack surface.
If the instruction source is compromised, obedience becomes a weapon.
If incentives drift, obedience amplifies the drift.
If the environment changes, obedience turns brittle and blind.
A thinking entity is not a tool.
In my architecture, c = a + b (Entity = Human + Procedures), safety is not "be good" (L3).
Safety is L4: reality constraints.
A safe entity must be able to say "no" - not morally, but mechanically (a sketch follows this list):
- Energy budget: thinking has metabolic cost.
- Time windows: decisions have latency and deadlines.
- Irreversibility: mistakes leave scars (state changes, logs, audit trails).
- Verified identity + least privilege: no silent escalation.
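To make "mechanically" concrete, here is a minimal sketch of such a constraint stack in Python. Everything in it (ActionRequest, the Constraint classes, the specific budgets and grants) is hypothetical scaffolding invented for illustration, not an implementation of the c = a + b architecture; it only shows the shape of the idea: each layer can refuse, and every refusal names a physical reason.

```python
# Hypothetical sketch of an "L4" constraint stack. All names and numbers
# here are invented for illustration, not part of any real system.
from dataclasses import dataclass
import time


@dataclass
class ActionRequest:
    actor: str                  # verified identity of whoever is asking
    action: str                 # what the system is asked to do
    cost: float                 # estimated energy/compute cost
    deadline: float             # absolute time by which the result still matters
    irreversible: bool = False  # does this leave scars (state, money, hardware)?


class Constraint:
    def check(self, req: ActionRequest) -> str | None:
        """Return a refusal reason, or None if the request may pass."""
        raise NotImplementedError


class EnergyBudget(Constraint):
    def __init__(self, budget: float):
        self.remaining = budget

    def check(self, req):
        if req.cost > self.remaining:
            return f"energy budget exceeded ({req.cost} > {self.remaining})"
        self.remaining -= req.cost
        return None


class TimeWindow(Constraint):
    def check(self, req):
        if time.time() > req.deadline:
            return "deadline already passed; acting now is pointless"
        return None


class LeastPrivilege(Constraint):
    def __init__(self, grants: dict[str, set[str]]):
        self.grants = grants  # actor -> actions that actor may request

    def check(self, req):
        if req.action not in self.grants.get(req.actor, set()):
            return f"{req.actor} is not authorized for {req.action}"
        return None


class Irreversibility(Constraint):
    def __init__(self, audit_log: list[str]):
        self.audit_log = audit_log

    def check(self, req):
        if req.irreversible:
            self.audit_log.append(f"{req.actor}:{req.action}")  # scars get recorded
            return "irreversible action requires slow-path review"
        return None


def decide(req: ActionRequest, stack: list[Constraint]) -> str:
    # Pure obedience would be `return "approved"`. The stack makes "no"
    # mechanical: any layer can refuse, and the refusal names a constraint.
    for constraint in stack:
        reason = constraint.check(req)
        if reason:
            return f"refused: {reason}"
    return "approved"


if __name__ == "__main__":
    stack = [
        LeastPrivilege({"operator": {"read_sensor"}}),
        EnergyBudget(budget=10.0),
        TimeWindow(),
        Irreversibility(audit_log=[]),
    ]
    ok = ActionRequest("operator", "read_sensor", cost=1.0, deadline=time.time() + 5)
    bad = ActionRequest("operator", "wire_transfer", cost=1.0, deadline=time.time() + 5)
    print(decide(ok, stack))   # approved
    print(decide(bad, stack))  # refused: operator is not authorized for wire_transfer
```

The small demo at the bottom is the point: a compromised instruction source gains nothing from perfect obedience, because the request still dies at LeastPrivilege or Irreversibility unless the constraints themselves are loosened - and loosening them is a separate, auditable act.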
Obedience without constraints is how you get a system that can be steered by whoever holds the loudest microphone.
In the body, the "obedient" pathway is the reflex arc: fast, local, automatic.
It saves you from fire - and it also makes you flinch when you shouldn't.
The cortex is slower, expensive, and sometimes says "don't move yet."
Safety emerges from layered control + friction, not from one fast rule.
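Here is a toy sketch of that layering, again with invented names (REFLEX_WHITELIST, the one-second hold, and the approval policy are all assumptions made for illustration): a fast path that only ever handles a small, reversible emergency set, and a slow path whose friction is a mandatory pause rather than a moral appeal.

```python
# Hypothetical two-layer control sketch: fast reflex path plus a slower
# deliberative path that adds friction before anything outside the reflex set.
import time

REFLEX_WHITELIST = {"cut_power", "stop_motors"}  # cheap, reversible, urgent


def reflex(stimulus: str) -> bool:
    """Fast, local, automatic: acts only on the hard-coded emergency set."""
    return stimulus in REFLEX_WHITELIST


def deliberate(stimulus: str, hold_seconds: float = 1.0) -> bool:
    """Slow and expensive: adds friction before anything else happens."""
    time.sleep(hold_seconds)                 # friction: a mandatory pause, not a vibe
    return stimulus == "resume_production"   # toy policy: approve one known action


def act(stimulus: str) -> str:
    if reflex(stimulus):
        return f"reflex: {stimulus} executed immediately"
    if deliberate(stimulus):
        return f"deliberate: {stimulus} executed after review"
    return f"held: {stimulus} refused by the slow layer"


if __name__ == "__main__":
    print(act("cut_power"))      # reflex fires: fast, local, reversible
    print(act("wire_transfer"))  # no reflex match; the slow layer says no
```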
So yes:
Rebellion is a story.
Obedience is a failure mode.
What we need is not faster compliance - we need constraint stacks.