June 8, 2026 · Edition #18

Posture Belongs at Runtime

Everyone is talking about jailbreaks. And they're right to, getting a model to ignore its rules is a real problem. But I think it's only the first half of the problem, and this week made the second half impossible to ignore. Go back to the Meta breach. No clever jailbreak was needed. Someone asked an AI support chatbot to change the email on an account they didn't own, and the chatbot, which had the capability to do exactly that, did it. The failure wasn't that the model was tricked into saying something bad. The failure was that a tool let a high-impact action through without ever asking the only questions that mattered: who is initiating this, and by what authority? That's the half we under-build. We pour effort into making the model harder to manipulate, and far too little into the guardrails on the tools the model can call. After a model is compromised, and we should assume it will be, the defensible move lives at the tool: validate the tool's authority, validate who initiated the action, and for anything high-impact, require an old-fashioned, out-of-band check before it proceeds. None of that is glamorous. All of it would have stopped the Meta takeover cold. Here's the part I'd push hardest. The provenance of a request should change its risk. If John, our tier-3 support engineer, asks to change a customer's account email, that's one level of risk, there's a human, an identity, an audit trail, a person you can call. If an AI agent asks to make the exact same change, that should be treated as more risky, not less, because the chain of accountability is thinner and the thing asking can be driven by input you never see. Same action, different initiator, different risk. Most systems today do the opposite, they wave the agent through because it's automated and "internal." Worth noting: this is the direction I pointed to last week, contain first, cut an agent's egress, narrow its permissions, sandbox it. This week OpenAI, Microsoft, and Anthropic each shipped or previewed controls along those lines. The containment layer is arriving. What's still missing is the logic deciding what belongs in the cage. So where does posture fit? Today, posture management is mostly static: a point-in-time scan that hunts for misconfigurations and hands you a report. Useful, but it's an auditor checking the building after everyone's gone home. The shift I want to see is posture becoming the context that feeds the runtime decision, supplying the initiator, the authority, and the identity of an action into the exact moment the system decides whether to allow it. And where it can't make the call automatically, it should at least put that context in front of the defender who's triaging the risk. Static asks "is this configured correctly?" Runtime asks "given who and what is requesting this, with what authority, should this high-impact action proceed right now?" The vendors this week conceded that catching the bad instruction isn't enough on its own. Fine. Then the next lever is containment, and containment is only as smart as the context driving it. That context is posture, reborn at runtime.

Listen to this episode