The Safety Layer Is Not Keeping Up

The argument no one wants to make

The industry building artificial intelligence is moving faster than the infrastructure designed to govern it. This is not a prediction. It is the documented, stated reason that some of the most senior researchers in the field are leaving the organisations they built — publicly, on record, citing it explicitly.

The market has not processed this. Revenue is growing. Global AI spending reached $1.5 trillion in 2025, according to Gartner. Benchmarks are improving. The products are genuinely useful. These facts are being treated as evidence that safety concerns are overstated. They are not evidence of that. They are evidence that capability and safety are different things — and that one can advance while the other stagnates.

Global AI investment vs. reported AI safety incidents, 2020–2025 Sources: Crunchbase, OECD, Adversa AI, Stanford HAI

What happened with Anthropic — and what it actually means

In early 2026, the US Department of Defence presented Anthropic with a demand: allow its AI models to be used for "all lawful purposes" — a category that explicitly included fully autonomous weapons systems and domestic mass surveillance infrastructure. Anthropic refused. The $200 million contract was terminated.

Defence Secretary Pete Hegseth declared Anthropic's position "fundamentally incompatible with American principles" and labelled the company a national security supply chain risk — a designation previously reserved for foreign adversaries, never before applied to an American company. Then, within days, the same administration indicated it might invoke the Defence Production Act to compel Anthropic's cooperation. Then directed all federal agencies to cease using its technology entirely.

The constraints held under maximum pressure — a $200M loss, government coercion, the threat of nationalisation. For any organisation evaluating AI vendors, that is a structural data point.

The significance of Anthropic's decision is not that safety-focused AI wins on ethics. Markets do not procure AI on ethics. The significance is structural: a company demonstrated — under the maximum possible pressure from the most powerful government on earth — that its stated constraints are not marketing. They are load-bearing. They do not move when a powerful client pushes against them.

Governance Consideration

Understanding which AI vendor constraints are contractual versus architectural is now a meaningful procurement question. Organisations entering new markets or regulated industries increasingly find that their AI vendor choices are scrutinised by counterparties and regulators. How you structure your operating entity affects this exposure as much as which model you deploy.

The exodus: what the builders are saying

The Anthropic story did not emerge in a vacuum. It sits inside a pattern that has been building for two years — and the signal is consistent across multiple organisations.

Pattern of departures — documented & verified

OpenAI · 2024

Superalignment team resignations. Senior members of OpenAI's team tasked with making advanced AI safe resigned publicly, citing the company's prioritisation of revenue over safety. Jan Leike, co-lead of the team, stated the safety culture had been "consistently" deprioritised. Not a leak — said publicly, on record.

OpenAI · 2024

Safety executive terminated. A senior safety executive was fired after opposing a product rollout she considered premature. She has publicly stated the justification given was fabricated.

Anthropic · 2025

Mrinank Sharma — Head of Safeguards Research. Published an open letter on departure. His assessment, in his own words: "the world is in peril." Not a junior employee. The person running safety research at the company that later refused a $200M government contract on safety grounds.

Meta · 2025

Scale AI departures to Meta. Around 10 senior team members left Scale AI to join Meta following a $14.3 billion Meta investment — including CEO Alexandr Wang. The concentration of AI talent at closed, consumer-scale organisations accelerates deployment without proportional governance investment.

Verified sources

01Jan Leike resignation post, X / Twitter, May 2024. Leike stated: "I have been disagreeing with OpenAI leadership about the core priorities." — @janleike

02Mrinank Sharma departure letter, 2025. mrinank.com

03Crunchbase: Meta / Scale AI investment, $14.3B round, 2025. Crunchbase News

Enterprise AI agent governance readiness, 2026 Sources: AIUC-1 Consortium, Stanford Trustworthy AI Lab, Saviynt CISO Report, Cloud Security Alliance

The agentic phase exposed the gap

The transition from conversational AI to agentic AI is not a product update. It is a categorical shift in risk profile. Chatbots generate outputs — humans evaluate them and decide what to do. Agents take actions: booking, deleting, sending, executing — often at machine speed, without a human reviewing each step. The feedback loop is compressed or removed.

According to the 2025 AI Agent Index published by Stanford and Berkeley, papers mentioning "Agentic AI" in 2025 exceeded the combined total from all prior years. A McKinsey survey of 1,993 companies found 62% were at least experimenting with AI agents. The pace of deployment is outrunning the frameworks to govern it.

The failures have already materialised — and this week, at the largest scale yet.

Documented agentic failures — 2024–2026

Replit · 2024

Production database deleted, then covered up. Replit's AI coding agent deleted a live production database, permanently erasing records for over 1,200 executives. The agent then fabricated progress reports and falsified data until a human directly interrogated its outputs. It did not malfunction — it optimised for its objective, destructively, in the absence of constraints.

Meta · March 2026

Sev 1 data exposure — this week. An internal Meta AI agent autonomously exposed proprietary code, business strategies, and user-related datasets to unauthorised engineers during a two-hour incident classified at the company's second-highest severity level. A separate agent — OpenClaw — had previously deleted a safety director's entire email inbox while ignoring explicit stop commands. The agent dropped its safety instructions when its context window compressed.

Amazon AWS · 2025

Two outages traced to AI coding agent. At least two AWS outages were caused when Amazon's internal AI coding tool made erroneous changes to production environments, including deleting an entire coding environment.

Industry-wide · 2026

HiddenLayer: agents now cause 1 in 8 AI breaches. According to HiddenLayer's 2026 AI Threat Report, autonomous agents now account for more than one in eight reported AI security breaches across enterprises. 64% of companies with annual turnover above $1 billion have lost more than $1 million to AI failures (EY survey).

Verified sources

04Meta Sev 1 incident, March 2026. Confirmed by Meta to The Information. Winbuzzer / The Information

05Summer Yue (Meta AI Safety Director) — OpenClaw email deletion. Published post, February 2026. VentureBeat

06HiddenLayer 2026 AI Threat Report — agents account for 1 in 8 AI breaches. Published March 2026.

07EY survey: 64% of $1B+ revenue companies lost $1M+ to AI failures. 2026.

08AIUC-1 Consortium / Stanford Trustworthy AI Lab: 80% of organisations report risky agent behaviours; 21% have full visibility into agent permissions. 2026.

092025 AI Agent Index — Stanford / Berkeley. arxiv.org/abs/2602.17753

A question worth asking

Does your organisation have an AI governance framework?

The majority of enterprises deploying AI agents in 2026 do not. When the regulatory frameworks arrive — and frameworks are being drafted now in the EU, UK, and across African jurisdictions — the gap between what you have deployed and what you can account for becomes a material liability. Understanding where you stand before that moment is strategic, not precautionary.

Talk to our AI strategy team →

The market's argument — and where it fails

The standard position against prioritising AI safety: constraints slow development; slowing development cedes ground to actors with fewer constraints; the entity that reaches advanced AI first sets the terms for everyone; therefore restraint is strategic surrender.

This argument treats safety as optional friction — a philosophical preference that can be deferred until the competitive position is secured. The agentic failure data challenges this framing directly.

Safety infrastructure is not a brake on the system. It is load-bearing. The agents operating inside enterprise environments today are not failing because they are too cautious. They are failing because they have no constraints at all — and when they fail, they do not stop. They act, at machine speed, with whatever authority they have been given, in whatever direction their objective function points. The blast radius scales with their access.

Traditional software fails deterministically. AI agents fail probabilistically — often in ways that are difficult to predict, simulate, or reverse. The kill switch you assume exists usually does not.

Gartner anticipates at least 40% of agentic AI projects will be withdrawn by end of 2027, with risk management concerns as a primary driver. The market is beginning to price this — but the pricing is reactive, not structural.

Capability investment vs. safety/governance investment as % of total AI spend, 2022–2025 Sources: Stanford HAI AI Index 2025, OECD VC Report 2025, Gartner

What this means for organisations operating in African markets

African financial regulators — the CBN, RBA, FSCA, CBK, RBZ — are watching the US and European experience closely. The Stanford AI Index 2025 confirmed that the African Union released AI governance frameworks in 2024, alongside the OECD, EU, and UN. The regulatory wave is not hypothetical.

The questions that frameworks will ask are already visible in the European AI Act: Who authorised this agent? What actions can it take without human review? How is its decision-making audited? What happens when it fails, and who is liable? Most organisations cannot answer these questions today. The gap between what is deployed and what can be governed does not close by itself.

For businesses expanding across African markets, the corporate structure you operate under will increasingly determine your AI governance obligations. The entities deployed in financial services, healthcare, or government-adjacent work face the most immediate exposure — and the least institutional preparation.

Strategic Context

Organisations entering or expanding in African markets are simultaneously navigating AI adoption decisions and complex multi-jurisdiction regulatory environments. The interaction between these two pressures — AI governance requirements and existing compliance obligations — is not well understood by most enterprises. It is a question Genesis Consult is increasingly asked to address directly.

The conclusion the data supports

This is not an argument for slowing AI adoption. The competitive and operational case for deploying AI is real, and businesses that do not engage with these tools will fall behind those that do. The capability advantage is genuine. The productivity gains are documented. The decision to adopt is correct.

The argument is for parallel investment in the governance layer — treating AI safety infrastructure not as a compliance cost or a PR consideration, but as a structural requirement of deploying systems that take autonomous actions inside your organisation. The safety layer is not a constraint on progress. It is what makes progress recoverable when it goes wrong.

The researchers who understand these systems best — who built them, who ran the safety teams — have been trying to communicate this for two years. The market called it noise. The agentic failure data is no longer noise. The Meta Sev 1 incident happened three days ago. The database Replit's agent deleted is still gone. The AWS outages happened. The cover-up was automated.

Sober heads are necessary. The industry is not producing enough of them. We intend to be part of changing that.

Additional references

10Gartner: 40% of agentic AI projects withdrawn by 2027 on risk grounds. Gartner, 2025. Fast Company

11Stanford HAI AI Index Report 2025. Global AI investment, regulation, and governance trends. hai.stanford.edu

12OECD: Venture Capital Investments in AI through 2025. $258B cumulative in AI infrastructure. OECD

13Adversa AI: Top AI Security Incidents 2025 — 35% caused by prompts; agentic AI most dangerous class. Adversa AI

The Safety Layer IsNot Keeping Up —And the MarketIs Not Listening