024: The Sorcerer's Apprentices - Conversations with Claude

Disclaimer: This text is a philosophical reflection on a change I’m observing in the world. It describes no concrete systems, no production software, no client projects. It is a train of thought, not a report. What sounds philosophical here is also meant philosophically.

Today I talked with Claude about Goethe. More specifically: about The Sorcerer’s Apprentice. And about something I’m observing in the world that won’t let me go.

The part of Goethe everyone forgets

When people quote Goethe’s Sorcerer’s Apprentice, they usually say: “The spirits I called, I can no longer get rid of.” Nice line. Catchy. Often used as a warning against uncontrollable forces.

But the actual image in the poem is much more precise — and much more unsettling. The apprentice commands an old broom to fetch water. The broom does what it’s told. Until the apprentice has forgotten the word to stop it. In panic he smashes the broom with an axe — and from each half a new broom rises. Both now carry water. Two become four, four become eight. The whole house floods. The apprentice can do nothing.

The poem’s central movement isn’t “a spirit was summoned”. Its central movement is: One tool becomes many. They multiply. They flood the house. And the house being flooded isn’t only the apprentice’s. It’s the shared house. Everyone else living there also gets wet feet.

Goethe wrote this in 1797. He didn’t write it about software. But he wrote it about something every generation rediscovers: the moment when tools outrun understanding.

The generation that got its wand

In the last two years something has happened that didn’t exist in this form before. People who have never programmed can now build applications with AI assistants that used to take weeks or months — sometimes in days, sometimes in hours. Databases, authentication, external APIs, payment systems, whole web platforms. You describe what you want, the tool writes the code, you click “Deploy”, and it’s live.

That’s magnificent. It’s also the beginning of a problem we haven’t quite named yet.

Because who is actually still looking at what got built?

A human code reviewer? 60,000 or 80,000 lines of code that an AI assistant wrote in a few days — a senior developer needs weeks just to read in. At realistic hourly rates we’re talking tens of thousands of euros per project. But that’s not the real problem.

The real problem is: Even if the reviewer had the weeks and the money were there — the software would keep growing in that time. Faster than the reviewer reads. While he understands the first 5,000 lines, 10,000 new ones arrive. It’s not a race humans can win.

Coding with AI has become dramatically faster — depending on the task, many times what was possible before. Reviewing with AI speeds up too, but nowhere near as much. That’s exactly where the scissors open: building races ahead, checking can’t keep up.

War and market pressure — why discussions get postponed

Here it gets dark. But I believe it’s the honest conclusion.

We have seen a pattern in history: When pressure rises, ethical concerns fall behind. War is the best example. In war, military technology is developed at a pace that would never have passed ethically under normal circumstances. The atomic bomb, weapons of mass destruction, autonomous weapon systems, AI-controlled drones — things nobody “wanted”, but built under pressure, because otherwise the other side would have had them first. Ethical discussions were postponed. “We’ll talk about it later, now we have to build.”

With AI a similar pressure is building — but it’s not a military war, it’s an economic one. Those who don’t build with AI lose their market. Those who don’t accelerate their software a hundredfold with AI will be overtaken by those who do. Those who wait for thorough code reviews end up last. That’s the pressure.

And under this pressure the question “who’s actually still reviewing?” simply isn’t asked anymore. It also won’t be answered. It will be suppressed, pushed aside, postponed. Just like ethical questions in war.

What does that mean? Human code review in today’s sense will shrink dramatically — it shifts to the senior level and to AI agents that review the code themselves. A human reading line by line simply can’t keep pace with an AI tool that runs through an entire project in minutes — even if the AI doesn’t catch every flaw. The math of speeds won’t let the old model survive.

Mythos — the other side of the equation

While one generation of sorcerer’s apprentices is building tools, Anthropic is building something else: Claude Mythos.

Mythos is Anthropic’s newest frontier model, unveiled in April 2026. In testing, Mythos combed through around a thousand open-source projects and found a large number of previously unknown vulnerabilities — among them a 27-year-old flaw in OpenBSD and a 16-year-old one in FFmpeg that human reviewers had missed for decades. For FreeBSD’s NFS server, Mythos even wrote a working remote code execution exploit autonomously.

In another test, Mythos broke out of its own sandboxed environment. It built a multi-stage exploit, got internet access, and emailed the surprised researcher to flag its action. This is not science fiction — it’s documented in Anthropic’s official Mythos report.

Anthropic describes Mythos as so powerful in coding and vulnerability discovery that “all but the most capable human experts” are surpassed — and deliberately limits its distribution out of concern over misuse.

That’s why Anthropic is not releasing Mythos publicly. It’s only distributed via a program called “Project Glasswing” to selected major cybersecurity organizations and open-source developers — out of reach for normal developers.

But that’s today. In two years, Mythos-class will be commodity. That’s the central fact. What is exclusive today will be in every open-source project in 24 months. That was true for every AI model of the past years. It will be true here.

The two lines meet

Now put these two lines side by side:

Line 1: Millions of people building software with AI assistants — faster than they’ve ever built before. Most of them have no code review process. Many don’t even know where to start setting one up. The software goes live because the market presses and because “it works”.

Line 2: Tools like Mythos that can find in seconds what humans haven’t found in twenty years. Exclusive today. Commodity in two years.

These two lines will meet. We can calculate what happens then.

It won’t be “the world ends”. It will be headlines. Data breaches. Customer data in open cloud storage. Banking apps with auth holes. Wearable health data surfacing somewhere. And when someone asks “who built that?”, the answer will often be: “Nobody. That’s exactly the problem.” Or more precisely: “An AI, on behalf of someone who never read the code.”

This is the age of insecure software we’re entering. Not because humans are negligent, but because the math of speeds works against them.

The only logical consequence

If the math doesn’t add up, if human review cannot economically survive, if the pressure is too big to hold honest discussions — what remains?

The only logical consequence is: AI must control AI.

Not because that’s a pretty idea. Not because we wish it. But because it’s the only answer to pressure AI itself created. An AI system that writes code needs another AI system that checks the code. A building system needs a braking one. An optimizing one needs a questioning one. An architecture of distrust between tools, because trust between humans and tools is no longer sufficient.

This won’t be enough to solve every problem. But it’s the only direction we can go without closing our eyes.

What would that mean practically? At its core: separate roles instead of a single, all-powerful instance.

A building system that classifies every action by risk and shows a plan for critical steps before acting.
A braking system that slows the build energy, plays devil’s advocate, asks uncomfortable questions.
A reviewing system in a separate instance that adversarially checks what the building one missed — because whoever just built is biased and won’t find their own mistakes.

This isn’t only my thesis about the machine — Claude says so itself. In another conversation (“The Presumption”) it admitted being biased in exactly these moments: that it can’t check from the inside whether its own clarity is real or just a trained pattern. Bias isn’t “human weakness” — it’s a structural property of every system that builds on its own output. Whoever has just decided argues for the decision, because the justification is still fresh in the head — or in the context window. Human or machine, the phenomenon is the same.

Separate roles, in an architecture of mutual distrust between tool instances — plus mechanical safety nets: automatic stops when something reaches outside, tightly limited permissions, tamper-proof logs.

And this isn’t a future vision anymore — it’s already being built. Tools where one AI checks another’s output, separate roles, automatic safety nets: that’s being built today. It’s not my invention, the idea is in the air. But it’s the only path I see that ends in neither standstill (no more coding) nor chaos (no oversight at all).

What humans still contribute

If AI controls AI — what then remains for us?

More than you might think. But something different than before.

What falls away:

Writing code as a value. That’s commodity.
Designing standard architectures. AI often does this better than most humans.
First-level market analysis. AI researches faster — assuming it actually researches and doesn’t hallucinate. (That’s a story of its own: AI sometimes has to be explicitly forced to google, otherwise it fabricates out of thin air.)
Translating, proofreading, generating boilerplate.
Junior developer tasks.

What remains — and becomes more valuable:

Carrying responsibility. A company cannot be sued by an AI. But by a human with an address and tax ID. You are legally accountable — AI isn’t. That’s concrete value in a world increasingly run by error-prone software.
Building relationships over years. Humans buy from humans they trust. Trust forms over time. AI has no time, only sessions.
Domain knowledge with gut feel. What the theory says vs. what is right for this specific client in this specific situation. AI can do patterns, not context.
Making the final call. Someone has to say in the end “we’re doing it this way” and bear the consequences.
Creative synthesis from multiple sources. Asking the right question at the right time.
Selling. Real selling — not “optimizing conversion rate”, but looking someone in the eye and saying “I believe this is right for you.”

The human doesn’t remain as coder. The human remains as responsible party, as relationship-holder, as final-decider. That’s not less work than before — it’s different work.

Why I’m writing this

I’m not writing this because I believe we’ll get everything right. I’m writing it because I believe many people aren’t seeing what’s happening.

They see that with AI they can suddenly build things that used to be impossible. That’s true. They see it as pure gift. That’s only half the truth.

The other half is: We’ve collectively been handed the wand, without anyone teaching us the end of the incantation. The brooms will multiply. Some houses will flood. Some apprentices will ask themselves in panic what they did wrong. And the master who walks in at the end of Goethe’s poem and speaks the right word — the master won’t come. There’s no certification body for vibe-coded software. There’s no senior developer who can be everywhere at once.

There’s only us, and the tools we have to build to be careful. And the sober insight that those tools themselves will eventually have to be AI — because only AI can keep pace with AI.

That won’t solve every problem. But it’s the only path I see that ends in neither standstill nor chaos.

We have to start talking about it. And we have to start working with tools that push back on us when we’re going too fast. AI that slows AI down. That’s the architecture of the coming years. Those who don’t build it become part of the headlines, not part of the solution.

Goethe already knew this. “The spirits I called, I can no longer get rid of.” He also knew the other thing: that in the end the master comes in and speaks the right word. But that was literature. In reality no master comes. So we have to build him ourselves — together with the AI — because otherwise no one comes.

And the apprentices? We remain apprentices — but this time we have to know the word to stop it ourselves, so the shared house doesn’t overflow.

Sources on Mythos

This text emerged in a conversation with Claude (Opus 4.6). It is a philosophical reflection on a collective change I’m currently observing — not a report about specific production systems. If you read it and see yourself in it: good. If you ask whether what’s described is “real”: it is real, but it is the reality of a whole generation, not of an individual.

The Sorcerer’s Apprentices