Anthropic’s Three Judgments and a Hidden Agenda: When AI Writes 80% of Its Code

You’ve probably seen it—that sprawling Anthropic essay everyone’s sharing, titled “When AI Builds Itself.” It’s packed with internal numbers they’ve never made public. But instead of geeking out over how smart AI is, let’s ask the only question that matters: “So what? What should I do about it?”

Here are three judgments the article points to, plus one thing they’re not telling you.

Judgment 1: The price of execution is falling to zero.

Anthropic now merges code where more than 80% was written by Claude. Two years ago, a team showing 10–20% AI-written code was considered cutting-edge. Inside Anthropic, it was single digits even in early 2025—because their engineers were skeptical of AI’s messy output. Fast forward one year: the ratio flipped. Not by a little, but by 8x in lines of code merged per developer per day.

Now, 8x doesn’t mean 8x productivity—like judging a chef by how many dishes they plate per hour, ignoring quality. But when every line passes human review, the volume speaks. The real shift is in the nature of work. The programmer used to be a craftsman: understand, code, test, submit. Now they’re a technical supervisor: set a goal, let AI write, run tests, fix bugs, then decide whether to approve.

Execution is becoming a commodity. Like photography: when everyone has a phone that auto-focuses and auto-filters, your edge isn’t shutter speed—it’s what you choose to shoot and why.

Judgment 2: The scarce skill is accepting AI’s output.

If you can’t tell good code from bad code, you’re not supervising—you’re just rubber-stamping. Anthropic’s engineers don’t write most of the code anymore, but they spend more time on review and alignment. They check: “Is this doing what we actually want? Is there a hidden bug? Is it coherent with the overall system?”

This is the new bottleneck. It’s not about how fast you code—it’s about how well you judge. And judgment is not something AI can hand you. It comes from deep knowledge of the domain, the business, the users. That’s why the most valuable people in an AI-powered team won’t be the fastest typers—they’ll be the ones who can say “no” with confidence, or point out a better direction.

Judgment 3: Human value is shifting from “making” to “defining.”

When AI handles the heavy lifting, the human role becomes more architectural. You set constraints, define goals, establish boundaries, and decide what “done” looks like. This isn’t just about coding. It applies to writing, design, strategy—any field where AI can produce first drafts.

The danger? If you only know how to execute, you’ll be replaced by a model that executes faster. But if you know what to execute, you become indispensable. That’s the new asymmetry.

The hidden agenda: They’re selling you the new standard.

Anthropic shared those staggering numbers for a reason. It’s not just transparency—it’s a play. By showing how 80% of their code is AI-written, they’re setting a benchmark. Every CTO reading this thinks: “If Anthropic does it, maybe we should too.” And once companies adopt AI-heavy workflows, they naturally align with the tools that define how AI code is evaluated—Anthropic’s safety framework, their model’s strengths, their methodology.

This is the “yangmou” (open strategy): Making AI self-building seem inevitable, while positioning themselves as the default arbitrator of quality and safety. You may think you’re adopting AI to boost productivity. But you’re also adopting their rules.

So what do you do?

First, stop optimizing for execution speed. Start investing in your ability to judge and define. Second, treat AI tools like junior teammates—you still need to review their work. Third, stay aware of whose standards you’re adopting. The most dangerous thing is to outsource not just the doing, but the deciding.

The price of execution is falling to zero. The price of wisdom is rising. That’s the only math that matters.