The Intent Gap

Apr 15

Your AI-generated code is degrading, and the degradation isn't a tooling problem. It's a translation problem, and every step of the chain is lossy.

I've fought against this degradation. I swap models. I rerun implementation prompts. The damage happens before the first line of code gets generated, in a chain of translations that no refactoring pass can reverse. This is a different problem than the one I wrote about in Risk Evaluation in the Age of AI-Aided Development, which is about deciding when AI acceleration is worth the technical debt. The Intent Gap is upstream of that decision.

I call the thing at the center of this the Intent Gap: the distance between what you meant and what the AI produced. The Gap is where everything fails. And once you see it, you can't unsee it.

The Lossy Translation Chain

Think about what actually happens when a human intent turns into AI-generated code. Someone has an idea. That idea gets compressed into a JIRA ticket, which is rarely more than a paragraph. The ticket gets dropped into a prompt, or fed to an agent, or expanded into a PRD that nobody reviews carefully because the AI can "just figure it out." The agent produces code. The code gets merged, maybe without anyone reading it closely, because reviewing AI output at volume is tedious and the reviewer has their own work to do.

Every step in that chain is lossy. Every step strips context that the next step has to guess at.

It's a screenshot of a screenshot. You can still see the shape of the original, but the resolution keeps dropping. And the artifact everyone references later, the JIRA ticket, is the lowest-resolution version of the whole chain. It becomes the de facto history of the decision, because the git log only shows the output, and the original reasoning lives in someone's head or a Slack thread that got archived.

The code looks fine. The build is green. Tests pass. And the actual intent, the reason any of it exists, is nowhere in the system. It's been discarded at every hop.

Why the Usual Fixes Don't Work

The instinct, when you notice the degradation, is to refactor. Run another AI pass and clean it up. Add another evaluation step. Feed the output back through a different model. This is the same move, one more time.

Refactoring AI-generated code with AI is another lossy pass. You're not recovering the original intent. You're running the degraded artifact through the translation chain again, hoping the noise averages out. It doesn't. It compounds.

I've seen this play out in practice. Teams generate code, notice it's drifting from what they actually wanted, and respond by generating more code on top. The first pass was a rough translation of a vague ticket. The second pass is a refactor of the first translation, working from the code itself as the only available context. The third pass is a cleanup of the refactor. By the time anyone looks at the result, the connection to the original intent is a rumor.

Martin Fowler recently endorsed Margaret-Anne Storey's Triple Debt Model, which proposes three categories of debt in AI-augmented development: Technical Debt in the code, Cognitive Debt in the people, and Intent Debt in the artifacts. Storey's paper makes the paradox explicit: "Generative AI may reduce technical debt while simultaneously accelerating cognitive and intent debt."

That's the dynamic I'm describing. The code gets cleaner. The intent gets harder to recover. The refactor pass reduces Technical Debt by a measurable amount and adds Intent Debt in the same motion. You're moving debt from a column your tools can measure into a column they can't.

Gap and Debt Are Not the Same Thing

Storey's framework is diagnostic. It names the condition. Intent Debt is what accumulates when intent goes unmanaged across teams and time. It's the balance sheet version.

The Intent Gap is the unit that accumulates into Debt. It's individual and it's happening in real time, every time someone hands a vague ticket to an agent and accepts whatever comes back.

This distinction matters because Gap is actionable and Debt is retrospective. You can close a gap in the moment, by forcing the intent to be explicit before the generation starts. You can only diagnose debt after the fact, by auditing what's already in your codebase and trying to reconstruct why it's there. Gap is where the leverage is.

It also matters because the mechanism that creates the Gap has a name. Shaw and Nave's research on Cognitive Surrender found that AI tools "inflate confidence even when AI is wrong." The person with the original intent stops verifying because the output feels authoritative. They become an Armchair Architect, approving artifacts they haven't actually checked against what they meant. The Gap widens because the human on the other side quietly stopped holding the line.

The Old Practices Are Coming Back

Marshall McLuhan had a concept called retrieval: new media don't just create new practices, they make old practices viable again in new forms. I think that's exactly what's happening with the software development lifecycle right now.

Spec-driven development. Design by contract. Formal verification. All of these were considered too expensive for most teams, because the cost of writing and maintaining the spec exceeded the cost of just writing the code. When the cost of code generation drops to near zero, that equation inverts. The spec becomes the expensive, valuable artifact. The code becomes the cheap build output. I've argued elsewhere that teams need a clearer map of where AI can actually carry weight versus where it can't; The AI Adoption Ladder was my first pass at that map. The Intent Gap is what determines how high up the ladder you can safely climb.

This isn't speculation. Martin Fowler and ThoughtWorks hosted a Future of Software Development retreat in February 2026 where the Triple Debt Model got formalized and practitioners named something they're calling the Middle Loop: a new category of supervisory engineering work sitting between the inner loop of writing code and the outer loop of deploying it. That's exactly the space where Intent Gap management has to live. The people closest to this shift are all circling the same structural problem.

The old practices aren't coming back because they're traditional. They're coming back because the economics that killed them have flipped.

Solving the Gap Requires a Different Foundation

Here's the part I'm not going to resolve in this post, because resolving it requires a fundamentally different relationship between specs and code than most teams currently have.

If you want to close the Intent Gap, the spec has to become the source of truth and the code has to become a build artifact. Not the other way around. That means the intent gets captured in a machine-readable form, the agent generates code from that spec, and the verification layer holds deterministic gates the agent can't weaken, reinterpret, or route around.

That's a big swing. It changes how PRDs work, how code review works, how git history works, and how accountability for intent distributes across a team. I'm building toward that shape, and I'll write about the mechanics of it when I have more to show.

For now, the useful thing is to see the Gap clearly, because seeing it changes what you do next.

What to Do Tomorrow

Pick one AI-generated pull request that shipped in the last week. Open the JIRA ticket that kicked it off. Read the original ask. Then read the merged code.

Write down, in one sentence, the difference between what was asked for and what got shipped. Not the bugs. Not the style issues. The semantic difference between the intent and the artifact.

That's your Intent Gap for that change. It's probably larger than you expected, and it's sitting in your main branch right now, quietly becoming the system of record for a decision nobody is checking.

You can't refactor your way out of it. You can only close it at the source.

Related Content

Featured

July 14, 2026

Enterprise AI Pricing Is a Governance Decision

July 14, 2026

July 7, 2026

Culture Is the Only Proprietary Layer

July 7, 2026

Every agent company hires from the same labor pool. You and your competitor employ literally the same workers: the same frontier models, refreshed quarterly by the same vendors. Raw capability is identical by construction. So what differentiates an agent company from a generic agent?

Culture. Externalized into documents.

July 7, 2026

June 30, 2026

Human Review Is Intent Review, Not Diff Review

June 30, 2026

We still assign a human reviewer to every pull request. The human opens the diff, scrolls, approves. That ritual is already dead. Most teams are just propping up the corpse.

The question underneath it, the one nobody wants to say out loud: what is a human code review even for once agents write the code?

June 30, 2026

June 23, 2026

The Harness Eats the Coding

June 23, 2026

The most valuable thing I do as an engineer right now isn't writing code. It isn't even reviewing code. It's building the harness that lets the agent verify its own work before it asks me to look at it.

June 23, 2026

June 16, 2026

The Iteration Loop Got Longer. That Changed Everything.

June 16, 2026

The thing nobody talks about with AI-assisted development isn't the models. It's the cycle time. The agent's iteration loop got longer, the right way to work changed, and most people are still working as if the loop is two seconds long.

June 16, 2026

June 9, 2026

Your Laptop Is Just a Portal

June 9, 2026

My laptop is a four-year-old Dell XPS 15 with 16 gigs of RAM. Fine for normal work. Not fine for running Windows, WSL, a real codebase, a Claude session, and a browser at the same time. It came to a head over Thanksgiving last year, when I was accidentally on the road for three weeks and couldn't get serious work done. WSL on 16 gigs just exploded.

The first fix was offloading development to an EC2 instance. That worked, but the monthly bill kept climbing and the hardware was still anemic for what I actually needed. So I bought a remote dev box for the home lab and moved everything off the EC2.

That's the boring origin story. The interesting part is what the setup unlocked.

June 9, 2026

June 2, 2026

Tickets Are the New Prompts

June 2, 2026

I haven't written a Linear ticket by hand in six months. I don’t write the majority of my Claude prompts. The two stopped being separate things. The ticket is the prompt.

June 2, 2026

May 26, 2026

The Amdahl's Law Problem in AI-Assisted Development

May 26, 2026

AI did not make the whole software delivery system faster.

It made one stage louder.

That is the part missing from most productivity conversations right now. A developer gets a coding assistant, the coding step accelerates, and everyone acts like the entire SDLC should accelerate by the same amount. Then review queues grow. Test failures pile up. Deployment gets riskier. Senior engineers spend more of their day reconstructing intent from code that looks plausible but does not quite match the system.

That is not a paradox. That is Amdahl's Law doing exactly what Amdahl's Law does.

Speed up one stage in a constrained system, and the bottleneck moves.

May 26, 2026

May 19, 2026

Concentric Feedback Loops: How AI Agent Teams Actually Ship Code

May 19, 2026

I've been rebuilding one of my Claude Code workflows because the old version was too linear.

That sounds like a small implementation detail. It isn't. It points at the part of AI-assisted development that most teams are about to run into: once agents can do real work for hours, strict phase gates start getting in the way of the feedback loops that make the work safe.

The normal development cycle is familiar: requirements, plan, plan review, implementation, tests, peer review, more implementation, more tests, security review, architecture review, integration testing, end-to-end testing. We pretend this is a clean sequence because it is easier to write down that way.

It has never been that clean.

The work has always been loops. AI agent teams just make the loops visible.

May 19, 2026

May 12, 2026

Your Team's AI Metrics Are Lying to You

May 12, 2026

Your engineering team adopted AI coding tools six months ago. Deployment frequency is up. Lead time is down. PRs are flying through the pipeline. Everyone feels faster.

But are they?

I've been digging into the data across multiple client engagements, and there's a growing gap between what AI-assisted engineering teams perceive and what the numbers actually show. The metrics most teams celebrate are painting an incomplete picture, and the metrics that would tell the real story are the ones nobody's watching.

May 12, 2026

aiengineeringleadership

Brian Conn https://connsulting.io

The Intent Gap

The Lossy Translation Chain

Why the Usual Fixes Don't Work

Gap and Debt Are Not the Same Thing

The Old Practices Are Coming Back

Solving the Gap Requires a Different Foundation

What to Do Tomorrow

Related Content

Connsulting

About

Offerings

The Intent Gap

The Lossy Translation Chain

Why the Usual Fixes Don't Work

Gap and Debt Are Not the Same Thing

The Old Practices Are Coming Back

Solving the Gap Requires a Different Foundation

What to Do Tomorrow

Related Content

The SDLC is Rediscovering Itself

Tests as Ceremony: When AI Breaks the Safety Net

Connsulting

About

Offerings