Risk Evaluation in the Age of AI-Aided Development
Engineering teams have always made judgment calls about risk and speed. With AI development tools becoming standard practice, that judgment call has gained a new dimension, one that demands more careful consideration.
The Traditional Risk Equation
Before AI tools entered the picture, risk evaluation in software development was already a balancing act. Consider a straightforward example: building a new web page using existing APIs. This is typically low-risk work. If something goes wrong, the business impact is contained. You can iterate quickly without catastrophic consequences.
Even for low-risk features, smart teams use de-risking mechanisms (the first two are sketched in code after this list):
- Create the new page without adding it to navigation menus
- Share a direct URL for internal testing with real data
- Duplicate and refactor instead of modifying in-place
- Deploy to staging environments first
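To make the first two mechanisms concrete, here's a minimal sketch using Express (the route paths, handler, and nav data are illustrative, not from any particular codebase): the new page is a fully working route you can share as a direct URL, but it never appears in the data that drives the navigation menu.

```typescript
import express from "express";

const app = express();

// Navigation config drives the menus; the new page is simply left out,
// so only people with the direct URL will find it.
const navLinks = [
  { label: "Home", path: "/" },
  { label: "Reports", path: "/reports" },
  // /reports-v2 is intentionally absent while it's under internal test.
];

app.get("/api/nav", (_req, res) => {
  res.json(navLinks);
});

// The unlinked page is still a fully working route against real data,
// shareable internally for testing before it goes public.
app.get("/reports-v2", (_req, res) => {
  res.send("New reports page - internal preview");
});

app.listen(3000, () => console.log("Listening on :3000"));
```

When the page is ready, "launching" it is a one-line change to the nav config, and rolling it back is equally cheap.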
Backend changes and database modifications sit on the opposite end of the risk spectrum, carrying higher business impact when they fail. De-risking here looks different: feature flags, environment variables, staged rollouts, and thorough review processes.
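The simplest of these back-end mechanisms is the environment-variable feature flag. Here's a minimal sketch, with illustrative names throughout: the risky new code path runs only when the flag is explicitly enabled, so reverting means flipping configuration rather than rolling back a deploy.

```typescript
interface Invoice {
  id: string;
  amountCents: number;
}

// Known-good implementation stays deployed as the fallback.
async function fetchInvoicesV1(customerId: string): Promise<Invoice[]> {
  return [{ id: `${customerId}-legacy`, amountCents: 1000 }];
}

// New, higher-risk implementation that ships behind the flag.
async function fetchInvoicesV2(customerId: string): Promise<Invoice[]> {
  return [{ id: `${customerId}-v2`, amountCents: 1000 }];
}

// Read the flag once at startup; the new path is off unless explicitly enabled.
const useNewBillingQuery = process.env.ENABLE_NEW_BILLING_QUERY === "true";

async function fetchInvoices(customerId: string): Promise<Invoice[]> {
  return useNewBillingQuery
    ? fetchInvoicesV2(customerId)
    : fetchInvoicesV1(customerId);
}

// Set ENABLE_NEW_BILLING_QUERY=true in one environment at a time.
fetchInvoices("cust-123").then((invoices) => console.log(invoices));
```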
This risk evaluation has always determined our development approach. How much peer review do we conduct? How detailed are our PRDs? How much architectural planning happens upfront? These decisions trade speed against thoroughness, with the stakes determined by the risk level of what we're building. Understanding which project risks to tackle first helps teams prioritize their uncertainty management efforts.
AI Development Adds Another Tier
AI-aided development tools create a new tier in this risk evaluation framework. According to Stack Overflow's 2025 Developer Survey, 63% of professional developers now use AI in their workflows, making this shift unavoidable rather than optional. You can now offload work to an AI agent, but with varying levels of oversight:
Lower Risk Path:
- Generate a detailed PRD
- Review the PRD with the AI tool
- Make smaller, incremental commits
- Revise and refine iteratively
(Managing requirements across multiple sources becomes easier when you leverage AI tools to consolidate project context.)
Higher Speed Path:
- Provide the AI with a JIRA ticket
- Accept the generated code with minimal review
- Ship quickly to appropriate environments
The critical question becomes: how thoroughly do you review AI-generated code before it reaches production? For backend or database changes, thorough human review remains essential. For lower-risk features like prototypes, your tolerance for shipping lightly reviewed code increases. This isn't reckless. It's a calculated decision based on context.
This Isn't New, It's Evolved
The fundamental judgment call hasn't changed. We've always balanced development speed against risk. Before AI, you could accept limited additional risk for speed: using a new framework without deep expertise or reducing review cycles for time-sensitive features.
AI development tools extend this spectrum dramatically, letting you accelerate development even further while accepting a different kind of risk: code that may not have been fully reviewed by a human developer. Research from GitHub and Accenture demonstrates that developers complete tasks 55% faster with AI assistance, validating the speed gains.
Yet developer confidence is cratering. The Stack Overflow survey found trust in AI accuracy dropped from 40% in 2024 to just 29% in 2025, while 66% report spending significant time fixing code that's "almost right, but not quite." The speed is real. So is the erosion of confidence.
The Tech Debt Reality
This additional risk comes with a real price tag: technical debt at scale. GitClear's analysis of 211 million lines of code from 2020 to 2024 reveals alarming patterns. Code clones (duplicated code blocks) increased four-fold during this period. Code refactoring collapsed from 25% of changed lines in 2021 to less than 10% in 2024. Meanwhile, research from Apiiro found that teams using AI coding assistants achieve 4x velocity improvements but introduce 10x more vulnerabilities.
Let that sink in. Four times more duplicated code. Less than half as much refactoring. This isn't about messy formatting or style inconsistencies. This is about creating maintenance nightmares where you fix the same bug in four different locations, where architectural improvements become prohibitively expensive, where technical debt compounds faster than your team can pay it down.
The speed gains are seductive. The long-term costs are brutal. Many teams focus on incident response speed without addressing the upstream processes that create problems in the first place. AI-generated code that looks functional today can introduce subtle architectural issues that multiply into production incidents months later.
This doesn't mean avoiding AI tools. It means acknowledging that the risk you accept today for a "quick prototype" will haunt your codebase tomorrow. For genuinely disposable code, that's acceptable. For production systems with multi-year lifespans, you need radically different standards.
Security Remains Non-Negotiable
Accepting more risk for development speed does not mean skipping security checks. AI-generated code can introduce vulnerabilities that aren't obvious in functional testing. Your CI/CD pipeline must catch these automatically through security scanning, static analysis, and dependency checks. Don't trade security for speed, even for prototypes that might evolve into production features.
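Here's what "catch these automatically" can look like in practice: a small gate script the pipeline runs before merge. This is a sketch, not a prescribed setup; it assumes a Node project, relies on real CLI behavior (`npm audit --audit-level=high` exits nonzero when high-severity advisories exist), and the ESLint call stands in for whichever static analyzer your stack actually uses.

```typescript
import { execSync } from "node:child_process";

// Each command exits nonzero on failure; execSync then throws,
// which fails the CI job and blocks the merge.
const checks: Array<[string, string]> = [
  // Dependency audit: fails on high-severity (or worse) advisories.
  ["dependency audit", "npm audit --audit-level=high"],
  // Static analysis: substitute your linter or SAST tool of choice.
  ["static analysis", "npx eslint ."],
];

for (const [name, command] of checks) {
  console.log(`Running ${name}: ${command}`);
  try {
    execSync(command, { stdio: "inherit" });
  } catch {
    console.error(`${name} failed - blocking this change.`);
    process.exit(1);
  }
}

console.log("All security gates passed.");
```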
Practical Guidance
When evaluating risk with AI-aided development:
For Low-Risk Features:
- New UI pages without complex business logic
- Internal tools and admin interfaces
- Prototype features you're testing for product-market fit
- Documentation and content generation
Accept faster AI development with lighter review. Use de-risking mechanisms like hidden pages and staged rollouts, but still run security scans and static analysis.
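Staged rollouts deserve their own sketch, since they pair well with lightly reviewed, low-risk code: expose the feature to a small, stable slice of users and widen the percentage as confidence grows. The hash-bucketing below is one common approach, not any particular product's API.

```typescript
import { createHash } from "node:crypto";

// Deterministically map a user to a bucket in [0, 100). The same user
// always lands in the same bucket, so their experience stays stable
// as the rollout percentage widens.
function rolloutBucket(featureKey: string, userId: string): number {
  const digest = createHash("sha256")
    .update(`${featureKey}:${userId}`)
    .digest();
  return digest.readUInt32BE(0) % 100;
}

// Start at, say, 5%, then widen to 25%, 50%, 100% as confidence grows.
function isEnabled(featureKey: string, userId: string, percent: number): boolean {
  return rolloutBucket(featureKey, userId) < percent;
}

// Example: roll the new dashboard out to 5% of users.
console.log(isEnabled("new-dashboard", "user-42", 5));
```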
For High-Risk Changes:
- Database schema modifications
- Authentication and authorization logic
- Payment processing and financial transactions
- Core business logic and algorithms
Maintain rigorous review standards. AI can draft the code, but human expertise must validate it thoroughly. Security scanning and static analysis remain mandatory, not optional.
For Everything In Between:
- Assess the blast radius if something fails (see the rubric sketch after this list)
- Calculate the real cost of fixing issues in six months when context is lost
- Evaluate honestly whether your team can understand and maintain AI-generated code
- Review AI output more carefully for domains where the tool has shown weaknesses
- Always run automated security checks regardless of feature risk
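This evaluation doesn't have to stay informal. As a purely illustrative sketch (the factors and thresholds here are mine, not drawn from the research cited above), you can encode the questions in this list as a rough rubric that suggests a review posture for each change:

```typescript
// An opinionated rubric. Every factor and threshold is illustrative,
// meant to make the trade-off explicit, not to automate the judgment.
interface ChangeRisk {
  blastRadius: "isolated" | "team" | "customer-facing" | "business-critical";
  touchesAuthOrPayments: boolean;
  expectedLifespanYears: number;  // disposable prototype vs. long-lived system
  teamUnderstandsOutput: boolean; // can reviewers actually maintain this code?
}

type ReviewPosture = "light review + security scan" | "full human review";

function reviewPosture(change: ChangeRisk): ReviewPosture {
  if (change.touchesAuthOrPayments) return "full human review";
  if (change.blastRadius === "business-critical") return "full human review";
  if (change.expectedLifespanYears >= 2 && !change.teamUnderstandsOutput) {
    return "full human review";
  }
  // Security checks still run either way; only human review depth varies.
  return "light review + security scan";
}

// Example: a customer-facing prototype the team expects to throw away.
console.log(
  reviewPosture({
    blastRadius: "customer-facing",
    touchesAuthOrPayments: false,
    expectedLifespanYears: 0.5,
    teamUnderstandsOutput: true,
  })
); // -> "light review + security scan"
```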
Understanding whether your team members thrive as generalists or specialists helps you match AI-assisted work to the engineers who can review it effectively.
The Bottom Line
Here's what the data tells us: 63% of developers now use AI tools, and I expect that number only to grow. Developer confidence in those tools dropped from 40% to 29% in a single year. Code duplication quadrupled. Refactoring fell by more than half. Two-thirds of developers are fixing AI-generated code that looked right but wasn't.
This isn't a crisis. It's a signal. AI development tools are powerful accelerators that come with hidden costs we're only beginning to measure. The teams that thrive won't be the ones who ship fastest. They'll be the ones who understand exactly when to accept AI-generated code and when to demand human expertise.
Use AI tools for what they excel at: accelerating low-risk development, generating boilerplate, prototyping quickly. Reject them for what they struggle with: architectural decisions, security-critical code, complex business logic. And never, under any circumstances, skip automated security scanning because you trust the AI output.
The speed is seductive. The technical debt is real. The judgment call is still yours.

