Stop Fighting the Wrong Battles - The Three-Level Problem Framework

Dec 9

Most engineering teams waste weeks solving the wrong problems.

They polish user interfaces while core APIs fail. They optimize conversion funnels while databases crash. They redesign onboarding flows while authentication randomly breaks.

This happens because everything gets labeled "high priority" without any systematic way to determine what actually needs fixing first.

Here's a three-level framework that immediately clarifies what to fix first, what can wait, and what's wasting everyone's time.

The Problem with "Everything is Important"

Most teams treat all problems equally. Bug reports, integration failures, and user experience complaints all land in the same backlog with the same "high priority" label.

This creates analysis paralysis. Teams bounce between fixing individual bugs, debugging integration issues, and redesigning user flows without any clear prioritization logic.

The result? Critical infrastructure problems linger while teams polish features that don't even function reliably. Often these issues run deeper than execution speed—as I explore in The Upstream Root Cause Problem, most production chaos traces back to product requirements, architecture decisions, and development processes made earlier in a project's lifecycle.

The Three-Level Problem Framework

Every problem in your system falls into one of three levels, each requiring a different approach and timeline:

Level 1: Individual Component Failures (Fix Immediately)

These are broken parts. Bugs, service crashes, database connection failures. Individual components that simply don't work.

Resolution timeline: Hours to days Why fix first: Nothing else matters if core components are broken Action: Log, assign, fix, verify

Examples:

Authentication service throwing 500 errors
Payment processing failing silently
User data not saving to database
Search returning no results

Level 2: Integration Breakdowns (Fix Urgently)

Components work individually but fail to communicate properly. Service A can't talk to Service B. Data flows break between systems.

Resolution timeline: Days to a week Why fix second: End-to-end functionality is impossible without proper integration Action: Identify involved components, facilitate cross-team collaboration, diagnose interaction failures

Examples:

User registration creates account but doesn't send welcome email
Order placement succeeds but inventory doesn't update
New user data syncs to CRM but not to billing system
Mobile app can authenticate but can't fetch user preferences

Level 3: User Experience Issues (Fix Strategically)

Everything works technically, but the experience is slow, confusing, or frustrating. These are the "polish" problems.

Resolution timeline: Weeks to months Why fix last: You can't evaluate true UX until underlying systems are stable Action: Gather user feedback, analyze experience patterns, prioritize based on business impact

Examples:

Slow page load times (when all APIs work correctly)
Confusing navigation flow
Poor mobile responsiveness
Unclear error messages

Why This Order Matters

The levels build on each other. You cannot:

Fix integration issues while individual components are broken
Evaluate true user experience while systems randomly fail
Make informed UX decisions with unreliable data flows

Without this framework, teams optimize conversion rates while underlying systems fail. They A/B test headlines while APIs return errors. They redesign interfaces while core functionality breaks randomly. This sequencing challenge connects directly to The Risk Funnel—the principle that your biggest uncertainties should be validated first, not last, because deferring critical decisions compounds risk exponentially.

The business impact is immediate. User experience improvements become meaningless when built on unstable foundations.

The Business Impact Override

This framework isn't absolute law. Sometimes a Level 3 issue blocking major revenue outranks a Level 1 bug in a rarely-used feature. A confusing checkout flow costing $5,000 per day deserves more attention than a broken admin panel feature used by two people monthly.

The key is making these exceptions explicit. When you override the framework, document why. "We're fixing this Level 3 checkout flow issue before the Level 1 admin bug because it's blocking $35,000 in weekly revenue." This transparency prevents exception-based prioritization from becoming the norm.

Teams Can Work Multiple Levels Simultaneously

This framework isn't about absolute sequencing. It's about proportional urgency and resource allocation.

A team of ten engineers might assign six to Level 1 issues, three to Level 2 integration work, and one to Level 3 improvements. The framework tells you where to concentrate most of your effort, not where to concentrate all of it.

The mistake teams make is inverting these ratios. They put eight engineers on Level 3 polish while two people struggle with Level 1 failures. The framework prevents that inversion by making the imbalance visible.

How to Apply This Today

Step 1: Audit your current issue backlog

Go through your bug tracker and categorize every issue:

Level 1: Broken individual components
Level 2: Integration/communication failures
Level 3: User experience improvements

Step 2: Implement the priority cascade

All Level 1 issues become immediate blockers
Level 2 issues get urgent status but wait for Level 1 completion
Level 3 issues move to a "strategic improvement" backlog

When this hits reality, you'll need to shed tactical work. Tactical Work Shedding addresses exactly this scenario—how to preemptively identify which items can be cut when (not if) estimates slip, protecting your core deliverables while maintaining credibility with your team.

Step 3: Change your team's vocabulary

Stop saying "high priority bug." Start saying:

"Level 1 blocker" (drop everything)
"Level 2 integration issue" (urgent but sequenced)
"Level 3 improvement" (important but strategic)

Step 4: Adjust your meeting focus

Daily standups: Level 1 and Level 2 status only
Weekly planning: Level 3 roadmap discussion
Retrospectives: Process improvements for faster Level 1/2 resolution

This vocabulary change creates the shared mental model your team needs for synchronized incident response. For teams scaling their incident handling capabilities across multiple groups, Two-Phase War Games provides a framework for building this muscle memory systematically—first within homogeneous teams, then across system boundaries.

Start Using This Right Now

Open your issue tracker. Pick your five highest priority items. Categorize them using the three levels.

I guarantee you'll find at least one Level 1 or Level 2 issue disguised as a Level 3 improvement. Fix those first.

Your users don't care how beautiful your interface is if your core functionality doesn't work reliably.

The debates you'll have about categorization are valuable. When someone argues that a Level 3 issue should be Level 2, that conversation surfaces hidden assumptions about user impact, system dependencies, and business priorities. The framework's real value isn't perfect categorization. It's creating a shared language that makes priority decisions explicit instead of implicit.

Start categorizing today. The clarity you gain in the first week will pay dividends for months.

Related Content

Featured

Mar 11, 2026

Claude Code as an Operational Partner for DevOps

Mar 11, 2026

People are building incredible things with AI coding tools. But there's a quieter, equally powerful use case: using Claude Code as an operational partner. DevOps work is half investigation, and AI coding tools are remarkably effective at analysis, script generation, and iterative diagnostics alongside a human who handles execution and judgment.

Mar 11, 2026

Mar 3, 2026

The AI Adoption Ladder - A Practical Framework for Engineering Teams

Mar 3, 2026

Most AI adoption failures share the same origin story: someone tries the hardest possible task, it fails spectacularly, and they declare they'll "come back next year." This happens constantly because teams lack a mental model for sequencing adoption.

After helping engineers navigate AI integration, I've developed a staged approach I call the AI Use-Case Ladder. It sequences adoption by risk and blast radius, building confidence and literacy before touching anything that could damage production systems.

Mar 3, 2026

Feb 24, 2026

The Three Levels of AI Product Integration - A Framework for SaaS Leaders

Feb 24, 2026

SaaS companies are all AI companies these days. How deep does does that AI integration really go, though?

They bolt on a "Generate with AI" button, watch users test their hardest problems, and wonder why adoption craters after week one. The issue isn't AI capability. It's integration depth. After working with multiple SaaS teams on AI implementations, I've seen a clear pattern: companies that understand how deeply AI should touch their product consistently outperform those chasing the latest demo.

Here's the framework I use to help teams navigate this decision.

Feb 24, 2026

Feb 17, 2026

The Three Pillars of Scalable Data Processing

Feb 17, 2026

Every unit of work in a data processing system should aspire to be small, independently processable, and consistently sized. When these three properties hold, scaling becomes almost trivially simple. Reality rarely cooperates, which is why understanding these properties matters so much for platform engineering.

Feb 17, 2026

Feb 10, 2026

The Async Decoupling Pattern for Scalable Batch Processing

Feb 10, 2026

Batch processing architecture has a clean pattern that scales elegantly: decouple batch systems asynchronously from everything else. When you get this right, your real-time system stays stable regardless of batch volume, and you never need elaborate job scheduling to avoid infrastructure strain.

Feb 10, 2026

Feb 3, 2026

Batch and Real-Time Platforms Have Different Jobs

Feb 3, 2026

When designing data platforms, I frequently encounter teams trying to build one unified system that handles both real-time streaming and batch analytics. The instinct makes sense: both workloads operate on the same underlying data, so why not share infrastructure?

Getting this architecture right has real consequences.

The challenge is that these workloads have fundamentally different characteristics. Supporting both well on a single platform is expensive and complex. In most cases, you get better results by separating them early and letting each system lean into its strengths.

Feb 3, 2026

Jan 28, 2026

Making Interviews Objective with AI (Without Making Them Worse)

Jan 28, 2026

Everyone has opinions about candidates. That's the problem.

We're supposed to ask standard questions, evaluate people against the job description, and test whether they can do the work. Instead, we dig into areas where we think they're weak, ask different questions for each person, and end up testing our biases instead of their abilities.

Jan 28, 2026

Jan 20, 2026

The Software That Shouldn't Exist

Jan 20, 2026

Everyone's worried about AI replacing engineers. The more interesting question is what happens when the cost of building software drops so dramatically that entirely new categories of software become viable.

The industry is calling this "personalized software." Custom tools built for a specific person, a specific context, a specific moment. Software that never leaves your machine. Software that would never justify a product. Software that, until recently, simply wouldn't exist.

Jan 20, 2026

Jan 6, 2026

Shifting Left - How Small Teams Handle Organizational Gaps Without Breaking

Jan 6, 2026

Every small organization has gaps. Maybe you have an engineering lead but no dedicated DevOps team. Maybe your product manager is stretched thin and the tech lead is absorbing PM responsibilities. Maybe a designer role is emerging, but nobody owns it yet.

These gaps often emerge in specific domains. Growing organizations typically need four types of engineering leadership, and early-stage teams almost never have all of them covered. This is normal. The question is: how do you respond?

Teams may make the mistake of dumping the entire burden on one person. They identify the gap, find whoever is closest to it, and expect that person to absorb all the additional work. This breaks people.

There's a better approach I call "shifting left."

Jan 6, 2026

Dec 30, 2025

Working in the Mud - The Mental Model That Keeps Engineering Teams Moving

Dec 30, 2025

Every engineering blog paints a picture of clean microservices, continuous deployment, and comprehensive observability. I've been in this industry for over a decade, and I've never experienced this ideal state across the board. I've seen glimmers. Teams that nail one dimension. But never everything at once.

That gap between the ideal and reality is what I call working in the mud.

Dec 30, 2025

leadershipengineering

Brian Conn https://connsulting.io

Stop Fighting the Wrong Battles - The Three-Level Problem Framework

The Problem with "Everything is Important"

The Three-Level Problem Framework

Level 1: Individual Component Failures (Fix Immediately)

Level 2: Integration Breakdowns (Fix Urgently)

Level 3: User Experience Issues (Fix Strategically)

Why This Order Matters

The Business Impact Override

Teams Can Work Multiple Levels Simultaneously

How to Apply This Today

Start Using This Right Now

Related Content

Connsulting

About

Offerings

Stop Fighting the Wrong Battles - The Three-Level Problem Framework

The Problem with "Everything is Important"

The Three-Level Problem Framework

Level 1: Individual Component Failures (Fix Immediately)

Level 2: Integration Breakdowns (Fix Urgently)

Level 3: User Experience Issues (Fix Strategically)

Why This Order Matters

The Business Impact Override

Teams Can Work Multiple Levels Simultaneously

How to Apply This Today

Start Using This Right Now

Related Content

AI-Assisted Development Changes What Matters in Framework Selection

Tactical Work Shedding - How to Plan for the Plan to Fail

Connsulting

About

Offerings