Designing Monitoring Tools for the Job to Be Done
Successful monitoring platforms rest on a fundamental principle that many teams overlook: the format of a page should be determined by who you expect to be there and what job they need to accomplish.
This requires purpose-built interfaces, not configuration layers. Different users come to your monitoring platform with completely different needs, and your page design should reflect those differences from the ground up.
Three Types of Users, Three Different Needs
The Executive: Is Everything Okay?
When an executive logs in, they're asking one question: "Is everything okay?" They need low data density with visual status indicators. Big green checkmarks when things are healthy, obvious red flags when something needs attention.
If things aren't okay, they need actionable context, not action buttons. They're delegating to their team, not taking action themselves. The executive's entry point should answer their core question in seconds with clear paths to delegate when needed.
The Analyst: Show Me the Details
Practitioners and analysts need depth. They consume detailed information to understand patterns and investigate trends. Their pages require higher data density with tabular data, detailed graphs, and reporting packages.
These users need compact styling and drill-down capabilities. Clickable tables with accordion views, flyouts, and linked details let them explore data without constantly navigating to new pages.
The Power User: Let Me Do My Job Quickly
Engineers and operators come to the platform to take action: reviewing events, approving requests, investigating incidents, or making system changes. They need maximum efficiency for repetitive actions. Bulk operations, keyboard shortcuts, inline editing. If they need to review 50 events, don't make them click through 50 individual pages.
Their interfaces should be action-oriented. Every table row should have contextual actions built in: approve, deny, escalate, investigate. They often arrive via alerts or notifications, landing directly on what needs attention, so the page should provide immediate context and available actions.
The Core Insight: Nobody Wants to Be There
People don't want to hang out in your monitoring platform. They're not there to be entertained. They have a job to be done, and they want to complete it and move on with their day.
This is fundamentally different from B2C products, where engagement metrics and time on site are success indicators. For monitoring tools, the opposite is true.
The real value is the automated 24/7 coverage. Machines talking to your platform, constantly monitoring. That's the sticky value. That's what customers pay for.
If someone never logs into your platform because they never have issues, but they know you're monitoring everything 24/7, that's a happy customer. They have confidence. They can focus on other work.
If someone has to log in frequently because issues keep arising, and your interface makes it difficult for them to diagnose, resolve, or address those issues efficiently, that's a frustrated customer. You're adding friction to their crisis.
Designing for the Job, Not the Feature
When building monitoring tools, the temptation is to build one interface and add filters or settings to customize the view. But that approach forces every user to configure their experience before they can do their job.
Design different entry points and page formats based on the expected user and their job:
Executive dashboard. Daily health report with clear status, automatic routing to their inbox, high-level summaries that answer "Is everything okay?" in one glance.
Analyst workspace. Detailed reporting pages with rich visualizations, comparison tools, and drill-down capabilities for investigation.
Operator interface. Action-oriented pages optimized for speed, with bulk operations, keyboard navigation, and inline actions that eliminate unnecessary clicks.
The page format isn't determined by personal preference. It's determined by the job to be done.
The Insurance Analogy
Monitoring platforms function like insurance. The best outcome is never needing to file a claim because nothing goes wrong, but you sleep better knowing you're covered.
A customer who sees a big green checkmark every morning and never investigates anything is experiencing the ideal state: confidence without burden. A customer who constantly investigates issues is having a negative experience, not because issues exist (that's sometimes unavoidable), but because your platform should make resolution as efficient as possible.
Design your monitoring tool to minimize the time users spend in your platform while maximizing their confidence that everything is being watched. That's the real measure of success for operational tooling.
This is all assuming your monitoring platform is working as designed. If it's not, maybe you should investigate your false positive alerts.