Engineering Decision Lab

Decisions shaped by experience, grounded in real-world systems.

Software Engineering Manager / Director focused on scaling high-performing teams and resilient architectures.

Explore my labs · Download resume · Get in touch

Start here: If you're evaluating me for engineering leadership, start with a Lab — they're real decisions I've navigated, not hypotheticals. Then check the architecture and leadership sections to see how I think about systems, teams, and trade-offs.

AI-powered search

Find what matches your situation.

Describe an engineering challenge in plain text — a migration, a team issue, a reliability problem — and get pointed to the most relevant articles and case studies on this site.

Leadership

Leading through complexity.

Explore leadership

The Summer Rewrite That Never Comes

Legacy systems don't survive because they're good. They survive because they work — which is exactly why no one funds the rewrite. The job isn't to replace everything. It's to modernize in a way that pays off at every stage instead of asking leadership to bet on a multi-year rewrite with no return until the end.

Legacy code that works is the hardest kind to modernize, because 'it works' is a complete argument against spending money on it.

Prioritize by what's costing the business — blocked features, scaling walls, concentration risk, active bottlenecks — not by what offends you to look at.

Observe and adjust from day one. The mandate to make big changes is earned by being useful on small ones, not by standing back and watching.

Philosophy

The things I'd tell you on day one.

Beliefs shaped by real systems, real failures, and real wins.

Three documents describe your system. Only one of them is true.

The functional spec says what the system is supposed to do. The code comments say what the developer thought it did. The source code says what it actually does. All three start drifting apart the moment they're written — the spec ages, the comments rot, the code keeps running regardless.

Architecture

Don't Tell Me You Can't Do It Unless You Can Tell Me Why

If you can explain why something can't be done, you usually understand it well enough to find a path through. The inability to articulate the blocker is often the actual blocker. Spelling out the why gives the rest of the team something to understand and collaborate on.

Mindset

Series

Multi-part deep dives.

Leadership · 4 parts

AI Adoption in Engineering Teams

AI adoption fails when leaders treat it as a tool problem instead of a judgment problem — you can't mandate trust, you can't speed your way past context, and you can't automate away the need for thinking. These four pieces show you how to build adoption that actually improves your system instead of just making output faster. The real work isn't getting engineers to use AI. It's building the conditions where they use it right.

View series

Architecture

Systems thinking, made visible.

Explore architecture

Taming 50 Million Callbacks with Event-Driven Architecture

A legacy .NET HttpHandler buried inside the customer portal was processing webhook callbacks synchronously — and at 20M+ messages a month, vendor retry storms inflated that to 75 million callbacks with 90-second processing latency. We replaced it with an Azure Function that acknowledges in milliseconds and routes to channel-isolated processors via Service Bus, dropping latency to sub-second and eliminating the retry cascade entirely.

Speed over completeness. Acknowledge first, process later.

Isolation over simplicity.

Serverless cost for predictable scale.

Standardization over flexibility.

Independent deploys, coordinated schemas.
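
As a rough illustration of "acknowledge first, process later" with channel isolation, here is a minimal Python sketch. It is not the production design: in-memory queues stand in for Service Bus, a plain function stands in for the Azure Function, and every name in it (handle_callback, process_channel, drain_all, the "channel" and "id" payload fields) is hypothetical.

```python
import queue
import threading
from collections import defaultdict

# Hypothetical stand-in for per-channel Service Bus queues (in-memory only).
channel_queues: dict[str, queue.Queue] = defaultdict(queue.Queue)

# Records (channel, message id) pairs so the effect of processing is observable.
processed: list[tuple[str, str]] = []
processed_lock = threading.Lock()


def handle_callback(payload: dict) -> int:
    """Acknowledge immediately; the real work is deferred to a channel queue."""
    channel = payload.get("channel", "unknown")
    channel_queues[channel].put(payload)
    return 202  # Accepted: the sender gets a fast answer and need not retry.


def process_channel(channel: str) -> None:
    """Drain one channel's queue; a slow channel cannot block the others."""
    q = channel_queues[channel]
    while True:
        try:
            message = q.get_nowait()
        except queue.Empty:
            return
        with processed_lock:
            processed.append((channel, message["id"]))


def drain_all() -> None:
    """Run one worker thread per channel and wait for all of them to finish."""
    workers = [
        threading.Thread(target=process_channel, args=(name,))
        for name in list(channel_queues)
    ]
    for worker in workers:
        worker.start()
    for worker in workers:
        worker.join()
```

Calling handle_callback({"id": "m1", "channel": "sms"}) returns 202 right away, and the message is only handled once drain_all() runs. The point of the split is that the response time seen by the vendor no longer depends on how long processing takes.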

Read article

Featured Lab

Leadership · Security · Featured

Security Vulnerabilities Were Accumulating in Our GraphQL Stack

Active CVEs in a production compliance platform. Audit scheduled. Limited team capacity. Every new feature built on a deprecated foundation.

What's at stake

Active CVEs in a production compliance system handling sensitive data
Compliance audit in 6 weeks — auditors will check dependency versions
Every new feature is built on a deprecated, vulnerable foundation

About

John Tolar (JT)

Full background

I've spent most of my career in systems that don't behave the way we expect them to. Under load, across teams, and in environments where the impact of a decision is real.

The hardest problems I've worked on sit at the intersection of people, systems, and decisions. A 47-minute database outage that triggered SLA penalties. A monolithic worker hitting its scaling ceiling with compliance-critical data flowing through it. A lead developer resisting a platform migration they'd need to own. None of these had clean technical answers — they required understanding the full picture.

Recent

Latest articles

Architecture

Taming 50 Million Callbacks with Event-Driven Architecture

Read article

Architecture

When Compliance Makes the Architectural Decision for You

The cleanest architectural decisions aren't the ones where you evaluate all the options and pick the best one. They're the ones where a constraint eliminates the bad options and forces you to build something more durable than unconstrained choice would have produced. Compliance requirements did that here — and the result was a better design than I would have chosen on my own.

Read article

The difference between mediocre and exceptional engineering leadership isn't philosophy—it's the daily discipline of seeing people clearly, developing them honestly, and protecting both their growth and your organization through documented accountability. Read these pieces in order and you'll understand why hiring for curiosity matters more than coding puzzles, why letting people struggle builds resilience, why claiming people are assets means nothing without real investment, why documentation is power, and how all of it converges into a single operating system: accountable autonomy. Leadership isn't about being liked or being hands-off—it's about setting the standard, trusting people to meet it, and having the clarity to act decisively when they don't.

Latest articles

Taming 50 Million Callbacks with Event-Driven Architecture

When Compliance Makes the Architectural Decision for You

Accountable Autonomy: The Leadership Philosophy That Actually Works

Why I Don't Do Technical Interviews for Senior Engineers

Scaling Decision-Making: How to Stay Collaborative Without Losing Authority

People Are Your Most Valuable Asset — And Most Leaders Don't Actually Believe That

People-First Leadership

Scaling Engineering Leadership

From Busy Flag to Service Boundaries

When Your Codebase Becomes a Knowledge Problem

From Busy Flag to Service Boundaries: Scaling a Monolithic Worker with Strangler Fig

Want someone who can see both the code and the consequences?