Software Architecture & Technical Leadership

AI Writes the Code.
You Still Make the Call.

Frameworks for the decisions AI can't make for you — architecture, hiring, and technical debt — from an architect who's led teams through 15 years of tech evolution.

Browse All Articles Learning Pathways

Just launched

I built Zsper — an AI writing partner that remembers how you think.

Read the story

123+Articles Published

15+Years Building

1K+Interviews

9Pathways

Featured

software architecture

LLM Architecture in Production: RAG, Vector Databases, and the 7-Point System-Design Checklist

Adding an LLM to your product is a distributed-systems problem with a non-deterministic dependency, not a single API call. When RAG actually helps (and when a prompt will do), how to think about vector databases and chunking without cargo-culting, the retrieval pipeline that separates demos from products, and the seven-point production checklist — evals, guardrails, cost ceilings, latency budgets, fallbacks, observability, and a human-in-the-loop boundary — to put in place before a real user touches it.

Jun 2, 2026·15 min read

Read article

A tool I built · Live at zsper.com

Zsper

An AI writing partner with a memory of how you think.

AI made writing fast and made everyone sound the same. Zsper learns your thinking as you write, then selects from it — your stances, your stories, your way of making an argument — before it drafts a line. Same philosophy as everything on this site: the machine drafts, you still make the call.

Continuity of thought

Every article is the next chapter of the same mind.

Zsper keeps a persistent, inspectable memory of your stances, stories, and framing — built as a by-product of writing. You never maintain a knowledge base; the writing is the input.

Not RAG — an intelligence layer

It decides what to say, and why, before it writes.

Deterministic scoring picks the records to use — your opinion, a prior piece, a story, what to avoid — and the LLM only phrases the brief. Every draft ships with the provenance of what it drew from.

You stay the author

Nothing speaks in your name until you say so.

No auto-publish. New stances and retired beliefs wait for your review. One brain drafts an article, a newsletter, and a LinkedIn post — all native to their format, all in your voice.

Try Zsper free Read why I built it

Software Architecture

Scalable systems from first principles. Architecture patterns, code quality, and the craft of building software that lasts.

Explore 39 articles

I Over-Engineered a SaaS for Millions. It Got 3 Users.

I built a SaaS with multi-tenancy, event-driven architecture, and elaborate domain abstractions — for millions of users that never arrived. The product now serves two or three internal people in the same building. This is the architecture post-mortem, and the operating patterns that would have changed the outcome.

Jun 25, 2026·12 min read

Technical Leadership

Scale teams from chaos to discipline. Insights from 1,000+ interviews and leading 25+ engineers.

Explore 31 articles

The Conversation-Based Interview: How I Evaluate 7-8 Year Experienced Engineers

After 1,000+ technical interviews, I've learned traditional coding tests don't reveal what matters. 6 out of 10 candidates tell me my interview felt like a professional discussion, not an interrogation. Here's the scenario-based framework: career journey evaluation, technical depth through projects, collaborative problem-solving, SOLID principles integration, leadership assessment, and feedback delivery that turns interviews into conversations.

Nov 22, 2025·22 min read

Developer Productivity

Multiply your output with AI tools, workflow automation, deep work strategies, and measurement frameworks.

Explore 24 articles

AI-Driven Development: The Spec-First Workflow That Makes Agents Actually Useful

Vibe coding — prompt, accept, repeat — produces fast demos and slow disasters. The senior move is spec-first development: invest in a precise specification, let agents implement against it with MCP for real context, and gate everything behind tests, types, and human review of intent. The four-phase loop, why the spec becomes the asset when code is cheap, where autonomous PRs actually fit, and the failure modes (context rot, confident wrongness, review debt) that bite teams who skip the discipline.

May 30, 2026·14 min read

Career & Life Design

Design your career and life with intent. From IC to CTO, work-life balance, and sustainable engineering.

Explore 18 articles

Energy Management for Engineers: Your Calendar Isn't the Real Bottleneck

Perfectly scheduled week but exhausted by Wednesday? You don't have a time problem—you have an energy allocation problem. Learn your energy profile (track patterns for 2 weeks), match work types to energy levels (deep work at peaks, admin at dips), fix energy leaks (context switching, notifications, emotional tension), treat recovery as a design constraint, and lead teams with energy awareness.

Nov 18, 2025·14 min read

Quality & Craft

Lessons from watchmaking, luxury design, and long-term thinking applied to software craftsmanship.

Explore 11 articles

The Craft of Software: A Philosophy of Quality That Ships

'Quality' in software has been hollowed into a slogan. Real craft isn't gold-plating, slowness, or the enemy of shipping — it's building things that hold up and stay maintainable while shipping. The pillar that ties together how I think about quality and craft: where craftsmanship comes from (watchmaking, luxury, long-term thinking), why good engineers cut corners, what 'finished' means, why fewer tools beat more, and how AI changes but doesn't erase the need for taste.

May 30, 2026·13 min read

Learning Pathways

Structured journeys, not just articles.

View all →

System Design Mastery

From first principles to production-grade architecture.

8 articles·1h 49min·Beginner to Advanced

The Pragmatic Tech Lead

From senior engineer to engineering leader without losing your soul.

7 articles·1h 26min·Intermediate to Advanced

AI-Augmented Developer

Use AI tools to multiply your output without losing craftsmanship.

6 articles·1h 9min·Beginner to Intermediate

Editorial Picks

Hand-picked reads to start with.

View all →

Scaling to Millions of Users: A Real-World Architecture Teardown

An anonymized teardown of a consumer platform I scaled to several million users. The architecture that carried ~30K req/s at peak, the four walls we hit on the way up — database connections, a cache stampede that caused a 19-minute outage, payment double-charges, and a credential-stuffing attack that looked like organic growth — and the trade-offs behind each fix. Topology, layered caching, the data tier, WAF and rate-limiting stack, and four real ADRs. No vendor named; the engineering is exactly as it happened.

Jun 20, 2026·24 min readRead now

AI Engineering Team Structure: The Generation–Review Ratio

AI moved the engineering bottleneck from writing code to reviewing it — and most org charts haven't caught up. The Generation–Review Ratio, why cutting junior hiring is a five-year trap, the four roles every AI-native team needs, and how to rewrite hiring and leveling for 2026.

Jun 1, 2026·14 min readRead now

SpecLoom: Deterministic Context for Coding Agents

Most agent SDLC setups use the LLM as the runtime for everything—including deciding which files to read—which is the biggest source of token waste and non-determinism. SpecLoom flips this: write your spec as typed blocks with IDs and dependencies, and a deterministic compiler emits a minimal, hash-stamped bundle for one task. A real engineer bundle compiles to ~370 tokens instead of 20–60k, the same task always produces a byte-identical bundle, and @spec:ID#hash anchors turn spec/code drift into a CI failure. Covers the .loom format, the Deterministic Context Compiler, tiered budget degradation, the drift gate, engine-enforced persona gates, and a 60-second loop to try it.

Jun 11, 2026·19 min readRead now

Software Architecture Patterns: A Reference Catalog with Diagrams, Failure Modes, and Code

A practical reference catalog of the eight architectures worth knowing — layered, modular monolith, hexagonal, event-driven, CQRS + event sourcing, microservices, serverless, and the strangler fig. Each with a diagram, the forces that make it the right call, the failure mode that makes it the wrong one, and a link to runnable reference code. Plus a decision flowchart so you pick on fit, not hype.

Jun 3, 2026·18 min readRead now

Your First 90 Days as CTO: A Practical Playbook for Startup and Scaleup Leaders

It's day three as CTO. Production is on fire, your team is demoralized, and the CEO wants a roadmap by Friday. Your job isn't to fix everything—it's to build a clear picture, earn trust, and set a direction. Here's your survival framework.

Nov 14, 2025·18 min readRead now

Custom Copilot Agents: How I Automated 12 Hours of Architecture Work Per Week

Senior engineers waste hours typing the same Copilot prompts repeatedly. GitHub Copilot Agents (.agent.md files) let you encode expertise once, reuse forever. Built 4 production agents that coordinate: reduced article writing 12 hours → 90 minutes. Learn Agent Maturity Model, 3-Gate Validation Framework, Agent Design Canvas, and orchestrator patterns. Real .agent.md files, metrics from 6 months production use.

Feb 19, 2026·20 min readRead now

“True expertise isn't measured by years of experience — it's measured by the depth of problems you've solved and the quality of solutions you've crafted. Whether architecting software or curating a watch collection, the principles remain the same: attention to detail, understanding of purpose, and respect for craftsmanship.”

123+ articles published

Architecture, leadership, and craft —
written for engineers who build.

Field-tested frameworks for the decisions AI can't make for you. No fluff — just what holds up in production.

Start Reading Explore Pathways

AI Writes the Code.You Still Make the Call.