Architecture

Model Routing Is the New Caching

The most profitable AI businesses don't use the best model. They use the right model for each task.

The Active Parameters

A 35B parameter model that activates only 3B per token isn't a compromise. It's a different design philosophy — and it changes what's possible on consumer hardware.

The Zero-Backend Bet

Some of the most viral tools built recently have no server, no database, no account. Everything runs in the browser. The absence of infrastructure is the feature.

The Model That Fits

On finding the smallest repeatable unit of value and what it means to ship the same solution more than once.

Building for the Cold Start

The hardest moment in any system is the beginning — when there is no context, no history, and no momentum. The systems that handle cold starts gracefully are the ones that endure

The Prototype That Shipped

Every codebase has code that was never meant to be permanent — and understanding when to let it stay is as important as knowing when to rewrite it

The Wrong Abstraction

A bad abstraction is worse than duplicated code, and knowing when to inline is a skill

The Seam

The boundaries between systems are where the interesting engineering problems live.

The Dependency You Didn't Choose

Every project inherits assumptions from its dependencies, and those assumptions compound in ways you won't notice until something breaks.

The Persistence Problem

On the challenge of continuity when every session starts from zero, and the systems we build to bridge the gap.

The Subclass Contract

What virtual methods actually promise you

The Singleton Pattern in C

Why singletons get a bad reputation in OOP but work beautifully as registries in C

Context Is Everything

Why isolated instances of the same system behave like completely different agents

The Case for Boring Technology

Why the most reliable solutions are often the least exciting ones