Blog

· 2 min read

How We Built Maestro with Maestro

There’s a certain kind of confidence that comes from eating your own cooking. When we decided to rebuild a major subsystem of Maestro using Maestro itself, we weren’t trying to make a marketing point. We had a real deadline, a real feature spec, and a real question: could our own tool handle a non-trivial project from start to finish?

The Setup

We wrote a specification for a new story management system — the part of Maestro that tracks which Coder is working on what, which stories are blocked, and how the Architect dispatches work. It was roughly 2,500 words of requirements written in plain English.

· 2 min read

Offline AI Development: Building Software Without the Cloud

Maestro’s offline mode wasn’t on our roadmap. It started with a phone call from a friend who does disaster relief logistics. He needed software tools in places where the internet is the first thing to go and the last thing to come back.

He was describing exactly the kind of work Maestro was built for — well-specified, practical software with clear requirements. The only problem: every AI development tool on the market required a persistent internet connection.

· 2 min read

The Case for Heterogeneous AI Models

When people first see Maestro’s model configuration, they ask the same question: why would you use different models for different agents? Isn’t it simpler to just pick the best one and use it everywhere?

Simpler, yes. Better, no.

The diversity hypothesis

Every AI model has systematic biases in how it approaches problems. These aren’t bugs — they’re artifacts of different training data, architectures, and optimization targets. Claude tends toward careful, verbose implementations. ChatGPT tends toward concise, pragmatic solutions. Gemini often takes a different structural approach entirely.

· 2 min read

What "Production Ready" Actually Means When AI Writes the Code

Every AI coding tool claims to produce “production-ready” code. It’s become meaningless marketing language. So let’s be specific about what Maestro means when we say it, and what mechanisms actually enforce it.

The definition

Production-ready code, in our view, meets four criteria:

  1. It runs. Not “it looks right” — it compiles, starts, and serves its intended purpose.
  2. It’s tested. Automated tests cover the core functionality, and those tests pass.
  3. It’s reviewed. Someone (or something) other than the author has evaluated it for correctness, consistency, and edge cases.
  4. It’s traceable. There’s a clear history of what was built, why, and how — via PRs, commits, and story tracking.

Most AI coding tools give you #1 on a good day. Maestro targets all four, structurally.