aiagency.nz
Back to blog
Engineering 17 February 2026 9 min read

Evals or it didn't happen

Vibes-based AI development is dead. The eval rigs we built before shipping anything to a paying user.

This post is a placeholder while we get the editorial pipeline online. The shape and rhythm of the page is right; the words are coming. In the meantime, here's a sketch of what this piece will cover.

The premise

Vibes-based AI development is dead. The eval rigs we built before shipping anything to a paying user. The full piece will walk through the specific decisions, the trade-offs we considered, the ones we got wrong, and what we'd do differently with the benefit of three months on the other side.

What we're publishing

Field notes from production builds. No abstractions, no recycled threads. If we ship something we'll write about why it worked, why it broke, and what we'd test if we were starting fresh tomorrow.

  • Honest write-ups of features that flopped
  • Architecture diagrams from systems handling real load
  • Concrete numbers, accuracy, latency, dollars saved, hours back
  • Eval rigs and prompts we'd actually use ourselves

Stay close

Subscribe to the dispatch below if you want the full post when it lands. One email a month, no recaps, no churn.

Read next