April 8, 2026 Read on ssp.sh

Specs Over Vibes: Consistent AI Results ft. Mark Freeman

Data EngineeringAI & LLMsTools & ProductsIndustry

Summary

Simon Späti interviews Mark Freeman about his Spec-Driven Development (SDD) workflow for producing consistent, high-quality results with AI coding agents like Claude Code. Mark's approach centers on spending extensive time defining requirements through ExcaliDraw diagrams, JSON schemas, and markdown specs before letting agents build, then assessing outcomes against specs rather than reviewing code directly. The first build is deliberately treated as throwaway — a form of requirements exploration — with learnings fed back into updated specs for subsequent iterations. Mark argues AI agents benefit senior engineers far more than juniors, since experience is needed to make sound architectural decisions and avoid accumulating early legacy code. The interview also covers agent parallelization with tmux and Agent Teams, the role of evals in data contract work, and the addictive 'Claude Code slot machine' dynamic of shipping AI-generated code without learning.

Key Insight

Consistent AI results come not from better prompting but from rigorous upfront specification — spending hours on specs and treating initial builds as disposable explorations produces far better outcomes than iterating on generated code.

Spicy Quotes (click to share)

7
We've all become senior reviewers, more exhausted than before, with less of the work that made this fun in the first place.
5
The key mindset shift here is that the first build is deliberately treated as throwaway. It's requirements exploring via building.
2
Value first, then outcome and then worry about other things.
4
If you lack experience, you basically have no chance of knowing.
7
Claude code slot machine. Getting your dopamine hit beyond usefulness.
7
Shipping lots of code with AI can feel like deep work, but if you're not learning in the process, it's pseudo work.
4
You can't compete on compute, but you can use the factor of time, iterating multiple versions for a specified problem, and choosing the best one.
2
Along with the wider industry, we are figuring out how to use AI safely at scale.

Tone

practical, reflective, cautionary