On this pageOpen

Agentic engineering, not vibe coding

You can't ship faster than you understand.

A field guide for solo devs and small teams shipping code they own. The agent runs inside a harness: rules, tools, checks. This is the orientation; the next three chapters make it a working loop: Discovery, Implementation, Maintenance.

96% of developers don't fully trust AI-generated code.

Only 48% always verify it before committing.

Sonar, State of Code 2026 (n=1,149)

Start with Discovery

The spectrum

AI work runs on a spectrum, not a switch.

The line isn't whether you use AI.

It's how outputs get verified.

Vibe coding

Casual prompts
"Seems to work?" is the check
Often ships unread
Best for: prototypes, scripts, hackathons

AI-assisted

Detailed prompts + constraints
Manual tests, spot-checks
Review the critical paths
Best for: features in known code

Agentic eng

Specs + architecture docs
Tests, CI/CD gates, quality evals
Review the whole design
Best for: production, code you'll maintain

Pick by the stakes, not by habit.

A config tweak or one-off? Skip the agent, just type it.

Weekend prototype? Vibe it. Payments API? Engineer it.

This playbook lives at that agentic end.

The economics

Speed is the wrong metric. Cost of ownership is the real one.

Cheap to start, expensive to own. Or pay upfront, and compound down.

Illustrative. The crossover is the argument: pay upfront, compound down.

AI multiplies output. Unharnessed, it outruns your ability to review:

+81%

duplicated code

−74%

refactoring: code rots untouched

+243%

incidents per PR

+54%

bugs

GitClear & Faros, 2026: risk scales with output, measured across AI broadly.

In plain terms: more code ships, but it's copy-pasted, breaks more, and goes unmaintained. Speed borrowed against next year.

A harness keeps each change small and reviewable. That's how you bend the curve.

What that buys you: fewer incidents, changes small enough to audit, code your next hire can maintain.

The harness

Agent = Model + Harness

Model: the reasoning. Claude, Codex, Gemini.

Harness: the scaffolding that keeps it bounded and reviewable.

A raw model isn't an agent.

The harness is what lets it finish something you can own.

Prompt, paste, pray maps to Spec, run, verify
Big unread diffs maps to Small reviewable tasks
"Seems to work" maps to Tests and evidence

Six parts: instructions, tools, environment, control flow, guardrails, and observability (logs and traces). Unpacked in Implementation.

Treat agent failures as harness failures first. It's the part you control.

Two modes

Two modes, moved between by task: Conductor, you drive live; Orchestrator, you delegate, then review.

Orchestrating lets you review outcomes, not keystrokes. But you still can't merge faster than you understand.

The three phases

Modes are how you work the agent. Phases are where the work is: planned, shipped, maintained.

Shape it, build it, improve it. Each cycle feeds the next.

Discovery

Conductor mode

Shape the work. You steer; the agent thinks with you.

Implementation

Orchestrator + review mode

Run one task in the harness. Review the output.

Maintenance

Orchestrator-led mode

Improve the next cycle. What shipped teaches Discovery.

Bottom line

Vibe coding versus agentic engineering comes down to comprehension plus a harness.

Code you cannot maintain, debug, or defend is vibe-coded.

The model writes the code. Your judgment owns the task and the review.
The harness keeps the work bounded and reviewable.

Start with Discovery

Sources

Sonar: State of Code Developer Survey (2026)
n=1,149 pro devs: 96% don't fully trust AI code, only 48% always verify it. The verification gap a harness closes.
Google: The New SDLC With Vibe Coding (2026)
The paper this playbook adapts, and where the industry converges.
Karpathy: Vibe coding (Feb 2025)
The post that named the casual end of the spectrum.
Faros: AI Engineering Report 2026
AI adoption's measured bill: incidents/PR +243%, bugs +54%.
Google DORA: 2025 State of AI-Assisted Software Development
AI amplifies your conditions; it won't fix broken engineering systems.
GitClear: The Maintainability Gap (2026)
Duplicated code +81%, refactoring −74%: AI's ownership tax.
METR: AI Developer Productivity (2026 update)
AI's task-level speedup is small and contested. Exactly why speed is the wrong metric.

1 The spectrum

Vibe coding

AI-assisted

Agentic eng

2 The economics

3 The harness

4 Two modes

Conductor

Orchestrator

5 The three phases