The Stranger Test

Question 1

A fresh agent session starts. What does it know about your project?

Accepted Answer

Born knowing — It’s briefed on arrival — conventions, architecture, the landmines — from context that’s maintained and loaded every session.

Question 2

An agent learns something hard-won on Tuesday. Where is that lesson on Wednesday?

Accepted Answer

In shared memory — Written back to a memory every agent reads. Tuesday’s lesson is Wednesday’s briefing.

Question 3

What can your agents actually touch?

Accepted Answer

Scoped to the role — Each agent gets what its niche needs: the reviewer can’t write, the writer can’t deploy.

Question 4

Risky agent output — what checks it before it lands?

Accepted Answer

Systematic gates — Deterministic checks plus a human on the risky paths. Evidence, not vibes.

Question 5

Could you replay exactly what an agent did last Tuesday?

Accepted Answer

Step by step — Every tool call and decision is on a record I can actually replay.

Question 6

An agent is mid-task and drifting. What can you do about it?

Accepted Answer

Steer or take the wheel — Redirect it with a sentence, or take over live — without killing the run.

Question 7

How is your fleet of agents organized?

Accepted Answer

Specialists with niches — Defined roles, each shipped with the tools and permissions its job requires.

Question 8

An agent fails. How do you find out why?

Accepted Answer

From the record — Trace the run, find the decision that went sideways, fix the cause.

How do you treat
your AI agents?