The vagueness failure
You have an agent that does not work. You may not realize it does not work, because it feels like it works. It sounds like this: "I will be more organized." Or: "When things get stressful, I will take a step back." Or: "I need to be better about following up with people."
These are not agents. They are wishes wearing the costume of agents. They have the grammatical structure of a rule — a situation paired with an action — but they lack the two properties that make agents actually fire in real conditions: specificity and testability.
An agent that is not specific cannot activate reliably because you cannot recognize the trigger when it arrives. An agent that is not testable cannot improve because you have no way to determine whether it executed or failed. You are left with a vague intention that occasionally produces action when motivation happens to be high and never produces action when motivation is low — which is precisely when you need the agent most.
This lesson is about why specificity and testability are not optional refinements to your agents but structural requirements for their operation, why four decades of research converge on this conclusion, and how the same principle governs every system — cognitive, scientific, and computational — that must produce reliable outputs from defined inputs.
Gollwitzer's meta-analysis: the if-then revolution
In 2006, Peter Gollwitzer and Paschal Sheeran published a meta-analysis across 94 independent studies involving more than 8,000 participants. They measured the effect of a single intervention: converting a goal intention ("I intend to do X") into an implementation intention ("When situation Y arises, I will do behavior Z"). The effect size was d = .65 — a medium-to-large effect — meaning that specifying when, where, and how you will act roughly doubled follow-through compared to holding the same goal without a specific plan (Gollwitzer & Sheeran, 2006).
The mechanism is not motivational. People did not want the goal more after forming an implementation intention. What changed was the cognitive accessibility of the trigger situation. By specifying "when I finish my morning coffee" as the trigger for "I will write for 30 minutes," the mental representation of that moment becomes more accessible. You notice it when it happens. Without the specific link, the same situation passes without activating anything.
Gollwitzer's research identified three failures that specificity corrects. Forgetting to act — the moment passed without you noticing. Missing opportunities — the right situation arose but you did not recognize it. Initial reluctance — you noticed the moment but hesitated. Specific if-then plans solve all three by making the trigger salient, the conditions recognizable, and the action pre-decided.
In one study, women who formed the implementation intention "When I am in the shower on the first day of the month, I will examine my breasts" achieved 100% compliance for breast self-examination, compared to 53% among women who held the same goal without the if-then plan. Same motivation. Double the execution. The only difference was specificity.
Locke and Latham: forty years of goal-setting research
The evidence extends far beyond health behaviors. Edwin Locke and Gary Latham spent four decades studying goal-setting across approximately 400 studies, and their central finding is unambiguous: specific, difficult goals consistently produce higher performance than vague goals, with effect sizes ranging from .42 to .80 (Locke & Latham, 2002).
The critical comparison is not between having a goal and having no goal. It is between having a specific goal and having a vague one. "Do your best" is a goal. It is a terrible one. People told to "do your best" performed significantly worse than people given a specific target — even a demanding one. The reason: "do your best" does not define what success looks like, so you cannot orient your effort, track your progress, or know when you have succeeded or failed.
Locke and Latham identified four mechanisms. Specificity directs attention toward relevant activities. It energizes effort by creating a clear gap between where you are and where you need to be. It increases persistence because you can see the distance closing. And it triggers strategy development — when you know specifically what you are trying to achieve, you can devise specific methods to get there. A vague agent like "be more organized" activates none of these mechanisms. A specific agent like "when I receive an email that requires more than a two-minute response, I will move it to my 'respond' folder and process that folder at 4 PM" activates all four.
The Popper principle: untestable agents are not agents
Karl Popper established that a statement is scientific if and only if it is falsifiable — if some possible observation could prove it wrong (Popper, 1934). "All swans are white" is scientific because observing a black swan falsifies it. "The universe works in mysterious ways" is not scientific because no observation could contradict it.
The same criterion applies to your agents. An agent is operational only if it is testable. "I will be mindful throughout the day" is unfalsifiable — any state of awareness could count as mindfulness, any lapse excused as growth. You cannot fail at it, which means you cannot succeed at it either.
A testable agent specifies what observable outcome it produces and within what timeframe. "When I sit down at my desk, I will write my three most important tasks on a sticky note before opening my laptop" is testable. At the end of the week, count: five mornings, three sticky notes — 60% firing rate. Now you have data. You can ask why it failed Tuesday and Thursday. You can adjust the trigger, simplify the action, iterate on the system.
Testability also requires a review cadence. Without it, you rely on subjective memory, which is catastrophically biased: you remember the times the agent fired (because the action was salient) and forget the times it did not (because nothing happened). The result is a systematically inflated sense of agent reliability. This is the same confirmation bias Popper identified in unfalsifiable theories — your untested agents are confirmed by selective memory, immune to disconfirmation, and therefore unable to improve.
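The sticky-note arithmetic above can be made concrete in a few lines of Python — a minimal sketch with illustrative data, not a prescribed tracking tool. The point is that a logged week yields a firing rate and a list of failure days to review, which selective memory never produces:

```python
# A week of agent logs: True means the agent fired on a day it
# should have. The days and values are illustrative.
week = {
    "Mon": True, "Tue": False, "Wed": True,
    "Thu": False, "Fri": True,
}

fired = sum(week.values())
rate = fired / len(week)                     # 3 of 5 mornings
failures = [day for day, ok in week.items() if not ok]

print(f"Firing rate: {rate:.0%}")            # Firing rate: 60%
print("Review these days:", failures)        # Review these days: ['Tue', 'Thu']
```

The log replaces the biased question "did I mostly do it?" with two answerable ones: what was the rate, and which specific days need diagnosis.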
The software engineering parallel
Software engineering learned this through decades of expensive failures. Ambiguous requirements specifications are one of the primary sources of defects in software systems. When a requirement says "the system should respond quickly," every developer interprets "quickly" differently. One builds for 500 milliseconds. Another for 5 seconds. The defect traces back to a specification that never defined what it meant.
IEEE Standard 830 for software requirements specifications exists to combat this. It requires that each requirement be unambiguous and verifiable — not as a stylistic preference but as a structural necessity. A requirement that cannot be tested cannot be verified.
Your agents are your personal specifications for behavior. "I will communicate better with my team" is an ambiguous requirement — every behavior is compatible with it, every failure rationalizable as partial success. The engineering community learned that ambiguous specifications produce unreliable systems. The same applies to ambiguous personal rules.
The AI parallel: vague prompts produce vague outputs
If you have used a large language model, you already have intuitive evidence for this principle. A 2024 study using the DETAIL framework measured the performance gap between vague and specific prompts across reasoning tasks. On mathematical problems, specific prompts produced 92% accuracy compared to 45% for vague prompts — a 47-percentage-point improvement from specificity alone. On logic puzzles, the gap was 36 points. On code understanding, 29 points (Chen et al., 2024).
The mechanism mirrors what operates in your own cognition. "Write something about leadership" could produce anything from a haiku to a textbook. "Write a 500-word analysis of why first-time managers fail at delegation, with three examples from technology companies" directs attention, eliminates irrelevant outputs, and defines success. The specificity does not limit the model — it enables it.
You are prompting yourself when you design an agent. "Be healthier" is a vague prompt — it could mean anything, so it produces no reliable behavior. "When I order food at a restaurant, I will choose the dish with the most vegetables" is a specific prompt. Your cognitive system knows when it fires, what it evaluates, and what it produces. Specificity is not a constraint on your freedom. It is the mechanism that converts intention into action.
The anatomy of a specific and testable agent
A properly specified agent has four dimensions of specificity, corresponding to the four questions that must have concrete answers:
Trigger specificity: when does it activate? Not "when things get stressful" but "when I notice my jaw clenching or my shoulders rising above their resting position." The trigger must be an observable event you can recognize in real time. Abstract triggers like "when I feel overwhelmed" fail because the feeling is diffuse and gradual — you are already overwhelmed before you notice you are overwhelmed, and the agent never activates.
Condition specificity: what must be true? Not "when it is appropriate" but "when the meeting has more than three participants and I have not spoken in the last ten minutes." Conditions narrow the agent's scope so it fires in the right situations and stays dormant in others. Without conditions, the agent either fires constantly (producing fatigue) or never (producing nothing).
Action specificity: what exactly do you do? Not "take a break" but "stand up, walk to the window, take five breaths while looking at something more than fifty feet away, then return." The action must be concrete enough that someone watching you could verify whether you performed it. If the action description requires interpretation, it is not specific enough.
Success specificity: how do you know it worked? Not "I will feel better" but "I will have completed the five-breath protocol at the window at least three times today, and I will record each instance in my tracker." This is the testability dimension — the observable evidence that distinguishes a fired agent from a dormant one. Without it, you are running unfalsifiable experiments on yourself.
The specificity gradient
Specificity exists on a continuum, and understanding where your agent falls on it helps you diagnose reliability failures:
Level 0 — Aspiration: "I want to read more." This is not an agent. It is a desire. It has no trigger, no condition, no action protocol, and no success criterion.
Level 1 — Intention: "I will read every day." This is closer but still insufficient. When during the day? Under what conditions? How much counts as reading? How will you know if you did it?
Level 2 — Partial specification: "I will read for 20 minutes before bed." Better. You have a trigger (before bed) and a measurable action (20 minutes of reading). But the trigger is imprecise — "before bed" could mean 9 PM or midnight — and there is no condition for edge cases. What if you arrive home at 1 AM? What if you are sick?
Level 3 — Full specification: "When I turn off the living room lights for the night, I will sit in the reading chair and read my current book for 20 minutes by the timer on my phone. Exception: if it is past 11:30 PM, I will read for 10 minutes instead. I will track this in my habit app with a checkmark. If I hit fewer than 5 out of 7 days in a week, I will review the trigger conditions the following Sunday."
Level 3 is where agents become reliable. Not because the specification is long, but because every component is concrete enough to execute without in-the-moment interpretation. You have pre-decided every variable. The only thing left is execution.
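One way to see what "pre-decided every variable" means is to write the Level 3 reading agent down as data. This is a sketch in Python with hypothetical field names — the structure, not the syntax, is the point: every value that Levels 0 through 2 left to in-the-moment interpretation is now an explicit, checkable entry.

```python
# The Level 3 reading agent as a structured record. Field names
# are illustrative, not a standard schema.
reading_agent = {
    "trigger": "living room lights turned off for the night",
    "action": "sit in reading chair, read current book",
    "duration_min": 20,
    "exception": {"after_hour": 23.5, "duration_min": 10},
    "tracking": "checkmark in habit app",
    "review": {"min_days_per_week": 5, "when": "following Sunday"},
}

def duration_for(now_hour: float) -> int:
    """Pre-decided duration: the spec, not the moment, picks the number."""
    exc = reading_agent["exception"]
    return exc["duration_min"] if now_hour >= exc["after_hour"] else reading_agent["duration_min"]

print(duration_for(22.0))    # 20 — normal night
print(duration_for(23.75))   # 10 — past 11:30 PM, exception applies
```

Nothing in the record requires interpretation at bedtime; the only remaining step is execution.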
Why people resist specificity
If specificity is so clearly superior, why do most agents remain vague? Three forces.
First, accountability avoidance. A vague agent cannot fail — "be more organized" is satisfied by any organizational behavior. A specific agent can fail visibly — "process my inbox to zero by 5 PM" either happens or it does not. People keep agents vague to avoid the discomfort of measurable failure.
Second, the flexibility illusion. "Eat healthier" feels like it preserves choice. In practice, it means "eat whatever I feel like and occasionally feel guilty." The specific agent — "replace my afternoon snack with an apple or almonds" — produces more actual behavior change precisely because it removes the decision point where willpower gets consumed.
Third, upfront cost aversion. Designing a specific agent requires thinking through triggers, conditions, actions, and success criteria before it operates. A vague agent costs nothing to create. But that savings is borrowed against future execution: every time the vague agent should fire, you re-incur the full cognitive cost of deciding what to do. The specific agent pays the design cost once and executes cheaply thereafter.
The design protocol
When you build or audit an agent, run this protocol:
- State the agent in one sentence. If you cannot state it concisely, it is doing too many things. Split it.
- Identify the trigger. What observable event activates this agent? If you cannot point to a specific moment, the trigger is too abstract.
- Define the conditions. What must be true for the agent to fire? What exceptions exist? If "it depends," specify what it depends on.
- Describe the action. What exactly do you do? Could someone else execute this from your description alone?
- Set the success criterion. What measurable outcome proves the agent fired? What data will you collect?
- Define the falsification threshold. What failure rate over what time period would tell you the agent is broken? If no failure rate would convince you, the agent is unfalsifiable.
- Schedule the review. When will you examine the data and decide whether to adjust the agent?
This protocol converts vague intentions into operational systems. It is not bureaucratic overhead — it is the minimum structure required for an agent to function as an agent rather than an aspiration.
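The audit steps above can be sketched as a small checker — a hypothetical helper in Python, with assumed field names, that flags which components of an agent specification are missing. It is illustrative, not a definitive schema:

```python
# Components the design protocol requires. Field names are assumptions
# chosen to mirror the protocol's steps.
REQUIRED = ["trigger", "conditions", "action",
            "success_criterion", "falsification_threshold", "review_schedule"]

def audit(agent: dict) -> list[str]:
    """Return the names of missing or empty components."""
    return [field for field in REQUIRED if not agent.get(field)]

vague = {"action": "be more organized"}
specific = {
    "trigger": "email needing more than a 2-minute response arrives",
    "conditions": "during work hours",
    "action": "move it to the 'respond' folder; process that folder at 4 PM",
    "success_criterion": "folder empty by 5 PM, logged daily",
    "falsification_threshold": "misses on more than 2 days in a week",
    "review_schedule": "Friday 4:30 PM",
}

print(audit(vague))      # five components missing
print(audit(specific))   # []
```

A vague agent fails the audit on five of six components; a fully specified one passes, and every field it carries is something you can later measure against.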
Specificity is infrastructure
Four decades of goal-setting research, 94 implementation-intention studies, the scientific method, software engineering, and AI systems all converge: vague specifications produce unreliable outputs. Specificity is not a personality trait. It is a structural requirement for any system that must produce consistent results from defined inputs.
Your agents are your behavioral specifications. Every one that remains vague is a system running on ambiguous requirements — producing inconsistent outputs, resistant to debugging, immune to improvement. Every one you make specific and testable becomes a system you can observe, measure, diagnose, and refine.
The agent that says "I will be better about X" will never improve because it cannot fail. The agent that says "when A, if B, then C, measured by D, reviewed on E" will improve continuously because every failure is visible and every adjustment is testable. Specificity is not the finishing touch on your agents. It is the foundation that makes them agents at all.