Question
How do variable rewards affect habit strength?
Quick Answer
Unpredictable rewards create stronger habits than predictable ones.
Example: A runner who finishes every morning jog with the same protein shake has a stable habit. A runner who finishes with a protein shake most days but occasionally finds a handwritten note from her partner on the kitchen counter, or discovers a new playlist queued up, or arrives to a fresh pot of her favorite coffee she did not expect — that runner has a stronger habit. The jog itself has not changed. The cue has not changed. But the reward has become a category with variation: post-run recovery plus an unpredictable bonus. On mornings when the bonus appears, dopamine spikes higher than the baseline reward alone would produce. On mornings when it does not appear, the anticipation of a possible bonus still fires the dopaminergic system — because the brain has learned that any given run might deliver something extra. The unpredictability keeps the reward prediction system engaged in a way that a perfectly reliable protein shake never could.
Try this: Select a positive habit you have been maintaining for at least two weeks with a consistent reward. First, identify the reward category: is it relief, stimulation, competence, connection, or something else? Second, design three variations within that category: one baseline reward (your current consistent one), one upgraded reward (a richer version in the same category), and one surprise reward (something delightful but logistically easy to deliver). Third, create a simple randomization method: roll a die each day, and on a 1 or 2, deliver the upgraded reward; on a 6, deliver the surprise reward; on 3 through 5, deliver the baseline. Run this protocol for two weeks. Track two metrics daily: (1) how strong the urge to perform the habit felt before you started, rated 1 to 5, and (2) how satisfying the completion felt, rated 1 to 5. Compare your average urge score during the variable period against the two weeks of consistent reward that preceded it; if you did not record scores during that period, rate your typical urge from memory before you start so you have a baseline. If the variable schedule is working, the urge score should increase, meaning the habit is pulling harder.
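If you would rather not roll a physical die, the protocol above can be sketched in a few lines of Python. This is only an illustration: the function names (`reward_for_roll`, `todays_reward`, `urge_change`) and the tier labels are made up for this sketch, and it assumes you have urge ratings on the 1 to 5 scale for both the consistent and the variable period.

```python
import random
from statistics import mean


def reward_for_roll(roll: int) -> str:
    """Map a six-sided die roll to a reward tier (hypothetical labels)."""
    if roll not in range(1, 7):
        raise ValueError("roll must be between 1 and 6")
    if roll in (1, 2):
        return "upgraded"   # richer version in the same category
    if roll == 6:
        return "surprise"   # delightful but easy to deliver
    return "baseline"       # rolls 3, 4, and 5


def todays_reward(rng=random) -> str:
    """Roll a virtual die and return today's reward tier."""
    return reward_for_roll(rng.randint(1, 6))


def urge_change(consistent_scores, variable_scores) -> float:
    """Average urge in the variable period minus the consistent period.

    A positive result means the urge was stronger, on average, while
    rewards varied. Both inputs are lists of daily 1-5 ratings.
    """
    return mean(variable_scores) - mean(consistent_scores)


if __name__ == "__main__":
    print("Today's reward:", todays_reward())
```

With this mapping, the baseline reward lands on about half of the days (three faces out of six), the upgraded reward on about a third, and the surprise on about one day in six. Calling `urge_change` with your two lists of daily ratings gives the difference in averages that the exercise asks you to compare.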
Learn more in these lessons