What it is
Learning by consequences.
Why it matters
Reward schedules shape long-term behavior.
When you'll use it
When designing loyalty programs, gamification, and pricing.

What is Operant Conditioning?

B.F. Skinner's operant conditioning: behavior followed by a positive consequence (reward) is more likely to repeat; behavior followed by a negative consequence (punishment or removal of reward) is less likely. Marketers use operant conditioning continuously: loyalty points reward repeat purchase, surge pricing punishes off-peak shifting, push notifications variable-rate-reward attention. The most powerful schedule is variable ratio — rewards delivered unpredictably after a varying number of behaviors — which creates the same compulsive behavior as a slot machine. Most app-store gambling, social-media feeds, and loot-box video games exploit it.

How Operant Conditioning actually works

The framework breaks down into the following moving parts. Knowing what each piece is — and what it is not — is what separates a B-grade answer from an A-grade answer in a written assignment.

  • Positive reinforcement — add a reward after the desired behavior (loyalty points, free shipping)
  • Negative reinforcement — remove a discomfort (skip ads with subscription)
  • Punishment — add discomfort after undesired behavior (surge price, cancellation fee)
  • Extinction — remove reward to discourage behavior
  • Schedules — fixed ratio, variable ratio (most addictive), fixed interval, variable interval

A worked example: Starbucks Rewards

Starbucks Rewards is a textbook operant conditioning system. Every purchase yields stars (positive reinforcement). Star bonuses on certain days create variable-ratio rewards. Personalized challenges ("buy three lunch sandwiches this week, earn 100 stars") use fixed-ratio. Free birthday drinks anchor a holiday. The behavioral data feeds back into more targeted rewards. The system raised average spend per active member by an estimated 30–40% and is regularly cited as one of the most effective loyalty programs in retail.

Common mistakes

Don't lose marks for these

  • Using continuous reward schedules — they extinguish quickly when stopped
  • Confusing operant with classical conditioning
  • Relying on punishment without offering reinforcement alternatives

How to use this on the exam

Exam tips

Score-maximizing moves

  • Cite Skinner by name
  • Identify the reward schedule (especially variable-ratio for engagement)
  • Distinguish positive/negative reinforcement from punishment

When to use Operant Conditioning (and when not to)

Use Operant Conditioning when your assignment asks you to analyze, structure, or recommend — and when you have at least two data points to populate every cell of the framework. Skip it when the question is asking for a numerical answer or a single recommendation, since Operant Conditioning is a structuring tool, not a calculator.

Editor's note Want a deeper walkthrough? Our editors recommend pairing this with The Consumer Decision Process for a worked example you can adapt to your assignment.
learningpsychologyloyalty