What is Operant Conditioning?
B.F. Skinner's operant conditioning: behavior followed by a positive consequence (reward) is more likely to repeat; behavior followed by a negative consequence (punishment or removal of reward) is less likely. Marketers use operant conditioning continuously: loyalty points reward repeat purchase, surge pricing punishes off-peak shifting, push notifications variable-rate-reward attention. The most powerful schedule is variable ratio — rewards delivered unpredictably after a varying number of behaviors — which creates the same compulsive behavior as a slot machine. Most app-store gambling, social-media feeds, and loot-box video games exploit it.
How Operant Conditioning actually works
The framework breaks down into the following moving parts. Knowing what each piece is — and what it is not — is what separates a B-grade answer from an A-grade answer in a written assignment.
- Positive reinforcement — add a reward after the desired behavior (loyalty points, free shipping)
- Negative reinforcement — remove a discomfort (skip ads with subscription)
- Punishment — add discomfort after undesired behavior (surge price, cancellation fee)
- Extinction — remove reward to discourage behavior
- Schedules — fixed ratio, variable ratio (most addictive), fixed interval, variable interval
A worked example: Starbucks Rewards
Starbucks Rewards is a textbook operant conditioning system. Every purchase yields stars (positive reinforcement). Star bonuses on certain days create variable-ratio rewards. Personalized challenges ("buy three lunch sandwiches this week, earn 100 stars") use fixed-ratio. Free birthday drinks anchor a holiday. The behavioral data feeds back into more targeted rewards. The system raised average spend per active member by an estimated 30–40% and is regularly cited as one of the most effective loyalty programs in retail.
Don't lose marks for these
- Using continuous reward schedules — they extinguish quickly when stopped
- Confusing operant with classical conditioning
- Relying on punishment without offering reinforcement alternatives
How to use this on the exam
Score-maximizing moves
- Cite Skinner by name
- Identify the reward schedule (especially variable-ratio for engagement)
- Distinguish positive/negative reinforcement from punishment
When to use Operant Conditioning (and when not to)
Use Operant Conditioning when your assignment asks you to analyze, structure, or recommend — and when you have at least two data points to populate every cell of the framework. Skip it when the question is asking for a numerical answer or a single recommendation, since Operant Conditioning is a structuring tool, not a calculator.