A reinforcement schedule is a rule that
specifies the timing and frequency of reinforcers. In his early experiments on operant conditioning, Skinner rewarded animals
with food every time they made the desired response—a schedule known as continuous reinforcement. Skinner soon tried
rewarding only some instances of the desired response and not others—a schedule known as partial reinforcement. To his
surprise, he found that animals showed entirely different behavior patterns.
Skinner and other psychologists found
that partial reinforcement schedules are often more effective at strengthening behavior than continuous reinforcement schedules,
for two reasons. First, they usually produce more responding, at a faster rate. Second, a behavior learned through a partial
reinforcement schedule has greater resistance to extinction—if the rewards for the behavior are discontinued, the behavior
will persist for a longer period of time before stopping. One reason extinction is slower after partial reinforcement is that
the learner has become accustomed to making responses without receiving a reinforcer each time. There are four main types
of partial reinforcement schedules: fixed-ratio, variable-ratio, fixed-interval, and variable-interval. Each produces a distinctly
different pattern of behavior.
On a fixed-ratio schedule, individuals
receive a reinforcer each time they make a fixed number of responses. For example, a factory worker may earn a certain amount
of money for every 100 items assembled. This type of schedule usually produces a stop-and-go pattern of responding: The individual
works steadily until receiving one reinforcer, then takes a break, then works steadily until receiving another reinforcer,
and so on.
On a variable-ratio schedule, individuals
must also make a number of responses before receiving a reinforcer, but the number is variable and unpredictable. Slot machines,
roulette wheels, and other forms of gambling are examples of variable-ratio schedules. Behaviors reinforced on these schedules
tend to occur at a rapid, steady rate, with few pauses. Thus, many people will drop coins into a slot machine over and over
again on the chance of winning the jackpot, which serves as the reinforcer.
On a fixed-interval schedule, individuals
receive reinforcement for their response only after a fixed amount of time elapses. For example, in a laboratory experiment
with a fixed-interval one-minute schedule, at least one minute must elapse between deliveries of the reinforcer. Any responses
that occur before one minute has passed have no effect. On these schedules, animals usually do not respond at the beginning
of the interval, but they respond faster and faster as the time for reinforcement approaches. Fixed-interval schedules rarely
occur outside the laboratory, but one close approximation is the clock-watching behavior of students during a class. Students
watch the clock only occasionally at the start of a class period, but they watch more and more as the end of the period gets closer.
Variable-interval schedules also require
the passage of time before providing reinforcement, but the amount of time is variable and unpredictable. Behavior on these
schedules tends to be steady, but slower than on ratio schedules. For example, a person trying to call someone whose phone
line is busy may redial every few minutes until the call gets through.
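The four schedules can be thought of as rules that decide, after each response, whether a reinforcer is delivered. The following sketch is purely illustrative (the function names and signatures are not from any standard library): each function returns a rule that takes the running response count and the elapsed time and reports whether that response is reinforced.

```python
import random

def fixed_ratio(n):
    # Reinforce every n-th response (e.g. pay for every 100 items assembled).
    return lambda responses, elapsed: responses % n == 0

def variable_ratio(n, rng):
    # Reinforce each response with probability 1/n, so the number of
    # responses required averages n but is unpredictable (slot machine).
    return lambda responses, elapsed: rng.random() < 1.0 / n

def fixed_interval(t):
    # Reinforce the first response made after t time units have elapsed
    # since the previous reinforcer; earlier responses have no effect.
    last = [0.0]
    def rule(responses, elapsed):
        if elapsed - last[0] >= t:
            last[0] = elapsed
            return True
        return False
    return rule

def variable_interval(mean_t, rng):
    # Like fixed_interval, but the required wait varies unpredictably
    # around mean_t (a busy phone line that clears at a random time).
    last = [0.0]
    wait = [rng.uniform(0.0, 2.0 * mean_t)]
    def rule(responses, elapsed):
        if elapsed - last[0] >= wait[0]:
            last[0] = elapsed
            wait[0] = rng.uniform(0.0, 2.0 * mean_t)
            return True
        return False
    return rule
```

For example, `fixed_ratio(100)` reinforces responses 100, 200, 300, and so on, while `fixed_interval(60.0)` ignores every response until a full minute has passed since the last reinforcer, matching the stop-and-go and scalloped patterns described above.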