That’s known as a VR 4 schedule (variable-ratio 4) because the behavior is rewarded, on average, once every four times the correct response is given. Variable schedules maximize what is known as resistance to extinction; the probability a behavior will decrease in frequency goes down. Continuous schedules are best for establishing a new behavior. I would expect they use continuous reinforcement whenever a new skill is being learned in the game.
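For anyone who wants to see the difference between the two schedules concretely, here is a minimal sketch in Python. It is only an illustration under stated assumptions: the function names are made up, and drawing the required number of responses uniformly from 1 to 7 (so the mean is 4) is just one simple way to model "an average of every four times", not how any particular game does it.

```python
import random


def variable_ratio_rewards(n_responses, mean_ratio=4, seed=0):
    """Simulate a variable-ratio schedule (hypothetical helper).

    Returns a list of booleans, True where a correct response is rewarded.
    Assumption: the number of responses required for each reward is drawn
    uniformly from 1..(2 * mean_ratio - 1), which averages to mean_ratio.
    """
    rng = random.Random(seed)
    rewarded = []
    remaining = rng.randint(1, 2 * mean_ratio - 1)
    for _ in range(n_responses):
        remaining -= 1
        if remaining == 0:
            # Requirement met: reward this response and draw a new requirement.
            rewarded.append(True)
            remaining = rng.randint(1, 2 * mean_ratio - 1)
        else:
            rewarded.append(False)
    return rewarded


def continuous_rewards(n_responses):
    """Continuous reinforcement: every correct response is rewarded."""
    return [True] * n_responses


vr4 = variable_ratio_rewards(10_000)
crf = continuous_rewards(10_000)

# Average number of responses per reward under each schedule.
print("VR 4:", len(vr4) / sum(vr4))
print("Continuous:", len(crf) / sum(crf))
```

Under the VR schedule the learner cannot tell which response will pay off next, while under the continuous schedule a missing reward is noticed immediately, which is one common way of explaining the difference in resistance to extinction.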
Upvote for content, but I think there’s a typo in your second sentence:
Variable schedules maximize what is known as resistance to extinction, the probability a behavior will decrease in frequency goes down.
Perhaps a semicolon instead of a comma, or “as frequency of rewards … ” instead of “in frequency …”, was intended?
Fixed