That’s true and false. Intermittent reinforcement does get a more robust effect than continuous reinforcement, but randomly intermittent reinforcement isn’t as effective as raising the reward threshold as the behavior becomes more common, e.g., rewarding only the nicest 10% of things.
I want to design a reinforcement schedule in one of our apps. Can anyone link me to some specific guidelines on how to optimise this?
(Exactly what percentage of successes should I reinforce: 30%? 26%? 8%? Should I reinforce performances in the top 10% of past performances, or the top 12%, or the top 8%? And how does time factor in: if the user hasn’t used the app for a week, should I push a reinforcer forward?)
I can’t, but if you find anything concise and useful, I’d love to hear about it myself.
My rule of thumb is to set the threshold so as to reinforce the top 20% or so of performances, and arrange performance frequencies so I’m reinforcing 2-3 times/minute during active training periods. But that’s not based on anything.
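For an app, that top-20% rule of thumb could be sketched roughly as below. This is only an illustration in Python: the class name `PercentileReinforcer`, the rolling window of 50 scores, and the choice to reinforce everything until five scores have been recorded are all assumptions, not tested recommendations.

```python
from collections import deque


class PercentileReinforcer:
    """Reinforce a performance only if it ranks in the top fraction of recent ones."""

    def __init__(self, top_fraction=0.2, window=50, min_history=5):
        self.top_fraction = top_fraction      # 0.2 = reinforce roughly the top 20%
        self.min_history = min_history        # assumed warm-up length
        self.history = deque(maxlen=window)   # only the most recent scores are kept

    def threshold(self):
        """Score a new performance must reach, or None while history is too short."""
        if len(self.history) < self.min_history:
            return None
        ranked = sorted(self.history, reverse=True)
        # Index of the lowest score that still falls inside the top fraction.
        cutoff = max(0, int(len(ranked) * self.top_fraction) - 1)
        return ranked[cutoff]

    def should_reinforce(self, score):
        """Judge the score against past performances, then record it."""
        limit = self.threshold()
        self.history.append(score)
        # Assumption: reinforce everything until enough history exists.
        return limit is None or score >= limit
```

Because the threshold is recomputed from recent scores, it rises on its own as performances improve, so the bar moves up as the behavior becomes more common.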
I’ll also note that reinforcing higher-tier performances more strongly works really well (though it is hard to do consistently by hand), as do very intermittent “jackpots” (disproportionate and unpredictable mega-rewards).
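Software can do the tiering consistently where a human trainer can’t. A rough Python sketch, assuming the app already knows each performance’s percentile among past performances; the function name `reward_size`, the tier boundaries, the point values, and the 2% jackpot chance are all made-up placeholders, and tying jackpots to performances that already qualify is likewise an assumption:

```python
import random


def reward_size(percentile, rng=random, jackpot_chance=0.02):
    """Return reward points for a performance at `percentile` (0.0 to 1.0).

    All tier boundaries and point values here are placeholder assumptions.
    """
    if percentile < 0.80:
        return 0    # below the top-20% threshold: no reinforcer
    if rng.random() < jackpot_chance:
        return 50   # rare, unpredictable, disproportionate jackpot
    if percentile >= 0.95:
        return 5    # higher tier: stronger reinforcer
    return 1        # ordinary reinforcer
```

Passing a seeded `random.Random` instance as `rng` makes the jackpot behavior reproducible when testing.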