By repetition penalty do you mean an explicit logit bias when sampling or internally it’s generalized to avoiding repeated tokens?
By repetition penalty do you mean an explicit logit bias when sampling or internally it’s generalized to avoiding repeated tokens?