Riccardo Volpato comments on Synthesizing amplification and debate

Riccardo Volpato 10 Jun 2020 13:31 UTC

“you can anneal whatever combination of the different losses you are using to eventually become exclusively imitative amplification, exclusively debate, or anything else in between”

How necessary is annealing for this? Could you use other optimisation procedures instead? Or are you using “annealing” in a more general sense?
- evhub 10 Jun 2020 19:39 UTC
  “Annealing” here simply means decaying over time (as in learning rate annealing), in this case decaying the influence of one of the losses to zero.
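
As a rough illustration of annealing in this sense, the sketch below decays the weight on one loss to zero over training while the other loss takes over. The linear schedule, the function names, and the placeholder arguments `loss_imitation` and `loss_debate` are assumptions made for this example, not details from the post; any decay schedule (exponential, cosine, and so on) would fit the same description.

```python
def annealed_weight(step: int, total_steps: int,
                    start: float = 1.0, end: float = 0.0) -> float:
    """Linearly decay a weight from `start` to `end` over `total_steps`.

    Hypothetical helper: the linear schedule is an assumption of this sketch.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so the weight stays at `end` after training ends.
    progress = min(max(step / total_steps, 0.0), 1.0)
    return start + (end - start) * progress


def combined_loss(loss_imitation: float, loss_debate: float,
                  step: int, total_steps: int) -> float:
    """Mix two losses, annealing the influence of the first one to zero.

    `loss_imitation` and `loss_debate` are placeholder scalars standing in
    for whatever losses are actually being combined.
    """
    w = annealed_weight(step, total_steps)
    return w * loss_imitation + (1.0 - w) * loss_debate
```

With these assumptions, the combined loss equals `loss_imitation` at step 0, an even mix halfway through, and exactly `loss_debate` at the final step. Swapping the roles of the two losses, or stopping the decay at a nonzero `end`, gives the other end points mentioned in the quoted passage.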