You wouldn’t really need reward modelling for narrow optimizers. Weak general real-world optimizers I find difficult to imagine, and I’d expect them to be continuous with strong ones; the projects to make weak ones wouldn’t be easily distinguishable from the projects to make strong ones.
Oh, are you thinking of applying it to, say, simulation training?
Cool then.
Are you aware that prepotence is the default for strong optimizers, though?
What about mediocre optimizers? Are they not worth fooling with?