Ben Smith comments on Clippy, the friendly paperclipper

Ben Smith 2 Mar 2023 5:00 UTC
1 point
0
I’ve been writing about multi-objective RL and trying to figure out a way that an RL agent could optimize for a non-linear sum of objectives in a way that avoids strongly negative outcomes on any particular objective.
https://www.lesswrong.com/posts/i5dLfi6m6FCexReK9/a-brief-review-of-the-reasons-multi-objective-rl-could-be
- Seth Herd 3 Mar 2023 19:50 UTC
  1 point
  0
  Parent
  Thank you! This is addressing the question I was trying to get at. I’ll check it out.