Very many things wrong with all of that:
RL algorithms don’t minimize costs; they maximize expected reward, which may well be unbounded, so it’s wrong to say that the ML field only minimizes cost.
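To make that concrete, here is a minimal sketch of “maximize expected reward” in code, assuming a toy 2-armed bandit with Gaussian (hence unbounded above) rewards and a REINFORCE-style update; the setup and all numbers are illustrative, not anything from LeCun’s proposals:

```python
import numpy as np

# Toy 2-armed bandit: REINFORCE-style gradient ASCENT on expected reward.
# Rewards are Gaussian draws, so the objective is not bounded above.
rng = np.random.default_rng(0)
theta = np.zeros(2)                # policy logits
arm_means = np.array([1.0, 3.0])   # arm 1 pays more on average (made-up numbers)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

for _ in range(2000):
    probs = softmax(theta)
    a = rng.choice(2, p=probs)
    r = rng.normal(arm_means[a], 1.0)   # unbounded reward signal
    grad_logp = -probs
    grad_logp[a] += 1.0                 # gradient of log pi(a | theta) for a softmax policy
    theta += 0.05 * r * grad_logp       # ascend the expected-reward objective, not descend a cost

print(softmax(theta))   # probability mass concentrates on the higher-mean arm
```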
LLMs minimize the expected negative log probability of the correct token, which is indeed bounded below by zero, but achieving zero in that case means perfectly predicting every single token on the internet.
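Concretely, the next-token loss is the average negative log probability the model assigns to the correct token; a tiny sketch, where the per-position probabilities are made-up illustrative values:

```python
import numpy as np

# Average negative log probability of the correct token at each position.
# It is >= 0, and it is exactly 0 only if the model assigns probability 1
# to the correct token at every position.
p_correct = np.array([0.9, 0.4, 0.99, 1.0])   # made-up per-position probabilities
print(-np.log(p_correct).mean())              # > 0
print(-np.log(np.ones(4)).mean())             # exactly 0 only with perfect prediction
```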
The boundedness of the thing you’re minimizing is totally irrelevant, since minimizing f(x) is exactly the same as minimizing g(f(x)) where g is a strictly increasing function. You can trivially turn a bounded objective into an unbounded one without changing the solution set at all.
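A quick numeric sketch of that point, using a made-up one-dimensional loss: f is bounded, its logit is unbounded above, and both are minimized at the same x:

```python
import numpy as np

# f(x) = sigmoid((x - 2)^2) is bounded in [0.5, 1); g(f) = logit(f) = (x - 2)^2
# is unbounded above. g is strictly increasing, so the minimizer is unchanged.
def f(x):
    return 1.0 / (1.0 + np.exp(-(x - 2.0) ** 2))

def g_of_f(x):
    fx = f(x)
    return np.log(fx / (1.0 - fx))

xs = np.linspace(0.0, 4.0, 4001)
print(xs[np.argmin(f(xs))], xs[np.argmin(g_of_f(xs))])   # both ≈ 2.0
```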
Even if utility is bounded between 0 and 1, an agent maximizing expected utility will still never stop, because it can always further decrease the probability that it was wrong: quadruple-check every single step and turn the universe into computronium to make sure no errors were made.
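A toy illustration with made-up numbers: if utility is 1 on success and 0 on failure, expected utility is just P(success), and every additional verification pass still buys a strictly positive (if shrinking) amount of it:

```python
# Utility is 1 if the plan succeeds, 0 otherwise, so expected utility = P(success) <= 1.
# Assume (purely for illustration) each extra check cuts the residual error
# probability by a factor of 10: the marginal gain shrinks but never hits zero,
# so a pure expected-utility maximizer has no point at which checking stops paying.
p_error = 0.01
for checks in range(1, 6):
    p_error *= 0.1
    print(f"{checks} checks -> expected utility {1.0 - p_error:.10f}")
```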
This is very dumb; LeCun should know better, and I’m sure he *would* know better if he spent five minutes thinking about any of this.
Yann LeCun’s proposals are based on cost-minimization.
Do you expect LeCun to have been assuming that the entire field of RL stops existing in order to focus on his specific vision?
I’m not sure he has coherent expectations, but I’d expect his vibe is some combination of “RL doesn’t currently work” and “fields generally implement safety standards”.