Dagon comments on Does a LLM have a utility function?

Dagon 10 Dec 2022 15:43 UTC
3 points
0
And you can’t determine if it’s safe by examining or understanding it’s utility function, if the abstraction is so loose as to not be align-able.
- Dan 15 Jan 2023 15:53 UTC
  2 points
  0
  Parent
  Its not really an abstraction at all in this case, it literally has a utility function. What rates highest on its utility function is returning whatever token is ‘most likely’ given it’s training data.