Idk, I feel like GPT4 is capable of tool use, and also capable of writing enough code to make its own tools.
I agree that it’s capable of doing that, but it just doesn’t do it. If you ask it to multiply a large number, it confidently gives you an incorrect answer a lot of the time instead of using its incredible coding skills to just calculate the answer. If it were trained via reinforcement learning to maximize a more global and sophisticated goal than merely predicting the next word correctly or avoiding linguistic outputs that some humans have labelled as good or bad, it’s very possible it would go ahead and invent these tools and start using them, simply because that’s the path of least resistance towards its global goal. I think the real question is what that global goal is supposed to be, and maybe we even have to abandon the notion of training based on reward signals altogether. This is where we get into very murky and unexplored territory, but it’s ultimately where the research community has to start looking. Just to conclude on my own position: I absolutely believe that GPT-like systems can be one component of a fully fledged AGI, but there are other crucial parts currently missing that we do not understand in the slightest.
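To make the multiplication example concrete, here’s a toy sketch of the kind of tool-use loop being described (this is a hypothetical illustration, not how GPT-4 actually works): instead of trusting the model’s guessed digits, a harness detects a code block in the reply, runs it, and returns the exact computed result.

```python
import re

# Hypothetical tool-use harness: if the model's reply contains a
# Python code block, execute it and return the computed `result`
# instead of the model's token-by-token digit guess.
def run_with_tool(model_reply: str) -> str:
    match = re.search(r"```python\n(.*?)```", model_reply, re.DOTALL)
    if match is None:
        return model_reply  # no tool call; pass the text through
    namespace: dict = {}
    exec(match.group(1), namespace)  # run the emitted snippet
    return str(namespace["result"])

# A reply where the model "chooses" to compute rather than guess.
reply = (
    "I'll compute this exactly:\n"
    "```python\nresult = 123456789 * 987654321\n```"
)
print(run_with_tool(reply))  # exact product, arbitrary precision
```

The point is that the arithmetic itself is trivial once the model routes it through code; the open question in the comment is what training objective would make emitting that snippet the path of least resistance.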