Reading this post, here’s my new galaxy-brain take on Altman:
If we’re going to solve AI alignment, Altman is making sure we first have to solve Altman alignment. If we can’t even get hyper-optimizing human Altman aligned, we have no hope of aligning AI.
While a fun quip, it bears little relation to reality. There are technologically possible ways of aligning AIs that have no analogue for aligning humans, and there are approaches to aligning humans that are not available for aligning AIs. Solving one problem therefore doesn't necessarily have anything to do with solving the other: you could be able to align an AGI without being able to align a flesh-and-blood human, and you could develop the ability to robustly align humans yet end up not one step closer to aligning an AGI.
I mean, I can't say this whole debacle doesn't have a funny allegorical meaning in the context of AI alignment and OpenAI's chances of achieving it. But it's a funny allegory, not an exact correspondence.