RHollerith comments on The Shutdown Problem: An AI Engineering Puzzle for Decision Theorists

RHollerith 21 Nov 2023 2:44 UTC
3 points
1
Yudkowsky’s suggestion is for preventing the creation of a dangerous AI by people. Once a superhumanly-capable AI has been created and has had a little time to improve its situation, it is probably too late even for a national government with nuclear weapons to stop it (because the AI will have hidden copies of itself all around the world or taken other measures to protect itself, measures that might astonish all of us).

The OP in contrast is exploring the hope that (before any dangerous AIs are created) a very particular kind of AI can be created that won’t try to prevent people from shutting it down.
- O O 21 Nov 2023 3:20 UTC
  1 point
  0
  Parent
  If a strongly superhuman AI was created sure, but you can probably box a minimally superhuman AI.
  - RHollerith 21 Nov 2023 15:53 UTC
    2 points
    0
    Parent
    It’s hard to control how capable the AI turns out to be. Even the creators of GPT-4 were surprised, for example, that it would be able to score in the 90th percentile on the Bar Exam. (They expected that if they and other AI researchers were allowed to continue their work long enough that eventually one of their models would be able to do, but had no way of telling which model it would be.)
    
    But more to the point: how does boxing have any bearing on this thread? If you want to talk about boxing, why do it in the comments to this particular paper? why do it as a reply to my previous comment?