Hastings comments on Half-baked AI Safety ideas thread

Hastings 23 Jul 2022 1:49 UTC
2 points
To steelman, I’d guess this idea applies in the hypothetical where GPT-N gains general intelligence and agency (such as via a mesa-optimizer) just by predicting the next token.