dr_s comments on OpenAI: Facts from a Weekend

dr_s 21 Nov 2023 7:46 UTC
6 points
0
I mean, this would not be too hard though. It could be achieved by a simple trick of appearing smarter to some people and then dumber at subsequent interactions with others, scaring the safety conscious and then making them look insane for being scared.

I don’t think that’s what’s going on (why would even an AGI model they made be already so cleverly deceptive and driven? I would expect OAI to not be stupid enough to build the most straightforward type of maximizer) but it wouldn’t be particularly hard to think up or do.