If you have played with ChatGPT-4, it's pretty clear that it is aligned (humans have roughly chosen its values), especially compared to reports of the original raw model before RLHF, or to less sophisticated alignment attempts in the same model family (i.e. Bing). Now it's possible, of course, that it's all deception, but this seems somewhat unlikely.