Logan Riggs comments on AGI in sight: our look at the game board

Logan Riggs 19 Feb 2023 23:16 UTC
LW: 4 AF: 3
2
AF

Monitoring of increasingly advanced systems does not trivially work, since much of the cognition of advanced systems, and many of their dangerous properties, will be externalized the more they interact with the world.

Externalized reasoning being a flaw in monitoring makes a lot of sense, and I haven’t actually heard of it before. I feel that should be a whole post on itself.