Vladimir_Nesov comments on Chantiel’s Shortform

Vladimir_Nesov 17 Sep 2021 19:51 UTC
2 points

Is that universal AI drive wrong, then?

Damage to AI’s implementation makes the abstractions of its design leak. If somehow without the damage it was clear that a certain part of it describes goals, with the damage it’s no longer clear. If without the damage, the AI was a consequentialist agent, with the damage it may behave in non-agentic ways. By repairing the damage, the AI may recover its design and restore a part that clearly describes its goals, which might or might not coincide with the goals before the damage took place.