Would that not be the case with *any* form of deceptive alignment, though? Surely it (deceptive alignment) wouldn’t pose a risk at all if that were the case? Sorry in advance for my stupidity.
Not really, because it takes time to train the cognitive skills necessary for deception.
You might expect this if your AGI was built with a “capabilities module” and a “goal module” and the capabilities were already present before putting in the goal, but it doesn’t seem like AGI is likely to be built this way.