Noosphere89 answers Why should we expect AIs to coordinate well?

Noosphere89 17 Feb 2023 2:34 UTC
1 point
The basic answer is that acausal decision theories make AI collusion and cooperation easier.

The most important aspect of acausal/logical decision theories is that they can cooperate even in scenarios designed to force non-cooperation, like the Prisoner’s Dilemma, at least when the agents involved reason in sufficiently similar ways.
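To make the Prisoner’s Dilemma point concrete, here is a minimal sketch of the "twin" version of the game, where an agent plays against an exact copy of itself. The payoff numbers and the function names `causal_choice` and `acausal_choice` are my own illustrative assumptions, not taken from any particular formalization, and real logical decision theories are a lot more subtle than this.

```python
# Illustrative payoffs for one player: (my_move, their_move) -> my payoff.
# The specific numbers are assumed; only their ordering matters.
PAYOFF = {
    ("C", "C"): 3,  # both cooperate
    ("C", "D"): 0,  # I cooperate, they defect
    ("D", "C"): 5,  # I defect, they cooperate
    ("D", "D"): 1,  # both defect
}
MOVES = ("C", "D")


def causal_choice(opponent_move_guess):
    """Hypothetical causal-style agent: treats the opponent's move as
    fixed and picks the best response to it."""
    return max(MOVES, key=lambda m: PAYOFF[(m, opponent_move_guess)])


def acausal_choice():
    """Hypothetical acausal/logical-style agent: assumes the opponent runs
    the same decision procedure, so both moves come out identical."""
    return max(MOVES, key=lambda m: PAYOFF[(m, m)])


# Defection is the best response to either fixed opponent move...
causal_moves = {guess: causal_choice(guess) for guess in MOVES}
# ...but an agent that treats its twin's move as linked to its own cooperates.
twin_move = acausal_choice()

print("causal agent:", causal_moves)
print("acausal agent vs. its twin:", twin_move)
```

The causal-style agent defects whatever it guesses the other player will do, while the agent that treats its twin's choice as tied to its own ends up at mutual cooperation, which pays more than mutual defection under these assumed payoffs.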

And there’s already evidence that RLHF pushes models toward more acausal decision theories.