Something that’s been intriguing me: if two agents figure out how to trust that each other's goals are aligned (or at least not opposed), haven’t they essentially solved the alignment problem?
E.g., one agent could then use the same method to bootstrap an aligned AI.