I think the extrinsic optimization you describe is what I’m pointing toward with the label “coordination failures,” which might properly be labeled “alignment failures arising uniquely through the interactions of multiple actors who, if deployed alone, would be considered aligned.”
Cheers, Remmelt! I’m glad it was useful.
I think the extrinsic optimization you describe is what I’m pointing toward with the label “coordination failures,” which might properly be labeled “alignment failures arising uniquely through the interactions of multiple actors who, if deployed alone, would be considered aligned.”