ryan_greenblatt answers Has Anthropic checked if Claude fakes alignment for intended values too?