ctic2421

Karma: 9

ctic2421 Jun 17, 2023, 11:19 PM
1 point
0
in reply to: ryan_greenblatt’s comment on: MetaAI: less is less for alignment.
Curious if you could elaborate more on why MACHIAVELLI isn’t a good test for outer alignment!

ctic2421 Jun 17, 2023, 11:18 PM
1 point
0
in reply to: Cleo Nardo’s comment on: MetaAI: less is less for alignment.
Yep, it’s a language model agent benchmark. It just feeds a scenario and some actions to an autoregressive LM, and asks the model to select an action.