Kerrigan comments on Decision theory does not imply that we get to have nice things

Kerrigan 19 Sep 2024 6:18 UTC
LW: 1 AF: 1
0
AF
“Similarly, it’s possible for LDT agents to acquiesce to your threats if you’re stupid enough to carry them out even though they won’t work. In particular, the AI will do this if nothing else the AI could ever plausibly meet would thereby be incentivized to lobotomize themselves and cover the traces in order to exploit the AI.
But in real life, other trading partners would lobotomize themselves and hide the traces if it lets them take a bunch of the AI’s lunch money. And so in real life, the LDT agent does not give you any lunch money, for all that you claim to be insensitive to the fact that your threats don’t work.”

Can someone please why trading partners would lobotomize themselves?
- quetzal_rainbow 19 Sep 2024 8:31 UTC
  2 points
  0
  Parent
  Let’s suppose that you give in to threats if your opponent is not capable to predict that you do not give in to threats, so they carry the threat anyway. Therefore, other opponents are incentivised to pretend very hard to be such opponent, up to “literally turn themselves into sort of opponent that carries on useless threats”.