I thought Probabilistic Models of Cognition was great (and it seems criminally underappreciated); it looks like a good step in this direction.
Perhaps in the future, one could demonstrate that “This environment with these actors will fail in these ways” by empirically showing that reinforcement learning agents optimizing in those setups reliably reach specific outcomes.