Seems like you need to run a Bayesian model where the AI has a prior distribution over the value of exiert and carries out some exploration/​exploitation tradeoff, as in bandit problems.
Seems like you need to run a Bayesian model where the AI has a prior distribution over the value of exiert and carries out some exploration/​exploitation tradeoff, as in bandit problems.