So, Robutil is trying to optimize the utility of individual actions, while Humo is trying to optimize the utility of the overall policy?