If that’s value(action) = sum_j( prob(outcome_j GIVEN action) * D(outcome_j) ), what is D? It is not at all obvious to me that there’s a straightforward way to parameterize D for learning that is self-consistent in moral dilemmas.
value(action) = sum_j( prob(outcome_j GIVEN action) * D(outcome_j) )
That should be U—it is the utility function which computes the utility of a future universe.
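For concreteness, here is a minimal sketch of the expected-value formula above, with U standing in for the term the question calls D. Everything beyond the formula itself is a hypothetical illustration: the `outcome_model` function, the toy outcome list, and the hand-written utility table are assumptions, not anything from the original exchange.

```python
# Sketch of value(action) = sum_j P(outcome_j | action) * U(outcome_j).
# All names and numbers below are illustrative placeholders.

def expected_value(action, outcomes, outcome_model, U):
    """Expected utility of an action over a finite set of outcomes."""
    return sum(outcome_model(outcome, action) * U(outcome) for outcome in outcomes)

# Toy setup: two actions, three possible outcomes, a hand-written utility table.
outcomes = ["status_quo", "small_harm", "large_benefit"]
utilities = {"status_quo": 0.0, "small_harm": -1.0, "large_benefit": 5.0}

def outcome_model(outcome, action):
    # Placeholder conditional probabilities P(outcome | action).
    table = {
        "act":     {"status_quo": 0.2, "small_harm": 0.30, "large_benefit": 0.50},
        "abstain": {"status_quo": 0.9, "small_harm": 0.05, "large_benefit": 0.05},
    }
    return table[action][outcome]

def U(outcome):
    # The utility function over future states; this is the piece whose
    # parameterization the question is asking about.
    return utilities[outcome]

for action in ("act", "abstain"):
    print(action, expected_value(action, outcomes, outcome_model, U))
```

The sketch only shows where U sits in the computation; it does not address the harder question of how to parameterize U so that it stays self-consistent in moral dilemmas.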