Linda Linsefors comments on Linda Linsefors’s Shortform

Linda Linsefors 2 Feb 2023 10:22 UTC
1 point
Yes, that is a thing you can do with decision transforms too. I was referring to variant of the decision transformer (see link in original short form) where the AI samples the reward it’s aiming for.