Strong agree with long-horizon sequential decision-making success being very tied to wantingness.
I kinda want to point at things like the Good and Gooder Regulator theorems here as theoretical reasons to expect this, besides the analogies you give. But I don’t find them entirely satisfactory. I have recently wondered whether there’s something like a Good Regulator theorem for planner-simulators: a Planner Simulator conjecture, something like ‘every (simplest) simulator of a planner contains (something homomorphic to) a planner’. Potentially a stepping-stone toward the agent-like structure problem.

I also have some more specific thoughts about long-horizon tasks and the closed loop of deliberation for R&D-like tasks. But I’ve struggled to articulate these, in part because I flinch when they start to seem too capabilities-laden.
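For what it’s worth, here’s one very rough way the conjecture might be written down, mostly to make the shape of the claim visible; the specific notions of ‘planner’, ‘simulator’, and ‘minimal’ below are placeholders I’m assuming, not anything established:

```latex
% A rough, non-authoritative sketch of how the conjecture might be phrased;
% "planner", "simulator", and "minimal" are placeholder notions assumed here,
% not established definitions.
\documentclass{article}
\usepackage{amsmath, amssymb, amsthm}
\newtheorem{conjecture}{Conjecture}
\begin{document}
\begin{conjecture}[Planner Simulator, informal]
Let $P \colon \mathcal{O} \to \mathcal{A}$ be a planner, i.e.\ a policy obtained by
(approximately) solving $\max_{\pi} \mathbb{E}\,[\,U \mid \pi, o\,]$ over some horizon $T$
for a utility $U$. Let $S$ be a simulator of $P$, meaning $S(o) = P(o)$ for all
observations $o \in \mathcal{O}$, and suppose $S$ is minimal (say, of lowest description
length among such simulators). Then $S$ contains a subcomputation admitting a
homomorphism onto a planning computation: something that represents (a coarse-graining
of) $U$, rolls candidate actions forward, and selects among them.
\end{conjecture}
\end{document}
```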
Any tips?