abramdemski comments on Beyond Reinforcement Learning: Predictive Processing and Checksums

abramdemski 16 Feb 2023 22:02 UTC
2 points
0
We now have all the components we need to create a system that robustly pursues abstract goals.
(I note that if this part works out reliably, alignment would essentially be solved.)
- lsusr 17 Feb 2023 2:15 UTC
  2 points
  0
  Parent
  I would be flattered, had your comment be a compliment. ☺
  
  What I meant is that we have a system with a self-correcting world model which solves the “finger pointing at the Moon” problem. It optimizes the world according to its beliefs about the Moon, even though all we could give it was the finger.
  - abramdemski 17 Feb 2023 5:55 UTC
    5 points
    2
    Parent
    To be clear, I don’t necessarily think you’re wrong about how bio brains do it. A lot rests on the word “reliably”. One possible explanation for sexual fetishes is that the human biological mechanism for pointing at sexual partners is quite unreliable (a hypothesis I predict you agree with).
    But if we could get a similar mechanism to work reliably, we’d have a mechanism for pointing learning machines at things in the world.