Glad to hear it :) I was talking about DL not RL, although I’d also claim that the latter is unreasonably effective. Basically, we throw compute at neural nets, and they solve problems! We don’t need to even know how to solve them ourselves! We don’t even know what the nets are doing internally! I think this efficacy is as entirely magical as the one in the original paper I was referencing.
This is really good and I found it very useful for what I’m currently working on.
One note: it felt a bit disconnected. And I didn’t get the impression that RL is “unreasonably effective.”
Glad to hear it :) I was talking about DL not RL, although I’d also claim that the latter is unreasonably effective. Basically, we throw compute at neural nets, and they solve problems! We don’t need to even know how to solve them ourselves! We don’t even know what the nets are doing internally! I think this efficacy is as entirely magical as the one in the original paper I was referencing.