+1
I think I’ve already mentioned these papers to you, but in case I haven’t, here are two to add to the one Nisan suggested:
Verifiable Reinforcement Learning via Policy Extraction
Towards Mixed Optimization for Reinforcement Learning with Program Synthesis