In mathematical terms, what separates agents that could arise from natural selection from a generic agent?
To ask a more concrete question, suppose we consider the framework of DeepMind’s Population Based Training (PBT), chosen just because I happen to be familiar with it (it’s old at this point, not sure what the current thing is in that direction). This method will tend to produce a certain distribution over parametrised agents, different from the distribution you might get by training a single agent in traditional deep RL style. What are the qualitative differences in these inductive biases?
This is an entire field of research: evolutionary psychology. (Translating that into mathematical terms may be challenging, but I’m unclear why you feel it’s necessary?)
In mathematical terms, what separates agents that could arise from natural selection from a generic agent?
To ask a more concrete question, suppose we consider the framework of DeepMind’s Population Based Training (PBT), chosen just because I happen to be familiar with it (it’s old at this point, not sure what the current thing is in that direction). This method will tend to produce a certain distribution over parametrised agents, different from the distribution you might get by training a single agent in traditional deep RL style. What are the qualitative differences in these inductive biases?
This is an entire field of research: evolutionary psychology. (Translating that into mathematical terms may be challenging, but I’m unclear why you feel it’s necessary?)