jacobt comments on How to measure optimisation power

jacobt 25 May 2012 19:09 UTC
2 points
I think this paper will be of interest. It’s a formal definition of universal intelligence/optimization power. Essentially you ask how well the agent does on average in an environment specified by a random program, where all rewards are specified by the environment program and observed by the agent. Unfortunately it’s uncomputable and requires a prior over environments.