I think you probably need some mechanism for restarting the agent in a randomly different environment/with different open ended goals. Otherwise, I think it will just get permanantly stuck or go in loops.
Not a serious obstacle to make making this happen of course.
Have a loop-detector that shuts it down and restarts upon detection of a loop? It would be interesting to track the metric of ‘how long on average does it take before it gets stuck / in a loop.’ Over the course of years I’d expect to see exciting progress in this metric.
I think you probably need some mechanism for restarting the agent in a randomly different environment/with different open ended goals. Otherwise, I think it will just get permanantly stuck or go in loops.
Not a serious obstacle to make making this happen of course.
Have a loop-detector that shuts it down and restarts upon detection of a loop? It would be interesting to track the metric of ‘how long on average does it take before it gets stuck / in a loop.’ Over the course of years I’d expect to see exciting progress in this metric.