I agree. I didn’t mean to imply that I thought this step would be easy, and I would also be interested in more concrete ways of doing it. It’s possible that creating a hereditarily restricted optimizer along the lines I was suggesting could end up being approximately as difficult as creating an aligned general-purpose optimizer, but I intuitively don’t expect this to be the case.