I recently gave a talk (slides) on some thoughts about what automating AI safety research might look like.
Some [earlier versions] of the ideas there were developed during my Astra Fellowship Winter ’24 with @evhub and through related conversations in Constellation.
I recently gave a talk (slides) on some thoughts about what automating AI safety research might look like.
Some [earlier versions] of the ideas there were developed during my Astra Fellowship Winter ’24 with @evhub and through related conversations in Constellation.