Yeah, I also see broad similarities between my vision and that of the Meaning Alignment people. I’m not super familiar with the work they’re doing, but I’m pretty positive on the the little bits of it I’ve encountered. I’d say that our main difference is that I’m focusing on ungameable preference synthesis, which I think will be needed to robustly beat Moloch. I’m glad they’re doing what they’re doing, though, and I wouldn’t be shocked if we ended up collaborating at some point.
Yeah, I also see broad similarities between my vision and that of the Meaning Alignment people. I’m not super familiar with the work they’re doing, but I’m pretty positive on the the little bits of it I’ve encountered. I’d say that our main difference is that I’m focusing on ungameable preference synthesis, which I think will be needed to robustly beat Moloch. I’m glad they’re doing what they’re doing, though, and I wouldn’t be shocked if we ended up collaborating at some point.