Great point. I agree and should have said something like that in the post.
To expand on this a bit more, studying these specialized models will be valuable for improving their robustness and performance. It is possible that this research will be useful for alignment in general, but it’s not the most promising approach. That being said, I want to see alignment researchers working on diverse approaches.
Great point. I agree and should have said something like that in the post.
To expand on this a bit more, studying these specialized models will be valuable for improving their robustness and performance. It is possible that this research will be useful for alignment in general, but it’s not the most promising approach. That being said, I want to see alignment researchers working on diverse approaches.