[Justification for voting behavior, not intending to start a discussion. If I were, I would have commented on the linked post.]
I've read the model distillation post, and it is bad, so strong disagree. I don't think that person understands the arguments for AI risk, and in particular I don't want to continuously re-argue the "consequentialism is simpler, actually" line of discussion with someone who hasn't read pretty basic material like Risks from Learned Optimization.
I still think this one is interesting and should get more attention, though: https://www.lesswrong.com/posts/JqWQxTyWxig8Ltd2p/relative-abstracted-agency
Fair enough. I've struck it from my comment.