This feels like a key detail that’s lacking from this post. I actually downvoted this post because I have no idea if I should be excited about this development or not. I’m pretty familiar with Stuart’s work over the years, so I’m fairly surprised if there’s something big here.
Might help if I put this another way. I’d be purely +1 on this project if it was just “hey, I think I’ve got some good ideas AND I have an idea about why it’s valuable to operationalize them as a business, so I’m going to do that”. Sounds great. However, the bit about “AND I think I know how to build aligned AI for real this time guys and the answer is [a thing folks have been disagreeing about whether or not it works for years]” makes me −1 unless there’s some explanation of how it’s different this time.
Sorry if this is a bit harsh. I don’t want to be too down on this project, but I feel like a core chunk of the post is that there’s some exciting development that leads Stuart to think something new is possible but then doesn’t really tell us what that something new is, and I feel that by the standards of LW/AF that’s good reason to complain and ask for more info.
Firstly, because the problem feels central to AI alignment, in the way that other approaches didn’t. So making progress in this is making general AI alignment progress; there won’t be such a “one error detected and all the work is useless” problem. Secondly, we’ve had success generating somekeyconcepts, implying the problem is ripe for further progress.
Can you describe what changed / what made you start feeling that the problem is solvable / what your new attack is, in short?
This feels like a key detail that’s lacking from this post. I actually downvoted this post because I have no idea if I should be excited about this development or not. I’m pretty familiar with Stuart’s work over the years, so I’m fairly surprised if there’s something big here.
Might help if I put this another way. I’d be purely +1 on this project if it was just “hey, I think I’ve got some good ideas AND I have an idea about why it’s valuable to operationalize them as a business, so I’m going to do that”. Sounds great. However, the bit about “AND I think I know how to build aligned AI for real this time guys and the answer is [a thing folks have been disagreeing about whether or not it works for years]” makes me −1 unless there’s some explanation of how it’s different this time.
Sorry if this is a bit harsh. I don’t want to be too down on this project, but I feel like a core chunk of the post is that there’s some exciting development that leads Stuart to think something new is possible but then doesn’t really tell us what that something new is, and I feel that by the standards of LW/AF that’s good reason to complain and ask for more info.
Firstly, because the problem feels central to AI alignment, in the way that other approaches didn’t. So making progress in this is making general AI alignment progress; there won’t be such a “one error detected and all the work is useless” problem. Secondly, we’ve had success generating some key concepts, implying the problem is ripe for further progress.