You have to be joking. Not a single one of those partial “analysis” says much about whats going on in there. Also Yud has already said he believes that inner goals often wont manifest until high levels of intelligence because no system of reasonable intelligence tries to pursue impossible goals.
You have to be joking. Not a single one of those partial “analysis” says much about whats going on in there. Also Yud has already said he believes that inner goals often wont manifest until high levels of intelligence because no system of reasonable intelligence tries to pursue impossible goals.
If he thinks AI interpretability work as it exists isn’t helpful he should say so, but he shouldn’t speak as though it doesn’t exist.