TAG comments on Is LW making progress?

TAG 25 Aug 2019 7:47 UTC
1 point

On the AI safety side, I feel like there’s been an enormous amount of progress. Most notably for me was Stuart Armstrong’s post: Humans can be assigned any values whatsoever..

There has been significant work on utility functions, but it’s not so much incremental progress and more correction of a mistake.