On the AI safety side, I feel like there’s been an enormous amount of progress. Most notable for me was Stuart Armstrong’s post “Humans can be assigned any values whatsoever”.
There has been significant work on utility functions, but it’s less incremental progress than the correction of a mistake.