My new research direction for an “end-to-end” alignment scheme.
See also this clarifying comment.
I’m posting this in the Open Thread because, for technical reasons the shortforms don’t appear in the feed on the main page of alignmentforum, so I am a little worried people missed it entirely (I discussed it with Oliver).
Thoughts about understanding how game theory combines with learning theory.
The incomplete models formalism solves a large chunk of decision theory.
My new research direction for an “end-to-end” alignment scheme.
See also this clarifying comment.
I’m posting this in the Open Thread because, for technical reasons the shortforms don’t appear in the feed on the main page of alignmentforum, so I am a little worried people missed it entirely (I discussed it with Oliver).
Thoughts about understanding how game theory combines with learning theory.
The incomplete models formalism solves a large chunk of decision theory.