RSS

Tomáš Gavenčiak

Karma: 180

A researcher in CS theory, AI safety and other stuff.

How can In­ter­pretabil­ity help Align­ment?

May 23, 2020, 4:16 PM
37 points
3 comments9 min readLW link

What is In­ter­pretabil­ity?

Mar 17, 2020, 8:23 PM
39 points
1 comment11 min readLW link