RSS

Robert_AIZI

Karma: 1,386

Rat­ing my AI Predictions

Robert_AIZIDec 21, 2023, 2:07 PM
22 points
5 comments2 min readLW link
(aizi.substack.com)

Com­par­ing An­thropic’s Dic­tionary Learn­ing to Ours

Robert_AIZIOct 7, 2023, 11:30 PM
137 points
8 comments4 min readLW link

Sparse Au­toen­coders Find Highly In­ter­pretable Direc­tions in Lan­guage Models

Sep 21, 2023, 3:30 PM
159 points
8 comments5 min readLW link

Un­safe AI as Dy­nam­i­cal Systems

Robert_AIZIJul 14, 2023, 3:31 PM
11 points
0 comments3 min readLW link
(aizi.substack.com)

AIs teams will prob­a­bly be more su­per­in­tel­li­gent than in­di­vi­d­ual AIs

Robert_AIZIJul 4, 2023, 2:06 PM
3 points
1 comment2 min readLW link
(aizi.substack.com)

[Re­search Up­date] Sparse Au­toen­coder fea­tures are bimodal

Robert_AIZIJun 22, 2023, 1:15 PM
24 points
1 comment5 min readLW link
(aizi.substack.com)

Ex­plain­ing “Tak­ing fea­tures out of su­per­po­si­tion with sparse au­toen­coders”

Robert_AIZIJun 16, 2023, 1:59 PM
10 points
0 comments8 min readLW link
(aizi.substack.com)