
Garrett Baker

Karma: 4,283

Independent alignment researcher

I have signed no contracts or agreements whose existence I cannot mention.

Value drift threat models

Garrett Baker
May 12, 2023, 11:03 PM
27 points
4 comments
5 min read

[Question] What constraints does deep learning place on alignment plans?

Garrett Baker
May 3, 2023, 8:40 PM
9 points
0 comments
1 min read

Pessimistic Shard Theory

Garrett Baker
Jan 25, 2023, 12:59 AM
72 points
13 comments
3 min read

Performing an SVD on a time-series matrix of gradient updates on an MNIST network produces 92.5 singular values

Garrett Baker
Dec 21, 2022, 12:44 AM
9 points
10 comments
5 min read

Don’t design agents which exploit adversarial inputs

Nov 18, 2022, 1:48 AM
72 points
64 comments
12 min read

A framework and open questions for game theoretic shard modeling

Garrett Baker
Oct 21, 2022, 9:40 PM
11 points
4 comments
4 min read

Taking the parameters which seem to matter and rotating them until they don’t

Garrett Baker
Aug 26, 2022, 6:26 PM
120 points
48 comments
1 min read

How (not) to choose a research project

Aug 9, 2022, 12:26 AM
79 points
11 comments
7 min read

Information theoretic model analysis may not lend much insight, but we may have been doing them wrong!

Garrett Baker
Jul 24, 2022, 12:42 AM
7 points
0 comments
10 min read

Modelling Deception

Garrett Baker
Jul 18, 2022, 9:21 PM
15 points
0 comments
7 min read

Another argument that you will let the AI out of the box

Garrett Baker
Apr 19, 2022, 9:54 PM
8 points
16 comments
2 min read

[cross-post with EA Forum] The EA Forum Podcast is up and running

Garrett Baker
Jul 5, 2021, 9:52 PM
3 points
0 comments
1 min read

[Question] Information on time-complexity prior?

Garrett Baker
Jan 8, 2021, 6:09 AM
6 points
2 comments
1 min read

D0TheMath’s Shortform

Garrett Baker
Oct 9, 2020, 2:47 AM
1 point
226 comments
1 min read

[Question] Why does “deep abstraction” lose it’s usefulness in the far past and future?

Garrett Baker
Jul 9, 2020, 7:12 AM
1 point
1 comment
1 min read