RSS

Jeffrey Ladish

Karma: 1,979

Bounty for Ev­i­dence on Some of Pal­isade Re­search’s Beliefs

Sep 23, 2024, 8:01 PM
46 points
4 comments2 min readLW link

Take SCIFs, it’s dan­ger­ous to go alone

May 1, 2024, 8:02 AM
42 points
1 comment3 min readLW link

Pal­isade is hiring Re­search Engineers

Nov 11, 2023, 3:09 AM
23 points
0 comments3 min readLW link

unRLHF—Effi­ciently un­do­ing LLM safeguards

Oct 12, 2023, 7:58 PM
117 points
15 comments20 min readLW link

LoRA Fine-tun­ing Effi­ciently Un­does Safety Train­ing from Llama 2-Chat 70B

Oct 12, 2023, 7:58 PM
151 points
29 comments14 min readLW link

The Agency Overhang

Jeffrey LadishApr 21, 2023, 7:47 AM
85 points
6 comments6 min readLW link

Dona­tion offsets for ChatGPT Plus subscriptions

Jeffrey LadishMar 16, 2023, 11:29 PM
53 points
3 comments3 min readLW link

To de­ter­mine al­ign­ment difficulty, we need to know the ab­solute difficulty of al­ign­ment generalization

Jeffrey LadishMar 14, 2023, 3:52 AM
12 points
3 comments2 min readLW link

Thoughts on the OpenAI al­ign­ment plan: will AI re­search as­sis­tants be net-pos­i­tive for AI ex­is­ten­tial risk?

Jeffrey LadishMar 10, 2023, 8:21 AM
58 points
3 comments9 min readLW link

AGI sys­tems & hu­mans will both need to solve the al­ign­ment problem

Jeffrey LadishFeb 24, 2023, 3:29 AM
59 points
14 comments4 min readLW link

When you plan ac­cord­ing to your AI timelines, should you put more weight on the me­dian fu­ture, or the me­dian fu­ture | even­tual AI al­ign­ment suc­cess? ⚖️

Jeffrey LadishJan 5, 2023, 1:21 AM
25 points
10 comments2 min readLW link

Mar­riage, the Giv­ing What We Can Pledge, and the dam­age caused by vague pub­lic commitments

Jeffrey LadishJul 11, 2022, 7:38 PM
98 points
27 comments6 min readLW link1 review

My vi­sion of a good fu­ture, part I

Jeffrey LadishJul 6, 2022, 1:23 AM
66 points
18 comments9 min readLW link

In­for­ma­tion se­cu­rity con­sid­er­a­tions for AI and the long term future

May 2, 2022, 8:54 PM
76 points
6 comments10 min readLW link

Don’t die with dig­nity; in­stead play to your outs

Jeffrey LadishApr 6, 2022, 7:53 AM
281 points
60 comments5 min readLW link

EA Han­gout Pri­son­ers’ Dilemma

Jeffrey LadishSep 27, 2021, 11:15 PM
55 points
18 comments3 min readLW link

Com­ment on the lab leak hypothesis

Jeffrey LadishJun 11, 2021, 10:49 PM
63 points
14 comments4 min readLW link

Nu­clear war is un­likely to cause hu­man extinction

Jeffrey LadishNov 7, 2020, 5:42 AM
131 points
48 comments11 min readLW link3 reviews

Was SARS-CoV-2 ac­tu­ally pre­sent in March 2019 wastew­a­ter sam­ples?

Jeffrey LadishJul 7, 2020, 11:08 PM
4 points
1 comment2 min readLW link

land­fish lab

Jeffrey LadishFeb 20, 2020, 12:20 AM
5 points
20 commentsLW link