RSS

porby

Karma: 1,863

Soft Prompts for Eval­u­a­tion: Mea­sur­ing Con­di­tional Dis­tance of Capabilities

porby2 Feb 2024 5:49 UTC
47 points
1 comment4 min readLW link
(1drv.ms)

FAQ: What the heck is goal ag­nos­ti­cism?

porby8 Oct 2023 19:11 UTC
66 points
36 comments28 min readLW link

A plea for more fund­ing short­fall transparency

porby7 Aug 2023 21:33 UTC
73 points
4 comments2 min readLW link

Us­ing pre­dic­tors in cor­rigible systems

porby19 Jul 2023 22:29 UTC
19 points
6 comments27 min readLW link

One path to co­her­ence: con­di­tion­al­iza­tion

porby29 Jun 2023 1:08 UTC
28 points
4 comments4 min readLW link

One im­ple­men­ta­tion of reg­u­la­tory GPU restrictions

porby4 Jun 2023 20:34 UTC
42 points
6 comments5 min readLW link

porby’s Shortform

porby24 May 2023 21:34 UTC
6 points
20 comments1 min readLW link

Im­plied “util­ities” of simu­la­tors are broad, dense, and shallow

porby1 Mar 2023 3:23 UTC
45 points
7 comments3 min readLW link

In­stru­men­tal­ity makes agents agenty

porby21 Feb 2023 4:28 UTC
20 points
4 comments6 min readLW link

[Question] How would you use video gamey tech to help with AI safety?

porby9 Feb 2023 0:20 UTC
9 points
5 comments1 min readLW link

Against Boltz­mann mesaoptimizers

porby30 Jan 2023 2:55 UTC
76 points
6 comments4 min readLW link

FFMI Gains: A List of Vitalities

porby12 Jan 2023 4:48 UTC
26 points
1 comment7 min readLW link

Si­mu­la­tors, con­straints, and goal ag­nos­ti­cism: por­bynotes vol. 1

porby23 Nov 2022 4:22 UTC
37 points
2 comments35 min readLW link

Am I se­cretly ex­cited for AI get­ting weird?

porby29 Oct 2022 22:16 UTC
116 points
4 comments4 min readLW link

Why I think strong gen­eral AI is com­ing soon

porby28 Sep 2022 5:40 UTC
335 points
141 comments34 min readLW link1 review

Pri­vate al­ign­ment re­search shar­ing and coordination

porby4 Sep 2022 0:01 UTC
62 points
13 comments5 min readLW link