janus

Karma: 3,357

How LLMs are and are not myopic

janus, Jul 25, 2023, 2:19 AM
134 points
16 comments, 8 min read, LW link

[Simulators seminar sequence] #2 Semiotic physics—revamped

Feb 27, 2023, 12:25 AM
24 points
23 comments, 13 min read, LW link

Cyborgism

Feb 10, 2023, 2:47 PM
336 points
46 comments, 35 min read, LW link, 2 reviews

Anomalous tokens reveal the original identities of Instruct models

Feb 9, 2023, 1:30 AM
139 points
16 comments, 9 min read, LW link
(generative.ink)

Gradient Filtering

Jan 18, 2023, 8:09 PM
55 points
16 comments, 13 min read, LW link

Language Ex Machina

janus, Jan 15, 2023, 9:19 AM
41 points
23 comments, 24 min read, LW link
(generative.ink)

Simulacra are Things

janus, Jan 8, 2023, 11:03 PM
63 points
7 comments, 2 min read, LW link

[Simulators seminar sequence] #1 Background & shared assumptions

Jan 2, 2023, 11:48 PM
50 points
4 comments, 3 min read, LW link

Results from a survey on tool use and workflows in alignment research

Dec 19, 2022, 3:19 PM
79 points
2 comments, 19 min read, LW link

Searching for Search

Nov 28, 2022, 3:31 PM
94 points
9 comments, 14 min read, LW link, 1 review

Update to Mysteries of mode collapse: text-davinci-002 not RLHF

janus, Nov 19, 2022, 11:51 PM
71 points
8 comments, 2 min read, LW link

[simulation] 4chan user claiming to be the attorney hired by Google’s sentient chatbot LaMDA shares wild details of encounter

janus, Nov 10, 2022, 9:39 PM
19 points
1 comment, 13 min read, LW link
(generative.ink)

Mysteries of mode collapse

janus, Nov 8, 2022, 10:37 AM
284 points
57 comments, 14 min read, LW link, 1 review

Simulators

janus, Sep 2, 2022, 12:45 PM
618 points
168 comments, 41 min read, LW link, 8 reviews
(generative.ink)

A descriptive, not prescriptive, overview of current AI Alignment Research

Jun 6, 2022, 9:59 PM
139 points
21 comments, 7 min read, LW link

A survey of tool use and workflows in alignment research

Mar 23, 2022, 11:44 PM
45 points
4 comments, 1 min read, LW link