RSS

lisathiergart

Karma: 860

https://​​admonymous.co/​​lisath

Does davi­dad’s up­load­ing moon­shot work?

Nov 3, 2023, 2:21 AM
146 points
35 comments25 min readLW link

Paper: Un­der­stand­ing and Con­trol­ling a Maze-Solv­ing Policy Network

Oct 13, 2023, 1:38 AM
70 points
0 comments1 min readLW link
(arxiv.org)

Ac­tAdd: Steer­ing Lan­guage Models with­out Optimization

Sep 6, 2023, 5:21 PM
105 points
3 comments2 min readLW link
(arxiv.org)

Open prob­lems in ac­ti­va­tion engineering

Jul 24, 2023, 7:46 PM
51 points
2 comments1 min readLW link
(coda.io)

Distil­la­tion of Neu­rotech and Align­ment Work­shop Jan­uary 2023

May 22, 2023, 7:17 AM
51 points
9 comments14 min readLW link

Steer­ing GPT-2-XL by adding an ac­ti­va­tion vector

May 13, 2023, 6:42 PM
437 points
98 comments50 min readLW link1 review

Maze-solv­ing agents: Add a top-right vec­tor, make the agent go to the top-right

Mar 31, 2023, 7:20 PM
101 points
17 comments11 min readLW link