RSS

Ulisse Mini

Karma: 1,661

Born too late to explore Earth; born too early to explore the galaxy; born just the right time to save humanity.

https://​​uli.rocks/​​about

[Question] What ra­tio­nal­ity failure modes are there?

Ulisse Mini19 Jan 2024 9:12 UTC
42 points
11 comments1 min readLW link

[Question] What ML gears do you like?

Ulisse Mini11 Nov 2023 19:10 UTC
25 points
4 comments1 min readLW link

Paper: Un­der­stand­ing and Con­trol­ling a Maze-Solv­ing Policy Network

13 Oct 2023 1:38 UTC
69 points
0 comments1 min readLW link
(arxiv.org)

Ac­tAdd: Steer­ing Lan­guage Models with­out Optimization

6 Sep 2023 17:21 UTC
105 points
3 comments2 min readLW link
(arxiv.org)

Open prob­lems in ac­ti­va­tion engineering

24 Jul 2023 19:46 UTC
43 points
2 comments1 min readLW link
(coda.io)

[ASoT] GPT2 Steer­ing & The Tuned Lens

Ulisse Mini1 Jul 2023 14:12 UTC
23 points
0 comments2 min readLW link

LIMA: Less Is More for Alignment

Ulisse Mini30 May 2023 17:10 UTC
16 points
6 comments1 min readLW link
(arxiv.org)

TinyS­to­ries: Small Lan­guage Models That Still Speak Co­her­ent English

Ulisse Mini28 May 2023 22:23 UTC
60 points
8 comments2 min readLW link
(arxiv.org)

Steer­ing GPT-2-XL by adding an ac­ti­va­tion vector

13 May 2023 18:42 UTC
423 points
97 comments50 min readLW link

How to get good at programming

Ulisse Mini5 May 2023 1:14 UTC
39 points
3 comments2 min readLW link

Un­der­stand­ing and con­trol­ling a maze-solv­ing policy network

11 Mar 2023 18:59 UTC
312 points
22 comments23 min readLW link

Pre­dic­tions for shard the­ory mechanis­tic in­ter­pretabil­ity results

1 Mar 2023 5:16 UTC
105 points
10 comments5 min readLW link

[ASoT] Policy Tra­jec­tory Visualization

Ulisse Mini7 Feb 2023 0:13 UTC
9 points
2 comments1 min readLW link

In­cen­tives con­sid­ered harmful

Ulisse Mini15 Jan 2023 6:38 UTC
6 points
0 comments1 min readLW link
(uli.rocks)

[Question] Where do you find peo­ple who ac­tu­ally do things?

Ulisse Mini13 Jan 2023 6:57 UTC
7 points
12 comments1 min readLW link

[Question] Effec­tive Evil Causes?

Ulisse Mini30 Dec 2022 2:56 UTC
−12 points
2 comments1 min readLW link

[ASoT] Nat­u­ral ab­strac­tions and AlphaZero

Ulisse Mini10 Dec 2022 17:53 UTC
33 points
1 comment1 min readLW link
(arxiv.org)

[ASoT] Prob­a­bil­ity In­fects Con­cepts it Touches

Ulisse Mini7 Dec 2022 1:48 UTC
10 points
4 comments1 min readLW link

Three Fables of Mag­i­cal Girls and Longtermism

Ulisse Mini2 Dec 2022 22:01 UTC
31 points
11 comments2 min readLW link

[ASoT] Reflec­tivity in Nar­row AI

Ulisse Mini21 Nov 2022 0:51 UTC
6 points
1 comment1 min readLW link