Born too late to explore Earth; born too early to explore the galaxy; born at just the right time to save humanity.
Ulisse Mini
Are most uncertainties we care about logical rather than informational? All empirical ML experiments are pure computations a Bayesian superintelligence could do in its head. How much of our uncertainty comes from computational limits in practice, versus actual information bottlenecks?
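As a toy illustration (everything here is a made-up stand-in, not a real experiment): once code, data, and seed are fixed, an "experiment" is a deterministic function of its inputs, so a bounded reasoner's uncertainty about the result is purely computational.

```python
import hashlib

def deterministic_experiment(seed: int) -> bool:
    # Stand-in for a fully specified ML experiment: the outcome is already
    # determined by the inputs; we just haven't done the computation yet.
    digest = hashlib.sha256(str(seed).encode()).hexdigest()
    return int(digest, 16) % 2 == 0  # "did the run beat the baseline?"

# Before running this, we're uncertain about the answer even though no new
# information about the world is needed to resolve it.
print(deterministic_experiment(42))
```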
A trick to remember: the first letter of each virtue gives (in blocks): CRL EAES HP PSV, which can easily be remembered as “cooperative reinforcement learning, EAs, Harry Potter, PS: The last virtue is the void.”
(Obviously remembering these is pointless, but memorizing lists is a nice way to practice mnemonic technique.)
We propose Algorithm Distillation (AD), a method for distilling reinforcement learning (RL) algorithms into neural networks by modeling their training histories with a causal sequence model. Algorithm Distillation treats learning to reinforcement learn as an across-episode sequential prediction problem. A dataset of learning histories is generated by a source RL algorithm, and then a causal transformer is trained by autoregressively predicting actions given their preceding learning histories as context. Unlike sequential policy prediction architectures that distill post-learning or expert sequences, AD is able to improve its policy entirely in-context without updating its network parameters. We demonstrate that AD can reinforcement learn in-context in a variety of environments with sparse rewards, combinatorial task structure, and pixel-based observations, and find that AD learns a more data-efficient RL algorithm than the one that generated the source data.
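To make the objective concrete, here's a minimal sketch of what one AD-style training step could look like, assuming a generic causal transformer `model` (causal mask handled internally) and tokenized learning histories; the names and interfaces are hypothetical, not the paper's actual code.

```python
import torch.nn.functional as F

def ad_training_step(model, optimizer, history):
    """One across-episode next-action prediction step on a single learning history.

    history: tensors spanning many episodes, ordered as the source RL
    algorithm produced them: obs (T, obs_dim), actions (T,), rewards (T,).
    """
    obs, actions, rewards = history["obs"], history["actions"], history["rewards"]
    # The model's causal mask means the prediction at step t only conditions on
    # the learning history preceding t.
    logits = model(obs, actions, rewards)    # (T, num_actions)
    loss = F.cross_entropy(logits, actions)  # predict the source algorithm's actions
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```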
EleutherAI’s #alignment channels are good to ask questions in. Some specific answers:

“I understand that a reward maximiser would wire-head (take control over the reward provision mechanism), but I don’t see why training an RL agent would necessarily end up in a reward-maximising agent? TurnTrout’s Reward is Not the Optimisation Target shed some clarity on this, but I definitely have remaining questions.”
Leo Gao’s Toward Deconfusing Wireheading and Reward Maximization sheds some light on this.
How can I look at my children and not already be mourning their death from day 1?
Suppose you lived in the dark times, when children had a <50% chance of living to adulthood. Wouldn’t you still have kids? Even if, probabilistically, smallpox was likely to take them?
If AI kills us all, will my children suffer? Will it be my fault for having brought them into the world while knowing this would happen?
Even if they don’t live to adulthood, I’d still view their childhoods as valuable. Arguably higher average utility than adulthood.
Even if my children’s short lives are happy, wouldn’t their happiness be fundamentally false and devoid of meaning?
Our lifetimes are currently bounded, are they false and devoid of all meaning?
The negentropy in the universe is also bounded, is the universe false and devoid of all meaning?
Random thought: Perhaps you could carefully engineer gradient starvation in order to “avoid generalizing” and defeat the Discrete modes of prediction example (a toy sketch of gradient starvation itself is below). You’d only need to delay that generalization until reflection; then the AI can solve the successor-AI problem.
In general: hack our way towards getting value-preserving reflectivity before values drift from “Diamonds” → “What’s labeled as a diamond by humans” (replacing those with “Telling the truth” and “What the human thinks is true” respectively, for the honesty case).
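For reference, here's a toy, purely illustrative example of gradient starvation itself (separate from the proposal above): two redundant features predict the label, the large-scale one drives the loss down early, and the gradient signal on the other is starved.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
n = 1000
y = torch.randint(0, 2, (n,)).float()
strong = (2 * y - 1) * 5.0   # large-scale, "easy" feature
weak = (2 * y - 1) * 0.5     # small-scale, redundant feature
X = torch.stack([strong, weak], dim=1)

w = torch.zeros(2, requires_grad=True)
opt = torch.optim.SGD([w], lr=0.1)
for _ in range(500):
    loss = F.binary_cross_entropy_with_logits(X @ w, y)
    opt.zero_grad()
    loss.backward()
    opt.step()

# Once the logits saturate via the strong feature, gradients on the weak one
# shrink too, so its weight stays far smaller than if it were trained alone.
print(w)
```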
[ASoT] Natural abstractions and AlphaZero
I disagree that the policy must be worth selling (see e.g. Jordan Belfort). Many salespeople can sell things that aren’t worth buying. See also Never Split the Difference for an example of negotiating when you have little or worse leverage.
(Also, I don’t think HTWFAIP boils down to satisfying an eager want; the other advice is super important too, e.g. don’t criticize, be genuinely interested in the person, …)
Both are important, but I disagree that power is always needed. In examples 3, 7, and 9 it isn’t clear that the compromise is actually better for the convinced party: the insurance is likely -EV, the peas aren’t actually a crux to defeating the bully, and the child would likely be happier outside kindergarten.
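To unpack “-EV” with made-up numbers (the actual figures aren’t specified in the example): buying insurance is negative expected value whenever the premium exceeds the payout times the probability of the event, which is typically the case since the insurer also has to cover overhead and profit.

```python
premium = 100.0    # yearly cost of the policy (hypothetical)
payout = 10_000.0  # what it pays out if the bad event happens (hypothetical)
p_event = 0.005    # yearly probability of the event (hypothetical)

ev_of_buying = -premium + p_event * payout  # -100 + 50 = -50
print(ev_of_buying)  # negative: on average you pay more than you get back
```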
From skimming the benchmark and the paper this seems overhyped (like Gato). Roughly it looks like:
May 2022: Deepmind releases a new benchmark for learning algorithms
...Nobody cares (according to Google Scholar citations)
Dec 2022: Deepmind releases a thing that beats the baselines on their benchmark
I don’t know much about GNNs & only did a surface-level skim, so I’m interested to hear other takes.
Interesting perspective, kinda reminds me of the ROME paper where it seems to only do “shallow counterfactuals”.
[ASoT] Probability Infects Concepts it Touches
unpopular opinion: I like the ending of the subsequent film
IMO it’s a natural continuation for Homura. After spending decades of subjective time trying to save someone, would you really let them go like that? Homura isn’t an altruist; she doesn’t care about the lifetime of the universe, she just wants Madoka.
I was directing that towards LessWrongers reading my answer, not the general population.
I think school is huge in preventing people from becoming smart and curious. I spent 1-2 years where I hardly studied at all and mostly played videogames, but when I quit I did so of my own free will. I think there’s a huge difference between discipline imposed from the outside vs the inside, and getting to the latter is worth a lot (though I wish I hadn’t wasted all that time, haha).
I’m unsure which parts of my upbringing were cruxes for unschooling working. You should probably read a book or something rather than taking my (very abnormal) opinion. I just know how it went for me :)
Epistemic status: personal experience.
I’m unschooled and think it’s clearly better, even if you factor in my parents being significantly above average in parenting. Optimistically, school is babysitting: people learn nothing there while wasting most of their childhood. Pessimistically, it’s actively harmful, teaching people to hate learning and building antibodies against education.
Here’s a good documentary made by someone who’s been in and out of school. I can’t give detailed criticism since I (thankfully) never had to go to school.
EDIT: As for what the alternative should be, I honestly don’t know. Shifting equilibria is hard, though it’s easy to give better examples (e.g. dath ilan, things in the documentary I linked). For a personal solution: homeschool your kids.
Three Fables of Magical Girls and Longtermism
#3 is good. Another good reason is so you have enough mathematical maturity to understand fancy theoretical results.
I’m probably overestimating the importance of #4, really I just like having the ability to pick up a random undergrad/early-grad math book and understand what’s going on, and I’d like to extend that further up the tree :)
(Note: I haven’t finished any of them.)
Quantum Computing Since Democritus is great; I understand Gödel’s results now! And a bunch of complexity stuff I’m still wrapping my head around.
The Road to Reality is great; I can pretend to know complex analysis after reading chapters 5, 7, and 8, and most people can’t tell the difference! Here’s a solution to a problem in chapter 7 I wrote up.
I’ve only skimmed parts of the Princeton guides, and different articles are written by different authors, but Tao’s explanation of compactness (also in the book) is fantastic. I don’t remember what else I read specifically.
Started reading All the Math You Missed but stopped before I got to the new parts; the linear algebra review was useful though. Will definitely read more at some point.
I read some of The Napkin’s guide to Group Theory, but not much else. Got a great joke from it:
Congratulations!
Linear Algebra Done Right is great for gaining proof skills, though for the record I’ve read it and haven’t solved alignment yet. I think I need several more passes of linear algebra :)