Optimization

Tag. Last edit: 30 Dec 2024 9:33 UTC by Dakara

Optimization is any process that systematically finds solutions better than the ones used before. More technically, an optimization process steers the world into a specific, otherwise improbable set of states by searching a large space of possibilities and hitting small, low-probability targets. When an agent consistently guides such a process toward a particular state, we can say the agent prefers that state.

An optimization process is best illustrated with an example. Eliezer Yudkowsky suggests natural selection is such a process: through an implicit preference for better replicators, natural selection searches the space of possible genomes and hits small targets, namely efficient mutations.

Consider the human being. We are highly complex objects, very unlikely to have arisen by chance. Natural selection, however, over millions of years built up the infrastructure needed to produce such a functioning body. This body, like those of other organisms, developed (was selected) because it is a rather efficient replicator, well suited to the environment in which it arose.

Or consider the famous chess-playing computer Deep Blue. Outside the narrow domain of selecting moves in chess games, it can’t do anything impressive, but as a chess player it was massively more effective than virtually all humans. It has high optimization power in the chess domain and almost none in any other. Humans and evolution, on the other hand, are more domain-general optimization processes than Deep Blue, but that doesn’t make them more effective at chess specifically. (Note, though, where the optimization-process abstraction is useful and where it is not: it’s not obvious what it would mean for “evolution” to play chess, and yet it is useful to talk about the optimization power of natural selection, or of Deep Blue.)
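The natural-selection example above can be made concrete with a toy sketch. Everything in it is an illustrative assumption rather than anything from the cited posts: genomes are bit lists, "fitness" is simply the number of 1-bits, and the function names (`fitness`, `mutate`, `evolve`, `random_search`) are hypothetical. It contrasts blind sampling with a mutate-and-select loop that has no explicit goal beyond keeping the better replicators.

```python
import random


def fitness(genome):
    """Toy stand-in for 'replicates better': the number of 1-bits."""
    return sum(genome)


def mutate(genome, rng, rate=0.02):
    """Copy a genome, flipping each bit independently with probability `rate`."""
    return [bit ^ (rng.random() < rate) for bit in genome]


def evolve(length=40, population=50, generations=200, seed=0):
    """Mutate-and-select loop: the fitter half survives and reproduces."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(length)] for _ in range(population)]
    for _ in range(generations):
        # Selection: keep the fitter half unchanged.
        pop.sort(key=fitness, reverse=True)
        survivors = pop[: population // 2]
        # Reproduction: each survivor leaves one mutated copy.
        pop = survivors + [mutate(genome, rng) for genome in survivors]
    return max(pop, key=fitness)


def random_search(length=40, tries=10_000, seed=0):
    """Baseline with no optimization pressure: best of many blind samples."""
    rng = random.Random(seed)
    samples = ([rng.randint(0, 1) for _ in range(length)] for _ in range(tries))
    return max(samples, key=fitness)


if __name__ == "__main__":
    print("selection:", fitness(evolve()))
    print("random   :", fitness(random_search()))
```

Under these assumptions the selection loop evaluates about as many genomes as the random baseline, yet typically ends far closer to the all-ones target, which is the sense in which it "hits a small target" in a large search space.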

Measuring Optimization Power

One way to think mathematically about optimization, as with evidence, is in information-theoretic bits. Optimization power is the amount of surprise we would feel at the result if no optimization process were present, so we take the base-two logarithm of the reciprocal of the result’s probability. A one-in-a-million solution (a solution so good relative to your preference ordering that it would take a million random tries to find something that good or better) can be said to have log_2(1,000,000) ≈ 19.9 bits of optimization. Compared to a random configuration of matter, any artifact you see is going to be much more optimized than this. The math describes only laws and general principles for reasoning about optimization; as with probability theory, you often can’t apply the math directly.
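As a minimal sketch of the arithmetic above, the following computes optimization power in bits from the probability that an unoptimized (random) process would do at least as well. The function names `optimization_bits` and `empirical_bits` are hypothetical, and the sample-based estimate is an added assumption: it only works when at least one random sample matches or beats the outcome.

```python
import math


def optimization_bits(probability):
    """Bits of optimization: log2 of the reciprocal of the outcome's probability.

    `probability` is the chance that a random try is at least as good as
    the observed outcome under the preference ordering.
    """
    if not 0.0 < probability <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(probability)


def empirical_bits(outcome_score, random_scores):
    """Estimate the same quantity from scores of randomly sampled outcomes."""
    hits = sum(1 for score in random_scores if score >= outcome_score)
    if hits == 0:
        raise ValueError("no random sample is as good; draw more samples")
    return -math.log2(hits / len(random_scores))


if __name__ == "__main__":
    # The one-in-a-million solution from the text.
    print(round(optimization_bits(1 / 1_000_000), 1))
```

A result no better than a coin flip's worth of luck (probability 0.5) scores one bit, and every further halving of the probability adds one more.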

Further Reading & References

Optimization and the Singularity by Eliezer Yudkowsky
Measuring Optimization Power by Eliezer Yudkowsky

See also

The ground of optimization by Alex Flint, 20 Jun 2020 0:38 UTC (252 points, 80 comments, 27 min read, 1 review)
Measuring Optimization Power by Eliezer Yudkowsky, 27 Oct 2008 21:44 UTC (91 points, 38 comments, 6 min read)
Optimization Amplifies by Scott Garrabrant, 27 Jun 2018 1:51 UTC (119 points, 12 comments, 4 min read)
Optimization by Eliezer Yudkowsky, 13 Sep 2008 16:00 UTC (57 points, 45 comments, 5 min read)
Selection vs Control by abramdemski, 2 Jun 2019 7:01 UTC (180 points, 27 comments, 11 min read, 2 reviews)
DL towards the unaligned Recursive Self-Optimization attractor by jacob_cannell, 18 Dec 2021 2:15 UTC (32 points, 22 comments, 4 min read)
Risks from Learned Optimization: Introduction by evhub, Chris van Merwijk, Vlad Mikulik, Joar Skalse and Scott Garrabrant, 31 May 2019 23:44 UTC (187 points, 42 comments, 12 min read, 3 reviews)
Aiming at the Target by Eliezer Yudkowsky, 26 Oct 2008 16:47 UTC (40 points, 40 comments, 5 min read)
Thoughts and problems with Eliezer’s measure of optimization power by Stuart_Armstrong, 8 Jun 2012 9:44 UTC (36 points, 24 comments, 5 min read)
Beren’s “Deconfusing Direct vs Amortised Optimisation” by DragonGod, 7 Apr 2023 8:57 UTC (52 points, 10 comments, 3 min read)
The Optimizer’s Curse and How to Beat It by lukeprog, 16 Sep 2011 2:46 UTC (100 points, 84 comments, 3 min read)
Optimality is the tiger, and agents are its teeth by Veedrac, 2 Apr 2022 0:46 UTC (345 points, 46 comments, 16 min read, 1 review)
Bottle Caps Aren’t Optimisers by DanielFilan, 31 Aug 2018 18:30 UTC (101 points, 23 comments, 3 min read, 1 review; danielfilan.com)
Optimization Concepts in the Game of Life by Vika and Ramana Kumar, 16 Oct 2021 20:51 UTC (75 points, 16 comments, 10 min read)
Steering systems by Max H, 4 Apr 2023 0:56 UTC (50 points, 1 comment, 15 min read)
Towards Measures of Optimisation by mattmacdermott and Alexander Gietelink Oldenziel, 12 May 2023 15:29 UTC (53 points, 37 comments, 4 min read)
Requirements for a STEM-capable AGI Value Learner (my Case for Less Doom) by RogerDearnaley, 25 May 2023 9:26 UTC (33 points, 3 comments, 15 min read)
Goodhart’s Curse and Limitations on AI Alignment by Gordon Seidoh Worley, 19 Aug 2019 7:57 UTC (25 points, 18 comments, 10 min read)
Utility Maximization = Description Length Minimization by johnswentworth, 18 Feb 2021 18:04 UTC (223 points, 52 comments, 6 min read)
The Credit Assignment Problem by abramdemski, 8 Nov 2019 2:50 UTC (107 points, 40 comments, 17 min read, 1 review)
Abstracting The Hardness of Alignment: Unbounded Atomic Optimization by adamShimi, 29 Jul 2022 18:59 UTC (75 points, 3 comments, 16 min read)
Deconfusing Direct vs Amortised Optimization by beren, 2 Dec 2022 11:30 UTC (136 points, 19 comments, 10 min read)
Defining “optimizer” by Chantiel, 17 Apr 2021 15:38 UTC (9 points, 6 comments, 1 min read)
A new definition of “optimizer” by Chantiel, 9 Aug 2021 13:42 UTC (5 points, 0 comments, 7 min read)
Quantifying General Intelligence by JasonBrown, 17 Jun 2022 21:57 UTC (9 points, 6 comments, 13 min read)
Ngo and Yudkowsky on AI capability gains by Eliezer Yudkowsky and Richard_Ngo, 18 Nov 2021 22:19 UTC (131 points, 61 comments, 38 min read, 1 review)
What is optimization power, formally? by sbenthall, 18 Oct 2014 18:37 UTC (18 points, 16 comments, 2 min read)
Difficulty classes for alignment properties by Jozdien, 20 Feb 2024 9:08 UTC (34 points, 5 comments, 2 min read)
Applications for Deconfusing Goal-Directedness by adamShimi, 8 Aug 2021 13:05 UTC (38 points, 3 comments, 5 min read, 1 review)
Two senses of “optimizer” by Joar Skalse, 21 Aug 2019 16:02 UTC (35 points, 41 comments, 3 min read)
Fundamental Uncertainty: Chapter 4 - Why don’t we do what we think we should? by Gordon Seidoh Worley, 29 Aug 2022 19:25 UTC (15 points, 6 comments, 13 min read)
In Defence of Optimizing Routine Tasks by leogao, 9 Nov 2021 5:09 UTC (47 points, 6 comments, 3 min read, 1 review)
Search versus design by Alex Flint, 16 Aug 2020 16:53 UTC (109 points, 40 comments, 36 min read, 1 review)
Vingean Agency by abramdemski, 24 Aug 2022 20:08 UTC (63 points, 14 comments, 3 min read)
Consequentialism is in the Stars not Ourselves by DragonGod, 24 Apr 2023 0:02 UTC (7 points, 19 comments, 5 min read)
Bits of Optimization Can Only Be Lost Over A Distance by johnswentworth, 23 May 2022 18:55 UTC (31 points, 18 comments, 2 min read)
Gaia Network: a practical, incremental pathway to Open Agency Architecture by Roman Leventov and Rafael Kaufmann Nedal, 20 Dec 2023 17:11 UTC (22 points, 8 comments, 16 min read)
Defining Optimization in a Deeper Way Part 4 by J Bostock, 28 Jul 2022 17:02 UTC (7 points, 0 comments, 5 min read)
Mesa-Optimizers vs “Steered Optimizers” by Steven Byrnes, 10 Jul 2020 16:49 UTC (48 points, 7 comments, 8 min read)
Life’s Story Continues by Eliezer Yudkowsky, 21 Nov 2008 23:05 UTC (24 points, 14 comments, 5 min read)
Mesa-Optimizers and Over-optimization Failure (Optimizing and Goodhart Effects, Clarifying Thoughts—Part 4) by Davidmanheim, 12 Aug 2019 8:07 UTC (15 points, 3 comments, 4 min read)
Searching for Searching for Search by Rubi J. Hudson, 14 Feb 2024 23:51 UTC (21 points, 4 comments, 7 min read)
Draft: Detecting optimization by Alex_Altair, 29 Mar 2023 20:17 UTC (23 points, 2 comments, 6 min read)
Fake Optimization Criteria by Eliezer Yudkowsky, 10 Nov 2007 0:10 UTC (73 points, 21 comments, 3 min read)
Meaning & Agency by abramdemski, 19 Dec 2023 22:27 UTC (93 points, 17 comments, 14 min read)
Defining Optimization in a Deeper Way Part 1 by J Bostock, 1 Jul 2022 14:03 UTC (7 points, 0 comments, 2 min read)
Is the term mesa optimizer too narrow? by Matthew Barnett, 14 Dec 2019 23:20 UTC (39 points, 21 comments, 1 min read)
Mathematical Measures of Optimization Power by Alex_Altair, 24 Nov 2012 10:55 UTC (8 points, 16 comments, 5 min read)
Notes on Simplicity by David Gross, 2 Dec 2020 23:14 UTC (9 points, 0 comments, 7 min read)
Draft: The optimization toolbox by Alex_Altair, 28 Mar 2023 20:40 UTC (20 points, 1 comment, 7 min read)
Optimization Provenance by Adele Lopez, 23 Aug 2019 20:08 UTC (38 points, 5 comments, 5 min read)
Game Theory without Argmax [Part 2] by Cleo Nardo, 11 Nov 2023 16:02 UTC (31 points, 14 comments, 13 min read)
Fat Tails Discourage Compromise by niplav, 17 Jun 2024 9:39 UTC (53 points, 5 comments, 1 min read)
[Question] How Many Bits Of Optimization Can One Bit Of Observation Unlock? by johnswentworth, 26 Apr 2023 0:26 UTC (62 points, 32 comments, 3 min read)
Clarifying mesa-optimization by Marius Hobbhahn and Pierre Peigné, 21 Mar 2023 15:53 UTC (38 points, 6 comments, 10 min read)
What I Learned Running Refine by adamShimi, 24 Nov 2022 14:49 UTC (108 points, 5 comments, 4 min read)
Don’t align agents to evaluations of plans by TurnTrout, 26 Nov 2022 21:16 UTC (48 points, 49 comments, 18 min read)
[Question] Do the Safety Properties of Powerful AI Systems Need to be Adversarially Robust? Why? by DragonGod, 9 Feb 2023 13:36 UTC (22 points, 42 comments, 2 min read)
Game Theory without Argmax [Part 1] by Cleo Nardo, 11 Nov 2023 15:59 UTC (70 points, 18 comments, 19 min read)
Draft: Introduction to optimization by Alex_Altair, 26 Mar 2023 17:25 UTC (43 points, 8 comments, 16 min read)
The First World Takeover by Eliezer Yudkowsky, 19 Nov 2008 15:00 UTC (42 points, 24 comments, 6 min read)
Towards a formalization of the agent structure problem by Alex_Altair, 29 Apr 2024 20:28 UTC (55 points, 6 comments, 14 min read)
“Normal” is the equilibrium state of past optimization processes by Alex_Altair, 30 Oct 2022 19:03 UTC (91 points, 5 comments, 5 min read)
Measurement, Optimization, and Take-off Speed by jsteinhardt, 10 Sep 2021 19:30 UTC (48 points, 4 comments, 13 min read)
Distributed Decisions by johnswentworth, 29 May 2022 2:43 UTC (66 points, 6 comments, 6 min read)
Family-line selection optimizer by lemonhope, 22 Apr 2025 7:16 UTC (2 points, 0 comments, 1 min read)
Defining Optimization in a Deeper Way Part 3 by J Bostock, 20 Jul 2022 22:06 UTC (8 points, 0 comments, 2 min read)
Defining Optimization in a Deeper Way Part 2 by J Bostock, 11 Jul 2022 20:29 UTC (7 points, 0 comments, 4 min read)
Opportunity Cost Blackmail by adamShimi, 2 Jan 2023 13:48 UTC (70 points, 11 comments, 2 min read; epistemologicalvigilance.substack.com)
Draft: Inferring minimizers by Alex_Altair, 1 Apr 2023 20:20 UTC (9 points, 0 comments, 1 min read)
Adversarial attacks and optimal control by Jan, 22 May 2022 18:22 UTC (17 points, 7 comments, 8 min read; universalprior.substack.com)
A Thermodynamic Theory of Intelligence: Why Extreme Optimization May Be Mathematically Impossible by Adreius, 29 May 2025 12:18 UTC (1 point, 0 comments, 3 min read)
Degrees of Freedom by sarahconstantin, 2 Apr 2019 21:10 UTC (103 points, 31 comments, 11 min read; srconstantin.wordpress.com)
Degeneracies are sticky for SGD by Guillaume Corlouer and Nicolas Macé, 16 Jun 2024 21:19 UTC (56 points, 1 comment, 16 min read)
Discovering Agents by zac_kenton, 18 Aug 2022 17:33 UTC (73 points, 11 comments, 6 min read)
Runaway Optimizers in Mind Space by silentbob, 16 Jul 2023 14:26 UTC (16 points, 0 comments, 12 min read)
Interview with Bill O’Rourke—Russian Corruption, Putin, Applied Ethics, and More by JohnGreer, 27 Oct 2024 17:11 UTC (2 points, 0 comments, 6 min read)
Some Problems with Ordinal Optimization Frame by Mateusz Bagiński, 6 May 2024 5:28 UTC (9 points, 0 comments, 7 min read)
Architecture-aware optimisation: train ImageNet and more without hyperparameters by Chris Mingard, 22 Apr 2023 21:50 UTC (6 points, 2 comments, 2 min read)
Non-resolve as Resolve by Linda Linsefors, 10 Jul 2018 23:31 UTC (15 points, 1 comment, 2 min read)
When Can Optimization Be Done Safely? by StrivingForLegibility, 30 Dec 2023 1:24 UTC (12 points, 0 comments, 3 min read)
Siren worlds and the perils of over-optimised search by Stuart_Armstrong, 7 Apr 2014 11:00 UTC (84 points, 418 comments, 7 min read)
Could Things Be Very Different?—How Historical Inertia Might Blind Us To Optimal Solutions by James Stephen Brown, 11 Sep 2024 9:53 UTC (5 points, 0 comments, 8 min read; nonzerosum.games)
Optimisation Measures: Desiderata, Impossibility, Proposals by mattmacdermott and Alexander Gietelink Oldenziel, 7 Aug 2023 15:52 UTC (36 points, 9 comments, 1 min read)
Extinction Risks from AI: Invisible to Science? by VojtaKovarik, Chris van Merwijk and Ida Mattsson, 21 Feb 2024 18:07 UTC (24 points, 7 comments, 1 min read; arxiv.org)
Safety Data Sheets for Optimization Processes by StrivingForLegibility, 4 Jan 2024 23:30 UTC (15 points, 1 comment, 4 min read)
Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity by zhanpeng_zhou, 20 Jul 2023 17:38 UTC (22 points, 13 comments, 3 min read; openreview.net)
The Carnot Engine of Economics by StrivingForLegibility, 9 Aug 2024 15:59 UTC (5 points, 0 comments, 5 min read)
Visual demonstration of Optimizer’s curse by Roman Malov, 30 Nov 2024 19:34 UTC (25 points, 3 comments, 7 min read)
Optimization Markets by StrivingForLegibility, 30 Dec 2023 1:24 UTC (13 points, 2 comments, 2 min read)
MONA: Managed Myopia with Approval Feedback by Seb Farquhar, David Lindner and Rohin Shah, 23 Jan 2025 12:24 UTC (81 points, 30 comments, 9 min read)
Hyperdimensional connection method—A Lossless Framework Preserving Meaning, Structure, and Semantic Relationships across Modalities.(A MatrixTransformer subsidiary) by fikayoAy, 18 Jul 2025 10:24 UTC (1 point, 0 comments, 1 min read)
Understanding Gradient Hacking by peterbarnett, 10 Dec 2021 15:58 UTC (41 points, 5 comments, 30 min read)
The AI’s Toolbox: From Soggy Toast to Optimal Solutions by Thehumanproject.ai, 22 Jun 2025 20:54 UTC (1 point, 0 comments, 8 min read)
Evolutions Building Evolutions: Layers of Generate and Test by plex, 5 Feb 2021 18:21 UTC (12 points, 1 comment, 6 min read)
I missed the crux of the alignment problem the whole time by zeshen, 13 Aug 2022 10:11 UTC (53 points, 7 comments, 3 min read)
Perils of optimizing in social contexts by owencb, 16 Jun 2022 17:40 UTC (50 points, 1 comment, 2 min read)
Aligning a toy model of optimization by paulfchristiano, 28 Jun 2019 20:23 UTC (53 points, 25 comments, 3 min read)
The Gears of Argmax by StrivingForLegibility, 4 Jan 2024 23:30 UTC (11 points, 0 comments, 3 min read)
Is General Intelligence “Compact”? by DragonGod, 4 Jul 2022 13:27 UTC (27 points, 6 comments, 22 min read)
The Limits of Automation by milkandcigarettes, 23 Jun 2022 18:03 UTC (5 points, 1 comment, 5 min read; milkandcigarettes.com)
Breaking Down Goal-Directed Behaviour by Oliver Sourbut, 16 Jun 2022 18:45 UTC (11 points, 1 comment, 2 min read)
Accidental Optimizers by aysajan, 22 Sep 2021 13:27 UTC (7 points, 2 comments, 3 min read)
Interpretable by Design—Constraint Sets with Disjoint Limit Points by Ronak_Mehta, 8 May 2025 21:08 UTC (24 points, 2 comments, 9 min read; ronakrm.github.io)
Optimization happens inside the mind, not in the world by azsantosk, 3 Jun 2023 21:36 UTC (17 points, 10 comments, 5 min read)
Plans Are Predictions, Not Optimization Targets by johnswentworth, 20 Oct 2022 21:17 UTC (109 points, 20 comments, 4 min read, 1 review)
Optimization and the Singularity by Eliezer Yudkowsky, 23 Jun 2008 5:55 UTC (41 points, 21 comments, 9 min read)
Demons in Imperfect Search by johnswentworth, 11 Feb 2020 20:25 UTC (110 points, 21 comments, 3 min read)
Interlude: But Who Optimizes The Optimizer? by Paul Bricman, 23 Sep 2022 15:30 UTC (15 points, 0 comments, 10 min read)
Adam Optimizer Causes Privileged Basis in Transformer LM Residual Stream by Diego Caples and rrenaud, 6 Sep 2024 17:55 UTC (70 points, 7 comments, 4 min read)
Observing Optimization by Eliezer Yudkowsky, 21 Nov 2008 5:39 UTC (12 points, 28 comments, 6 min read)
Optimization and Adequacy in Five Bullets by james.lucassen, 6 Jun 2022 5:48 UTC (35 points, 2 comments, 4 min read; jlucassen.com)
Optimizing crop planting with mixed integer linear programming in Stardew Valley by hapanin, 5 Apr 2022 18:42 UTC (68 points, 4 comments, 7 min read)
Hypothesis: gradient descent prefers general circuits by Quintin Pope, 8 Feb 2022 21:12 UTC (46 points, 26 comments, 11 min read)
The Human’s Role in Mesa Optimization by silentbob, 9 May 2024 12:07 UTC (5 points, 0 comments, 2 min read)
One bit of observation can unlock many of optimization—but at what cost? by dr_s, 29 Apr 2023 10:53 UTC (42 points, 4 comments, 5 min read)
(Structural) Stability of Coupled Optimizers by Paul Bricman, 30 Sep 2022 11:28 UTC (25 points, 0 comments, 10 min read)
Thinking about maximization and corrigibility by James Payor, 21 Apr 2023 21:22 UTC (63 points, 4 comments, 5 min read)
Bridging Expected Utility Maximization and Optimization by Daniel Herrmann, 5 Aug 2022 8:18 UTC (25 points, 5 comments, 14 min read)
Tessellating Hills: a toy model for demons in imperfect search by DaemonicSigil, 20 Feb 2020 0:12 UTC (97 points, 18 comments, 2 min read)
What’s General-Purpose Search, And Why Might We Expect To See It In Trained ML Systems? by johnswentworth, 15 Aug 2022 22:48 UTC (157 points, 18 comments, 10 min read)
Wildfire of strategicness by TsviBT, 5 Jun 2023 13:59 UTC (38 points, 19 comments, 1 min read)
Surprising examples of non-human optimization by Jan_Rzymkowski, 14 Jun 2015 17:05 UTC (31 points, 9 comments, 1 min read)
The Three Warnings of the Zentradi by Trevor Hill-Hand, 21 Nov 2024 20:28 UTC (13 points, 1 comment, 5 min read)
Goldilocks and the Three Optimisers by dkl9, 17 Aug 2023 18:15 UTC (−10 points, 0 comments, 5 min read; dkl9.net)
Worse Than Random by Eliezer Yudkowsky, 11 Nov 2008 19:01 UTC (46 points, 102 comments, 12 min read)
Notes on Antelligence by Aurigena, 13 May 2023 18:38 UTC (2 points, 0 comments, 9 min read)
Transforming myopic optimization to ordinary optimization—Do we want to seek convergence for myopic optimization problems? by tailcalled, 11 Dec 2021 20:38 UTC (12 points, 1 comment, 5 min read)
Breaking the Optimizer’s Curse, and Consequences for Existential Risks and Value Learning by Roger Dearnaley, 21 Feb 2023 9:05 UTC (10 points, 1 comment, 23 min read)
Efficient Cross-Domain Optimization by Eliezer Yudkowsky, 28 Oct 2008 16:33 UTC (56 points, 38 comments, 5 min read)
No free lunch theorem is irrelevant by Catnee, 4 Oct 2022 0:21 UTC (18 points, 7 comments, 1 min read)
Hedonic asymmetries by paulfchristiano, 26 Jan 2020 2:10 UTC (98 points, 22 comments, 2 min read; sideways-view.com)
The slingshot helps with learning by Wilson Wu, 31 Oct 2024 23:18 UTC (33 points, 0 comments, 8 min read)
[Question] What are examples of someone doing a lot of work to find the best of something? by chanamessinger, 27 Jul 2023 15:58 UTC (29 points, 16 comments, 1 min read)
Satisficers want to become maximisers by Stuart_Armstrong, 21 Oct 2011 16:27 UTC (38 points, 70 comments, 1 min read)
Don’t design agents which exploit adversarial inputs by TurnTrout and Garrett Baker, 18 Nov 2022 1:48 UTC (72 points, 64 comments, 12 min read)
