Myopia

TagLast edit: Dec 30, 2024, 11:19 AM by Dakara

Myopia refers to short-sightedness in planning and decision-making processes. It describes a tendency to prioritize immediate or short-term outcomes while disregarding longer-term consequences.

The most extreme form of myopia occurs when an agent considers only immediate rewards, completely disregarding future consequences. In artificial intelligence contexts, a perfectly myopic agent would optimize solely for the current query or task without attempting to influence future outcomes.

Myopic agents demonstrate several notable properties:

Limited temporal scope in decision-making
Focus on immediate reward optimization
Reduced instrumental incentives

Partial Agency

abramdemskiSep 27, 2019, 10:04 PM

75 points

18 comments9 min readLW link

The Credit Assignment Problem

abramdemskiNov 8, 2019, 2:50 AM

105 points

40 comments17 min readLW link 1 review

Towards a mechanistic understanding of corrigibility

evhubAug 22, 2019, 11:20 PM

47 points

26 comments4 min readLW link

How LLMs are and are not myopic

janusJul 25, 2023, 2:19 AM

135 points

16 comments8 min readLW link

Open Problems with Myopia

Mark Xu and evhub

Mar 10, 2021, 6:38 PM

66 points

16 comments8 min readLW link

Steering Behaviour: Testing for (Non-)Myopia in Language Models

Evan R. Murphy and Megan Kinniment

Dec 5, 2022, 8:28 PM

40 points

19 comments10 min readLW link

Defining Myopia

abramdemskiOct 19, 2019, 9:32 PM

32 points

18 comments8 min readLW link

LCDT, A Myopic Decision Theory

adamShimi and evhub

Aug 3, 2021, 10:41 PM

57 points

50 comments15 min readLW link

Arguments against myopic training

Richard_NgoJul 9, 2020, 4:07 PM

62 points

39 comments12 min readLW link

You can still fetch the coffee today if you’re dead tomorrow

davidadDec 9, 2022, 2:06 PM

96 points

19 comments5 min readLW link

The Parable of Predict-O-Matic

abramdemskiOct 15, 2019, 12:49 AM

359 points

43 comments14 min readLW link 2 reviews

The Dualist Predict-O-Matic ($100 prize)

John_MaxwellOct 17, 2019, 6:45 AM

19 points

35 comments5 min readLW link

Evan Hubinger on Homogeneity in Takeoff Speeds, Learned Optimization and Interpretability

Michaël TrazziJun 8, 2021, 7:20 PM

28 points

0 comments55 min readLW link

2019 Review Rewrite: Seeking Power is Often Robustly Instrumental in MDPs

TurnTroutDec 23, 2020, 5:16 PM

35 points

0 comments4 min readLW link

(www.lesswrong.com)

MONA: Three Month Later—Updates and Steganography Without Optimization Pressure

David Lindner and Vikrant Varma

Apr 12, 2025, 11:15 PM

31 points

0 comments5 min readLW link

Understanding and controlling auto-induced distributional shift

L Rudolf LDec 13, 2021, 2:59 PM

33 points

4 comments16 min readLW link

Bayesian Evolving-to-Extinction

abramdemskiFeb 14, 2020, 11:55 PM

40 points

13 comments5 min readLW link

Seeking Power is Often Convergently Instrumental in MDPs

TurnTrout and Logan Riggs

Dec 5, 2019, 2:33 AM

162 points

39 comments17 min readLW link 2 reviews

(arxiv.org)

Random Thoughts on Predict-O-Matic

abramdemskiOct 17, 2019, 11:39 PM

40 points

3 comments9 min readLW link

An overview of 11 proposals for building safe advanced AI

evhubMay 29, 2020, 8:38 PM

220 points

37 comments38 min readLW link 2 reviews

MONA: Managed Myopia with Approval Feedback

Seb Farquhar, David Lindner and Rohin Shah

Jan 23, 2025, 12:24 PM

80 points

29 comments9 min readLW link

Why GPT wants to mesa-optimize & how we might change this

John_MaxwellSep 19, 2020, 1:48 PM

55 points

33 comments9 min readLW link

Self-Fulfilling Prophecies Aren’t Always About Self-Awareness

John_MaxwellNov 18, 2019, 11:11 PM

14 points

7 comments4 min readLW link

Thoughts on “Process-Based Supervision”

Steven ByrnesJul 17, 2023, 2:08 PM

74 points

4 comments23 min readLW link

Limiting an AGI’s Context Temporally

EulersApprenticeFeb 17, 2019, 3:29 AM

5 points

11 comments1 min readLW link

Acceptability Verification: A Research Agenda

David Udell and evhub

Jul 12, 2022, 8:11 PM

50 points

0 comments1 min readLW link

(docs.google.com)

Laziness in AI

Richard HenageSep 2, 2022, 5:04 PM

13 points

5 comments1 min readLW link

GPT-4 aligning with acasual decision theory when instructed to play games, but includes a CDT explanation that’s incorrect if they differ

Christopher KingMar 23, 2023, 4:16 PM

7 points

4 comments8 min readLW link

Transforming myopic optimization to ordinary optimization—Do we want to seek convergence for myopic optimization problems?

tailcalledDec 11, 2021, 8:38 PM

12 points

1 comment5 min readLW link

Underspecification of Oracle AI

Rubi J. Hudson, Adam Jermyn and Johannes Treutlein

Jan 15, 2023, 8:10 PM

30 points

12 comments19 min readLW link

Generative, Episodic Objectives for Safe AI

Michael GlassOct 5, 2022, 11:18 PM

11 points

3 comments8 min readLW link

How complex are myopic imitators?

Vivek HebbarFeb 8, 2022, 12:00 PM

26 points

1 comment15 min readLW link

Graphical World Models, Counterfactuals, and Machine Learning Agents

Koen.HoltmanFeb 17, 2021, 11:07 AM

6 points

2 comments10 min readLW link

Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios

Evan R. MurphyMay 12, 2022, 8:01 PM

58 points

0 comments59 min readLW link

Simulators

janusSep 2, 2022, 12:45 PM

633 points

168 comments41 min readLW link 8 reviews

(generative.ink)

Non-myopia stories

lberglundNov 13, 2023, 5:52 PM

29 points

10 comments7 min readLW link

AI safety via market making

evhubJun 26, 2020, 11:07 PM

72 points

45 comments1 min readLW link

Fighting Akrasia: Incentivising Action

Gordon Seidoh WorleyApr 29, 2009, 1:48 PM

12 points

58 comments2 min readLW link

GPT-4 busted? Clear self-interest when summarizing articles about itself vs when article talks about Claude, LLaMA, or DALL·E 2

Christopher KingMar 31, 2023, 5:05 PM

6 points

4 comments4 min readLW link

No comments.