
Quantilization


A Quantilizer is a proposed AI design that aims to reduce the harms from Goodhart’s law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over actions. It is more of a theoretical tool for exploring ways around these problems than a practical buildable design.
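To make the idea concrete, here is a minimal sketch of a q-quantilizer over a discrete action set, assuming a hypothetical helper `quantilize` with made-up argument names (`actions`, `base_probs`, `utility`, `q`); it is an illustration of the general technique, not the implementation from any of the posts below. Instead of picking the single highest-utility action, it samples from the base (human-like) distribution conditioned on being in the top q fraction of actions by utility.

```python
import random

def quantilize(actions, base_probs, utility, q=0.1, rng=random):
    """Sample an action from the top-q fraction (by utility) of a base
    distribution, rather than taking the single utility-maximizing action.

    actions: list of candidate actions
    base_probs: base-distribution probability of each action (e.g. how likely
        a human would be to take it); assumed to sum to ~1
    utility: function mapping an action to its estimated utility
    q: fraction of base-distribution probability mass to keep (0 < q <= 1)
    """
    # Order actions from highest to lowest utility.
    ranked = sorted(zip(actions, base_probs),
                    key=lambda ap: utility(ap[0]), reverse=True)

    # Keep top-utility actions until q of the base probability mass is covered;
    # the boundary action is included only with the mass that fits under q.
    kept, mass = [], 0.0
    for action, p in ranked:
        if mass >= q:
            break
        weight = min(p, q - mass)
        kept.append((action, weight))
        mass += weight

    # Sample proportionally to the (truncated) base probabilities, i.e. the
    # base distribution conditioned on the top q-quantile of utility.
    total = sum(w for _, w in kept)
    r = rng.uniform(0, total)
    for action, w in kept:
        r -= w
        if r <= 0:
            return action
    return kept[-1][0]
```

With q = 1 this just reproduces the base distribution; as q shrinks toward 0 it approaches pure maximization, which is the regime where the Goodhart-style failures the posts below discuss reappear.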


Quantilizers maximize expected utility subject to a conservative cost constraint

jessicata · 28 Sep 2015 2:17 UTC · 33 points · 3 comments · 5 min read · LW link

Another view of quantilizers: avoiding Goodhart’s Law

jessicata · 9 Jan 2016 4:02 UTC · 26 points · 2 comments · 2 min read · LW link

Computing an exact quantilal policy

Vanessa Kosoy · 12 Apr 2018 9:23 UTC · 9 points · 0 comments · 2 min read · LW link

When to use quantilization

RyanCarey · 5 Feb 2019 17:17 UTC · 65 points · 5 comments · 4 min read · LW link

Quantilal control for finite MDPs

Vanessa Kosoy · 12 Apr 2018 9:21 UTC · 14 points · 0 comments · 13 min read · LW link

Soft optimization makes the value target bigger

Jeremy Gillen · 2 Jan 2023 16:06 UTC · 117 points · 20 comments · 12 min read · LW link

Quantilizers and Generative Models

Adam Jermyn · 18 Jul 2022 16:32 UTC · 24 points · 5 comments · 4 min read · LW link

[Question] Why don’t quantilizers also cut off the upper end of the distribution?

Alex_Altair · 15 May 2023 1:40 UTC · 25 points · 2 comments · 1 min read · LW link

Quantilizer ≡ Optimizer with a Bounded Amount of Output

itaibn0 · 16 Nov 2021 1:03 UTC · 11 points · 4 comments · 2 min read · LW link

Hedonic Loops and Taming RL

beren · 19 Jul 2023 15:12 UTC · 20 points · 14 comments · 9 min read · LW link

The murderous shortcut: a toy model of instrumental convergence

Thomas Kwa · 2 Oct 2024 6:48 UTC · 37 points · 0 comments · 2 min read · LW link

[Aspiration-based designs] 1. Informal introduction

28 Apr 2024 13:00 UTC · 41 points · 4 comments · 8 min read · LW link

Stable Pointers to Value III: Recursive Quantilization

abramdemski · 21 Jul 2018 8:06 UTC · 20 points · 4 comments · 4 min read · LW link

How to safely use an optimizer

Simon Fischer · 28 Mar 2024 16:11 UTC · 47 points · 21 comments · 7 min read · LW link

AISC project: SatisfIA – AI that satisfies without overdoing it

Jobst Heitzig · 11 Nov 2023 18:22 UTC · 12 points · 0 comments · 1 min read · LW link · (docs.google.com)

AISC team report: Soft-optimization, Bayes and Goodhart

27 Jun 2023 6:05 UTC · 37 points · 2 comments · 15 min read · LW link

Recursive Quantilizers II

abramdemski · 2 Dec 2020 15:26 UTC · 30 points · 15 comments · 13 min read · LW link

[Aspiration-based designs] 2. Formal framework, basic algorithm

28 Apr 2024 13:02 UTC · 16 points · 2 comments · 16 min read · LW link