RSS

Quantilization

TagLast edit: Dec 2, 2024, 5:35 PM by Mateusz Bagiński

A Quantilizer is a proposed AI design that aims to reduce the harms from Goodhart’s law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over actions. It is more of a theoretical tool for exploring ways around these problems than a practical buildable design.

See also

Quan­tiliz­ers max­i­mize ex­pected util­ity sub­ject to a con­ser­va­tive cost constraint

jessicataSep 28, 2015, 2:17 AM
33 points
3 comments5 min readLW link

Another view of quan­tiliz­ers: avoid­ing Good­hart’s Law

jessicataJan 9, 2016, 4:02 AM
26 points
2 comments2 min readLW link

Com­put­ing an ex­act quan­tilal policy

Vanessa KosoyApr 12, 2018, 9:23 AM
9 points
0 comments2 min readLW link

When to use quantilization

RyanCareyFeb 5, 2019, 5:17 PM
65 points
5 comments4 min readLW link

Quan­tilal con­trol for finite MDPs

Vanessa KosoyApr 12, 2018, 9:21 AM
14 points
0 comments13 min readLW link

Soft op­ti­miza­tion makes the value tar­get bigger

Jeremy GillenJan 2, 2023, 4:06 PM
117 points
20 comments12 min readLW link

Quan­tiliz­ers and Gen­er­a­tive Models

Adam JermynJul 18, 2022, 4:32 PM
24 points
5 comments4 min readLW link

[Question] Why don’t quan­tiliz­ers also cut off the up­per end of the dis­tri­bu­tion?

Alex_AltairMay 15, 2023, 1:40 AM
25 points
2 comments1 min readLW link

Quan­tilizer ≡ Op­ti­mizer with a Bounded Amount of Output

itaibn0Nov 16, 2021, 1:03 AM
11 points
4 comments2 min readLW link

He­donic Loops and Tam­ing RL

berenJul 19, 2023, 3:12 PM
20 points
14 comments9 min readLW link

The mur­der­ous short­cut: a toy model of in­stru­men­tal convergence

Thomas KwaOct 2, 2024, 6:48 AM
37 points
0 comments2 min readLW link

[Aspira­tion-based de­signs] 1. In­for­mal in­tro­duc­tion

Apr 28, 2024, 1:00 PM
44 points
4 comments8 min readLW link

Stable Poin­t­ers to Value III: Re­cur­sive Quantilization

abramdemskiJul 21, 2018, 8:06 AM
20 points
4 comments4 min readLW link

How to safely use an optimizer

Simon FischerMar 28, 2024, 4:11 PM
47 points
21 comments7 min readLW link

AISC pro­ject: Satis­fIA – AI that satis­fies with­out over­do­ing it

Jobst HeitzigNov 11, 2023, 6:22 PM
12 points
0 comments1 min readLW link
(docs.google.com)

AISC team re­port: Soft-op­ti­miza­tion, Bayes and Goodhart

Jun 27, 2023, 6:05 AM
38 points
2 comments15 min readLW link

Re­cur­sive Quan­tiliz­ers II

abramdemskiDec 2, 2020, 3:26 PM
30 points
15 comments13 min readLW link

[Aspira­tion-based de­signs] 2. For­mal frame­work, ba­sic algorithm

Apr 28, 2024, 1:02 PM
17 points
2 comments16 min readLW link
No comments.