RSS

Stuart_Armstrong

Karma: 17,977

Re­ward splin­ter­ing as re­verse of interpretability

Stuart_ArmstrongAug 31, 2021, 10:27 PM
10 points
0 comments1 min readLW link

What are bi­ases, any­way? Mul­ti­ple type signatures

Stuart_ArmstrongAug 31, 2021, 9:16 PM
11 points
0 comments3 min readLW link

What does GPT-3 un­der­stand? Sym­bol ground­ing and Chi­nese rooms

Stuart_ArmstrongAug 3, 2021, 1:14 PM
40 points
15 comments12 min readLW link

Re­ward splin­ter­ing for AI design

Stuart_ArmstrongJul 21, 2021, 4:13 PM
30 points
1 comment8 min readLW link

Bayesi­anism ver­sus con­ser­vatism ver­sus Goodhart

Stuart_ArmstrongJul 16, 2021, 11:39 PM
15 points
2 comments6 min readLW link

Un­der­ly­ing model of an im­perfect morphism

Stuart_ArmstrongJul 16, 2021, 1:13 PM
13 points
0 comments3 min readLW link

An­thropic de­ci­sion the­ory for self-lo­cat­ing beliefs

Stuart_ArmstrongJul 12, 2021, 2:11 PM
17 points
2 comments1 min readLW link

Gen­er­al­ised mod­els: im­perfect mor­phisms and in­for­ma­tional entropy

Stuart_ArmstrongJul 9, 2021, 5:35 PM
9 points
0 comments8 min readLW link

Prac­ti­cal an­throp­ics summary

Stuart_ArmstrongJul 8, 2021, 3:10 PM
15 points
3 comments1 min readLW link

An­throp­ics and Fermi: grabby, visi­ble, zoo-keep­ing, and early aliens

Stuart_ArmstrongJul 8, 2021, 3:07 PM
15 points
1 comment4 min readLW link

The SIA pop­u­la­tion up­date can be sur­pris­ingly small

Stuart_ArmstrongJul 8, 2021, 10:45 AM
48 points
12 comments10 min readLW link

An­throp­ics in in­finite universes

Stuart_ArmstrongJul 8, 2021, 6:56 AM
13 points
4 comments1 min readLW link

Non-poi­sonous cake: an­thropic up­dates are normal

Stuart_ArmstrongJun 18, 2021, 2:51 PM
28 points
11 comments2 min readLW link

The re­verse Good­hart problem

Stuart_ArmstrongJun 8, 2021, 3:48 PM
20 points
22 comments1 min readLW link

Danger­ous op­ti­mi­sa­tion in­cludes var­i­ance minimisation

Stuart_ArmstrongJun 8, 2021, 11:34 AM
36 points
5 comments2 min readLW link

The un­der­ly­ing model of a morphism

Stuart_ArmstrongJun 4, 2021, 10:29 PM
10 points
0 comments5 min readLW link

SIA is ba­si­cally just Bayesian up­dat­ing on existence

Stuart_ArmstrongJun 4, 2021, 1:17 PM
26 points
8 comments2 min readLW link

The blue-min­imis­ing robot and model splintering

Stuart_ArmstrongMay 28, 2021, 3:09 PM
13 points
5 comments3 min readLW link1 review

Hu­man pri­ors, fea­tures and mod­els, lan­guages, and Sol­monoff induction

Stuart_ArmstrongMay 10, 2021, 10:55 AM
16 points
2 comments4 min readLW link

An­throp­ics: differ­ent prob­a­bil­ities, differ­ent questions

Stuart_ArmstrongMay 6, 2021, 1:14 PM
25 points
10 comments15 min readLW link