RSS

Stuart_Armstrong

Karma: 17,970

Value ex­trap­o­la­tion, con­cept ex­trap­o­la­tion, model splintering

Stuart_ArmstrongMar 8, 2022, 10:50 PM
16 points
1 comment2 min readLW link

[Link] Aligned AI AMA

Stuart_ArmstrongMar 1, 2022, 12:01 PM
18 points
0 comments1 min readLW link

More GPT-3 and sym­bol grounding

Stuart_ArmstrongFeb 23, 2022, 6:30 PM
21 points
7 comments3 min readLW link

Differ­ent way clas­sifiers can be diverse

Stuart_ArmstrongJan 17, 2022, 4:30 PM
10 points
5 comments2 min readLW link

Value ex­trap­o­la­tion par­tially re­solves sym­bol grounding

Stuart_ArmstrongJan 12, 2022, 4:30 PM
24 points
10 comments1 min readLW link

How an alien the­ory of mind might be unlearnable

Stuart_ArmstrongJan 3, 2022, 11:16 AM
29 points
35 comments5 min readLW link

Find­ing the mul­ti­ple ground truths of CoinRun and image classification

Stuart_ArmstrongDec 8, 2021, 6:13 PM
15 points
4 comments2 min readLW link

Declus­ter­ing, reclus­ter­ing, and filling in thingspace

Stuart_ArmstrongDec 6, 2021, 8:53 PM
16 points
6 comments3 min readLW link

Are there al­ter­na­tive to solv­ing value trans­fer and ex­trap­o­la­tion?

Stuart_ArmstrongDec 6, 2021, 6:53 PM
20 points
8 comments5 min readLW link

$100/​$50 re­wards for good references

Stuart_ArmstrongDec 3, 2021, 4:55 PM
20 points
5 comments1 min readLW link

Mo­rally un­der­defined situ­a­tions can be deadly

Stuart_ArmstrongNov 22, 2021, 2:48 PM
17 points
8 comments2 min readLW link

Gen­eral al­ign­ment plus hu­man val­ues, or al­ign­ment via hu­man val­ues?

Stuart_ArmstrongOct 22, 2021, 10:11 AM
50 points
27 comments3 min readLW link

Beyond the hu­man train­ing dis­tri­bu­tion: would the AI CEO cre­ate al­most-ille­gal ted­dies?

Stuart_ArmstrongOct 18, 2021, 9:10 PM
36 points
2 comments3 min readLW link

Clas­si­cal sym­bol ground­ing and causal graphs

Stuart_ArmstrongOct 14, 2021, 6:04 PM
22 points
2 comments5 min readLW link

Prefer­ences from (real and hy­po­thet­i­cal) psy­chol­ogy papers

Stuart_ArmstrongOct 6, 2021, 9:06 AM
15 points
0 comments2 min readLW link

Force neu­ral nets to use mod­els, then de­tect these

Stuart_ArmstrongOct 5, 2021, 11:31 AM
17 points
8 comments2 min readLW link

AI learns be­trayal and how to avoid it

Stuart_ArmstrongSep 30, 2021, 9:39 AM
30 points
4 comments2 min readLW link

AI, learn to be con­ser­va­tive, then learn to be less so: re­duc­ing side-effects, learn­ing pre­served fea­tures, and go­ing be­yond conservatism

Stuart_ArmstrongSep 20, 2021, 11:56 AM
14 points
4 comments3 min readLW link

Sig­moids be­hav­ing badly: arXiv paper

Stuart_ArmstrongSep 20, 2021, 10:29 AM
24 points
1 comment1 min readLW link

Im­mo­bile AI makes a move: anti-wire­head­ing, on­tol­ogy change, and model splintering

Stuart_ArmstrongSep 17, 2021, 3:24 PM
32 points
3 comments2 min readLW link