RSS

A.H.

Karma: 263

Should we max­i­mize the Geo­met­ric Ex­pec­ta­tion of Utility?

A.H.Apr 17, 2024, 10:37 AM
5 points
17 comments9 min readLW link

Nash Bar­gain­ing be­tween Subagents doesn’t solve the Shut­down Problem

A.H.Jan 25, 2024, 10:47 AM
22 points
1 comment9 min readLW link

A Ped­a­gog­i­cal Guide to Corrigibility

A.H.Jan 17, 2024, 11:45 AM
6 points
3 comments16 min readLW link

A Land Tax For Britain

A.H.Jan 6, 2024, 3:52 PM
6 points
9 comments4 min readLW link

Will 2024 be very hot? Should we be wor­ried?

A.H.Dec 29, 2023, 11:22 AM
51 points
12 comments10 min readLW link

[Question] A Ques­tion about Cor­rigi­bil­ity (2015)

A.H.Nov 27, 2023, 12:05 PM
4 points
2 comments1 min readLW link

UK Govern­ment pub­lishes “Fron­tier AI: ca­pa­bil­ities and risks” Dis­cus­sion Paper

A.H.Oct 26, 2023, 1:55 PM
5 points
0 comments2 min readLW link
(www.gov.uk)

Op­ti­mized for Some­thing other than Win­ning or: How Cricket Re­sists Moloch and Good­hart’s Law

A.H.Jul 5, 2023, 12:33 PM
53 points
26 comments4 min readLW link

Align­ment as Func­tion Fitting

A.H.May 6, 2023, 11:38 AM
7 points
0 comments12 min readLW link

Is Con­struc­tor The­ory a use­ful tool for AI al­ign­ment?

A.H.Nov 29, 2022, 12:35 PM
11 points
8 comments26 min readLW link