A.H.

Karma: 263

Should we maximize the Geometric Expectation of Utility?

A.H.Apr 17, 2024, 10:37 AM

5 points

17 comments9 min readLW link

Nash Bargaining between Subagents doesn’t solve the Shutdown Problem

A.H.Jan 25, 2024, 10:47 AM

22 points

1 comment9 min readLW link

A Pedagogical Guide to Corrigibility

A.H.Jan 17, 2024, 11:45 AM

6 points

3 comments16 min readLW link

A Land Tax For Britain

A.H.Jan 6, 2024, 3:52 PM

6 points

9 comments4 min readLW link

Will 2024 be very hot? Should we be worried?

A.H.Dec 29, 2023, 11:22 AM

51 points

12 comments10 min readLW link

[Question] A Question about Corrigibility (2015)

A.H.Nov 27, 2023, 12:05 PM

4 points

2 comments1 min readLW link

UK Government publishes “Frontier AI: capabilities and risks” Discussion Paper

A.H.Oct 26, 2023, 1:55 PM

5 points

0 comments2 min readLW link

(www.gov.uk)

Optimized for Something other than Winning or: How Cricket Resists Moloch and Goodhart’s Law

A.H.Jul 5, 2023, 12:33 PM

53 points

26 comments4 min readLW link

Alignment as Function Fitting

A.H.May 6, 2023, 11:38 AM

7 points

0 comments12 min readLW link

Is Constructor Theory a useful tool for AI alignment?

A.H.Nov 29, 2022, 12:35 PM

11 points

8 comments26 min readLW link