I think that humans are sorta “unaligned”, in the sense of being vulnerable to Goodhart’s Law.
A lot of moral philosophy is something like:
Gather our odd grab bag of heterogeneous, inconsistent moral intuitions
Try to find a coherent “theory” that encapsulates and generalizes these moral intuitions
Work through the consequences of the theory and modify it until you are willing to bite all the implied bullets.
The resulting ethical system often ends up having some super bizarre implications and usually requires specifying “free variables” that are (arguably) independent of our original moral intuitions.
In fact, I imagine that optimizing the universe according to my moral framework looks quite Goodhartian to many people.
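To make the Goodhart analogy a bit more concrete, here's a toy numerical sketch (purely illustrative; true_value and proxy_value are made-up functions, not a model of anything real): a tidy "theory" fit to everyday cases matches the underlying values well in that range, but optimizing the theory as hard as possible lands far outside it, where the terms the theory never captured dominate.

```python
# Toy Goodhart sketch: a proxy "theory" that fits everyday cases well,
# optimized hard, ends up somewhere the real value function collapses.

def true_value(x, y):
    """What we actually care about (not directly visible to the optimizer)."""
    return x + y - 0.1 * x * x

def proxy_value(x, y):
    """A tidy linear 'theory' fit to everyday cases (x, y roughly in [0, 1])."""
    return x + y

# Within the everyday range, the theory and the real values agree closely...
for x, y in [(0.2, 0.3), (0.8, 0.5)]:
    print((x, y), round(true_value(x, y), 2), round(proxy_value(x, y), 2))

# ...but optimizing the theory hard pushes us far outside that range,
# where the term the theory never captured dominates.
candidates = [(x / 10, 0.5) for x in range(200)]
best = max(candidates, key=lambda p: proxy_value(*p))
print("theory-optimal point:", best, "true value there:", round(true_value(*best), 2))
```

The analogy: the everyday range is our moral intuitions; the quadratic correction stands in for whatever those intuitions were silently tracking that the tidy theory left out.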
Some examples of implications of my current moral framework:
I think that (a) personhood is preserved when a person is moved into a simulation, and (b) it’s easier to control what’s happening in a simulation, and consequently easier to fulfill a person’s preferences. Therefore, it’d be ideal to upload as many people as possible. In fact, I’m not sure whether this should even be optional, given how horrendously inefficient the ratio of organic human atoms to “utilons” is.
I value future lives, so I think we have an ethical responsibility to create as many happy beings as we can, even at some cost to current beings.
I think that some beings are fundamentally capable of being happier than other beings. So, all else equal, we should prefer to create happier people. I think that parents should be forced to adhere to this when having kids.
I think that we should modify all animals so we can guarantee that they have zero consciousness, or otherwise guarantee that they don’t suffer (how do we deal with lions’ natural tendency to brutally kill gazelles?)
I think that people ought to do some limited amount of wire-heading (broadly, increasing happiness independent of reality).
Complete self-determination/subjective “free will” is both impossible and undesirable. A superintelligent AI (SAI) will be able to subtly, but meaningfully, guide humans down chosen paths because it can robustly predict the differential impact of seemingly minor conversational and environmental variations.
I’m sure there are many other examples.
I don’t think that my conclusions are wrong per se, but… my ethical system has some alien and potentially degenerate implications when optimized hard.
It’s also worth noting that although I stated those examples confidently (for rhetorical purposes), my stances on many of them depend on very specific details of my philosophy and have toggled back and forth many times.
No real call to action here, just some observations. Existing human ethical systems might look as exotic to the average person as some conclusions drawn by a kinda-aligned SAI.