The fundamental problem is that any effective AI alignment technique is also a censorship technique, and thus you can’t advance AI alignment very much without also allowing people to censor an AI effectively, because a lot of alignment work aims to make AIs censored in particular ways.
I disagree with the use of “any”. In principle, an effective alignment technique could create an AI that isn’t censored but does have certain values/preferences over the world. You could call that censorship, but that doesn’t seem like the right or common usage. I agree that, in practice, many or most things currently purporting to be effective alignment techniques fit the word better, though.
I admit this is possible, so my “any” is almost certainly overconfident (which matters a little), though I believe a lot of common methods that do work for alignment also let you censor an AI.
If you take Eliezer’s early writing, the idea is that AI should be aligned with Coherent Extrapolated Volition. That’s a different goal from aligning AI with the views of credentialed experts or the leadership of AI companies.
“How do you regulate AI companies so that they aren’t enforcing Californian values on the rest of the United States and the world?” is an alignment question. If you have a good answer to that question, it is easier to convince someone to support regulating AI companies when their worry is that those companies, having already enforced Californian values through the censorship-industrial complex, will do the same thing with AI.
If you ignore the alignment questions that people like David Sacks care about, it’s hard to convince them that you are sincere about the other alignment questions.
A crux here is that I basically don’t think alignment strategies of the “Coherent Extrapolated Volition of humanity” type work, and I also think it is irrelevant that we can’t align an AI to the CEV of humanity.