a very extreme failure of natural abstraction, such that human concepts cannot be faithfully and robustly translated into the system’s internal ontology at all.
This hypothetical suggests to me that the AI might not be very good at e.g. manipulating humans in an AI-box experiment, since it just doesn’t understand how humans think all that well.
I wonder what MIRI thinks about this 2013 post (“The genie knows, but doesn’t care”) nowadays. The argument seems less persuasive now that AIs appear to learn representations first and are only later given agency by the devs. I actually suspect your model of Eliezer is wrong, because it seems to imply he believes “the AI actually just doesn’t know”, and it’s a little hard for me to imagine him saying that.
Alternatively, maybe the “faithfully and robustly” bit is supposed to be very load-bearing. However, it’s already the case that humans learn idiosyncratic, opaque neural representations of our values from sense data—yet we’re able to come into alignment with each other, without a bunch of heavy-duty interpretability or robustness techniques.
The genie argument was flawed at the time, for reasons pointed out at the time, and ignored at the time.
Ignored or downvoted. Perhaps someone could make a postmortem analysis of those comment threads today.