Some Thoughts on AI Alignment: Using AI to Control AI

eigenvalue21 Jun 2024 17:44 UTC

1 point

Recent news has caused me to think through some questions about AI alignment, so I collected my thoughts here. While I’m sure a lot of this stuff isn’t new, I haven’t seen all these ideas presented together in one place. I think that some of the approaches that are used in designing decentralized systems can also be useful in constructing alignment systems, so I’ve tried to do that here. Anyway, I welcome feedback on my ideas.

eigenvalue21 Jun 2024 17:44 UTC

1 point

1 comment1 min readLW link

AI AI-Assisted Alignment

Raemon 21 Jun 2024 17:44 UTC
2 points
2
FYI, these sorts of posts generally get more readership/responses if they copy over the text of the post here.