Charlie Steiner comments on Eight Strategies for Tackling the Hard Part of the Alignment Problem

Charlie Steiner 10 Jul 2023 15:40 UTC
LW: 3 AF: 2
Thanks, this was interesting.

I’d rather say that some paths to alignment are about lacking certain capabilities.

If you ultimately want an AI to definitely have some set of capabilities and definitely lack some other set, you can get there by any combination of addition and subtraction (in the context of current AI, that's like working with a growing blob of capabilities). But if some of the capabilities we want are pretty specialized (like ones that relate to precisely interpreting our preferences), it might be faster to add or accelerate them somehow, rather than waiting for capabilities to grow past them and then pruning back everything that's not what we want.
- scasper 12 Jul 2023 15:44 UTC
  LW: 1 AF: 1
  I think this is a good point, thanks.