Wei Dai comments on Alignment Newsletter #37

Wei Dai 17 Dec 2018 22:00 UTC
LW: 2 AF: 1

One particular way that “aligned” AI systems could make things work is if they accidentally “corrupt” our values

Did you mean “worse” instead of “work” here?

these should be taken as disagreements on how to solve these problems, not a disagreement that the problems exist.

I’m definitely not too attached to my proposed angles of attack either, and mainly wanted to give some ideas as an existence proof that there are things that can be done from a technical perspective.

I’m not sure why we’re focusing on value corruption in particular. [...] I don’t have a great answer to the problem of competing aligned superintelligent AI systems.

I thought I gave pretty reasonable answers at https://www.lesswrong.com/posts/HTgakSs6JpnogD6c2/two-neglected-problems-in-human-ai-safety#dykZxAXGr6sk7bh4e. Do you disagree with what I said?
- Rohin Shah 21 Dec 2018 12:40 UTC
  LW: 2 AF: 1
  Did you mean “worse” instead of “work” here?
  Yes, fixed.
  I thought I gave pretty reasonable answers at https://www.lesswrong.com/posts/HTgakSs6JpnogD6c2/two-neglected-problems-in-human-ai-safety#dykZxAXGr6sk7bh4e. Do you disagree with what I said?
  No, I just didn’t get to that comment by the time the newsletter was sent out. (I’ve been a bit busy, and iirc that comment was either really close to or after the newsletter release time.)
  - Wei Dai 21 Dec 2018 17:48 UTC
    2 points
    
    No, I just didn’t get to that comment by the time the newsletter was sent out.
    
    Ah, ok. Assuming you’ve changed your mind on that topic or had your questions satisfactorily answered, I wonder if there’s a way to update your newsletter readers without being too awkward or taking up too much time or space in the newsletter. (I’d guess that most of your readers aren’t following the discussions here.) Is that something you’ve thought about before?
    - Rohin Shah 23 Dec 2018 5:46 UTC
      4 points
      Yeah, I do have a way—adding it to a subsequent newsletter after the highlights section. You’ll see that this week.