Gurkenglas comments on Approval-directed agents

Gurkenglas 24 Nov 2018 12:26 UTC
1 point
How does this differ from just running Hugh?
- paulfchristiano 24 Nov 2018 20:07 UTC
  5 points
  Parent
  Hugh is some human, Arthur is a cheap AI. For the obvious example today, compare:
  - Get mechanical turkers to label training images. Train an AI to predict the label they would assign. Use that AI to label images.
  - Use mechanical turkers to label images.
  The second one is orders of magnitude more expensive and higher latency.
  - Gurkenglas 27 Nov 2018 13:29 UTC
    1 point
    Parent
    The situation seems to me comparable to one where we upload Hugh and then let him do what he wants, such as optimizing himself by replacing parts of himself by machine learning predictors with correlating results. The hope here then sounds like that this is fine so long as we perform differential testing to limit accidental drift.