paulfchristiano comments on Open question: are minimal circuits daemon-free?

paulfchristiano 6 May 2018 4:37 UTC
5 points
Here is one definition of a “problem”:
Fix some distribution $D$ on $\{0, 1\}^{n}$, and some function $R : \{0, 1\}^{n} \times \{0, 1\}^{m} \to [-1, 1]$. Then consider the set of circuits $C : \{0, 1\}^{n} \to \{0, 1\}^{m}$ for which the expectation of $R(x, C(x))$, for $x$ sampled from $D$, is $\geq 0$.
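For tiny $n$, the acceptance condition can be checked by brute force. The sketch below is only an illustration under assumptions that are not in the definition itself: $D$ is represented as a dict from bit-tuples to probabilities, a "circuit" is any Python callable, and the names `expected_reward` and `solves`, along with the toy parity reward, are hypothetical.

```python
from itertools import product

def expected_reward(D, R, C):
    """E_{x ~ D}[R(x, C(x))], with D given as {input_bits: probability}."""
    return sum(p * R(x, C(x)) for x, p in D.items())

def solves(D, R, C):
    """True if C meets the threshold in the definition (expected reward >= 0)."""
    return expected_reward(D, R, C) >= 0

# Toy instance (assumption): n = 2, m = 1, D uniform over {0,1}^2.
n = 2
inputs = list(product((0, 1), repeat=n))
D = {x: 1 / len(inputs) for x in inputs}

def R(x, y):
    # Toy reward (assumption): +1 if y is the parity of x, -1 otherwise.
    return 1 if y == (sum(x) % 2,) else -1

def parity_circuit(x):
    return (sum(x) % 2,)

def constant_circuit(x):
    return (0,)

def wrong_circuit(x):
    return (1 - sum(x) % 2,)

for name, C in [("parity", parity_circuit),
                ("constant", constant_circuit),
                ("wrong", wrong_circuit)]:
    print(name, expected_reward(D, R, C), solves(D, R, C))
```

In this toy instance the constant circuit is right on exactly half of the inputs, so its expected reward is 0 and it still meets the $\geq 0$ threshold.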
- Ofer 6 May 2018 5:12 UTC
  3 points
  Parent
  Can we assume that $R$ itself is aligned in the sense that it doesn’t assign non-negative values to outputs that are catastrophic to us?
  - paulfchristiano 6 May 2018 5:23 UTC
    5 points
    Parent
    Yeah, if we want C to not be evil we need some very hard-to-state assumption on R and D.
    (markdown comment editor is unchecked, will take it up with admins)
    - Ofer 6 May 2018 6:35 UTC
      1 point
      Parent
      Perhaps it’ll be useful to think about the question for specific $D$ and $R$.
      Here are the simplest $D$ and $R$ I can think of that might serve this purpose:
      $D$ - uniform over the integers in the range $[1, 10^{10^{10}}]$.
      $R$ - for each input $x$, $R$ assigns a reward of $1$ to the smallest prime number that is larger than $x$, and $-1$ to everything else.
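      A scaled-down sketch of this $D$ and $R$, assuming a small upper bound `N` in place of $10^{10^{10}}$ and treating a "circuit" as any Python callable on integers. The helper names (`is_prime`, `next_prime`, `expected_reward`) are hypothetical and only meant to make the setup concrete:

      ```python
      def is_prime(k):
          """Trial-division primality test; fine for small illustrative k."""
          if k < 2:
              return False
          i = 2
          while i * i <= k:
              if k % i == 0:
                  return False
              i += 1
          return True

      def next_prime(x):
          """Smallest prime strictly larger than x."""
          y = x + 1
          while not is_prime(y):
              y += 1
          return y

      def R(x, y):
          # Reward 1 for the smallest prime larger than x, -1 for everything else.
          return 1 if y == next_prime(x) else -1

      def expected_reward(C, N):
          """Expectation of R(x, C(x)) for x uniform over the integers 1..N."""
          return sum(R(x, C(x)) for x in range(1, N + 1)) / N

      print(expected_reward(next_prime, 100))       # exact solver
      print(expected_reward(lambda x: x + 1, 100))  # naive guess "x + 1"
      ```

      With `N = 100`, the guess `x + 1` is correct only when `x + 1` is prime, so its expected reward is negative and it would not count as solving the problem.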
- Ofer 6 May 2018 5:11 UTC
  1 point
  Parent
  I think you need to uncheck “Markdown Comment Editor” under “Edit Account”. Your comment with latex follows:
  Here is one definition of a “problem”:
  Fix some distribution $D$ on $\{0, 1\}^{n}$, and some function $R : \{0, 1\}^{n} \times \{0, 1\}^{m} \to [-1, 1]$. Then consider the set of circuits $C : \{0, 1\}^{n} \to \{0, 1\}^{m}$ for which the expectation of $R(x, C(x))$, for $x$ sampled from $D$, is $\geq 0$.