Nice graphic!
What stops e.g. “QACI(expensive_computation())” from being an optimization process which ends up trying to “hack its way out” into the real QACI?
nothing, fundamentally; the user has to be careful about what computation they invoke.
That… seems like a big part of what having “solved alignment” would mean, given that you have AGI-level optimization aimed (indirectly, via a counterfactual) at evaluating this (IIUC).
one solution to this problem is to simply never use that capability (running expensive computations) at all, or to not use it until the iterated counterfactual researchers have developed proofs that any expensive computation they run is safe, or until they have very slowly and carefully built dath-ilan-style corrigible aligned AGI. a rough sketch of what that default-refuse policy could look like is below.
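as a minimal illustration of that policy, here's a sketch of a gate that refuses to run any expensive computation unless a machine-checkable safety proof has been verified first. all names here (`SafetyProof`, `verify_safety_proof`, `qaci_invoke`) are hypothetical, not part of any actual QACI implementation:

```python
# A minimal sketch of the "never run unproven computations" policy described
# above. Everything here is illustrative, assuming some future proof system
# exists; no such checker is implemented today.

from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class SafetyProof:
    """A machine-checkable certificate that a computation is safe to run."""
    claim: str
    certificate: bytes


def verify_safety_proof(computation: Callable[[], object],
                        proof: Optional[SafetyProof]) -> bool:
    """Stub: a real checker would verify the certificate against the
    computation's code. Conservatively rejects everything for now."""
    return False


def qaci_invoke(computation: Callable[[], object],
                proof: Optional[SafetyProof] = None) -> object:
    """Run `computation` only if its safety proof checks out; otherwise
    refuse, implementing the 'simply never use that capability' default."""
    if not verify_safety_proof(computation, proof):
        raise PermissionError("no verified safety proof; refusing to run")
    return computation()
```

since `verify_safety_proof` rejects everything, this defaults to the first option above (never running expensive computations at all); swapping in a real proof checker would implement the second.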