abramdemski comments on Re-Define Intent Alignment?

abramdemski 4 Aug 2021 18:16 UTC
LW: 3 AF: 3
AF
I would further add that looking for difficulties created by the simplification seems very intellectually productive. (Solving “embedded agency problems” seems to genuinely allow you to do new things, rather than just soothing philosophical worries.) But yeah, I would agree that if we’re defining mesa-objective anyway, we’re already in the business of assuming some agent/environment boundary.
- Edouard Harris 4 Aug 2021 18:37 UTC
  LW: 1 AF: 1
  AF Parent
  I would further add that looking for difficulties created by the simplification seems very intellectually productive.
  Yep, strongly agree. And a good first step to doing this is to actually build as robust a simplification as you can, and then see where it breaks. (Working on it.)