David Scott Krueger (formerly: capybaralet) comments on On the purposes of decision theory research

David Scott Krueger (formerly: capybaralet) 22 Aug 2019 18:47 UTC
LW: 1 AF: 1
AF
What’s the best description of what you mean by “metaphilosophy” you can point me to? I think I have a pretty good sense of it, but it seems worthwhile to be as rigorous / formal / descriptive / etc. as possible.
- Wei Dai 23 Aug 2019 0:19 UTC
  LW: 3 AF: 2
  AF Parent
  This description from Three Approaches to “Friendliness” perhaps gives the best idea of what I mean by “metaphilosophy”:
  
  Black-Box Metaphilosophical AI—Program the AI to use the minds of one or more human philosophers as a black box to help it solve philosophical problems, without the AI builders understanding what “doing philosophy” actually is.
  
  White-Box Metaphilosophical AI—Understand the nature of philosophy well enough to specify “doing philosophy” as an algorithm and code it into the AI.
  
  (One could also imagine approaches that are somewhere in between these two, where for example AI designers have some partial understanding of what “doing philosophy” is, and programs the AI to learn from human philosophers based on this partial understanding.)
  
  For more of my thoughts on this topic, see Some Thoughts on Metaphilosophy and the posts that it links to.