As I understand it, Eliezer’s concept of precision involves trying to find formalizations that are provably unique or optimal in some sense, not just formalizations that work. For example, Bayesianism is a unique solution under the conditions of Cox’s theorem. One of Pei’s papers points out flaws in Bayesianism and proposes an alternative approach, but without any proof of uniqueness. I see that as an avoidable mistake.
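For concreteness, the uniqueness claim behind "Bayesianism is a unique solution" is roughly the following — a compressed, informal rendering of Cox's theorem, not the full technical statement:

```latex
% Informal statement of the uniqueness result being referred to.
% Cox's theorem: if plausibility pl(A | X) is a single real number,
% consistent with Boolean logic, and the plausibility of (A and B)
% depends only on pl(B | X) and pl(A | B, X), then some monotone
% rescaling p of pl must obey the ordinary probability rules:
\begin{align}
  p(A \mid X) + p(\lnot A \mid X) &= 1, \\
  p(A \land B \mid X) &= p(A \mid B \land X)\, p(B \mid X).
\end{align}
% Any calculus that disagrees with these rules must violate at least
% one of the desiderata -- that is the sense of "provably unique"
% being contrasted with Pei's approach.
```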
I guess Pei’s intuition is that a proof of uniqueness or optimality under unrealistic assumptions is of little practical value, and doing such proofs under realistic assumptions is unfeasible compared to the approach he is taking.
ETA: When you write most kinds of software, you don’t first prove that your design is optimal or unique, but just start with something that you intuitively think would work, and then refine it by trial and error. Why shouldn’t this work for AGI?
ETA 2: In case it wasn’t clear, I’m not advocating that we build AGIs by trial and error, but just trying to explain what Pei is probably thinking, and why cousin_it’s link isn’t likely to be convincing for him.
If one isn’t concerned about the AGI’s ability to either (a) subvert, right out of the box, the testing mechanisms being applied to it, or neutralize whatever mechanisms are in place to deal with it if it “fails” those tests, or (b) self-improve rapidly enough to reach that state before those testing or failure-handling mechanisms apply, then sure, a sufficiently well-designed test harness around some plausible-but-not-guaranteed algorithms will work fine, as it does for most software.
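To make the “test harness” idea concrete, here is a minimal sketch; all the names are purely illustrative (not anyone’s actual proposal), and it assumes exactly what the previous paragraph flags — that the candidate cannot touch the harness itself:

```python
# Hypothetical sketch of a test harness around a plausible-but-not-guaranteed
# algorithm. Names are illustrative, not anyone's actual proposal.

from dataclasses import dataclass
from typing import Callable, List


@dataclass
class TestResult:
    name: str
    passed: bool
    detail: str = ""


def evaluate(candidate: Callable, tests: List[Callable[[Callable], TestResult]]) -> List[TestResult]:
    """Run every test against the candidate algorithm and collect the results."""
    return [test(candidate) for test in tests]


def promote_if_safe(candidate: Callable, tests: List[Callable[[Callable], TestResult]]) -> bool:
    """Promote the candidate only if every test passes; otherwise refine and retry.

    This is the ordinary trial-and-error loop from software engineering.
    Note what it quietly assumes: that a failing candidate cannot subvert or
    neutralize the harness, and cannot self-improve out of its reach.
    """
    return all(result.passed for result in evaluate(candidate, tests))
```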
Of course, if one is concerned about an AGI’s ability to do either of those things, one may not wish to rely on such a test harness.
It seems to follow from this that quantifying which kinds of algorithms can do either of those things, and determining whether a particular algorithm falls into that set before implementing it, might allow AGI developers to do trial-and-error work on algorithms that provably don’t meet that standard. That would be one way of making measurable progress without arousing the fears of those who consider FOOMing algorithms a plausible existential risk.
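As a hypothetical sketch of what that gate would have to look like — the function names are mine, and the key predicate is precisely the thing nobody currently knows how to implement:

```python
# Hypothetical interface only. `provably_below_threshold` is the open
# research problem, not an existing capability; the point is just the
# shape of the gate that would make trial-and-error work uncontroversial.

def provably_below_threshold(algorithm_spec: str) -> bool:
    """Return True only if there is a proof that any implementation of this
    spec can neither subvert its test harness nor self-improve quickly
    enough to gain that ability."""
    raise NotImplementedError("this is the open problem")


def cleared_for_trial_and_error(algorithm_spec: str) -> bool:
    """Gate empirical AGI development on the (currently unavailable) proof."""
    return provably_below_threshold(algorithm_spec)
```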
Of course, that doesn’t have the “we get it right and then everything is suddenly better” aspect of successfully building a FOOMing FAI… it’s just research and development work, the same sort of incremental collective process that has resulted in, well, pretty much all human progress to date.