(I obviously don’t speak for Ronny)
I’d guess this is kind of the within-model uncertainty: he had a model of “alignment” that said you needed to specify all 10,000 bits of human values, so the odds of doing this by default/at random were 2^-10000:1. But this doesn’t account for the uncertainty that the model itself is wrong, which would make the within-model uncertainty a rounding error.
According to this model there is effectively no chance of alignment by default, but this model could be wrong.
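The arithmetic behind “the within-model uncertainty is a rounding error” can be sketched numerically. The specific credences below are hypothetical stand-ins, not numbers from the exchange: an assumed 10% chance the model is wrong, and an assumed ignorance prior of 1/2 conditional on that.

```python
from fractions import Fraction

# Exact rational arithmetic, since 2^-10000 underflows an ordinary float.
p_within_model = Fraction(1, 2**10000)  # within-model chance of alignment by default
p_model_wrong = Fraction(1, 10)         # hypothetical: credence that the model is wrong
p_if_wrong = Fraction(1, 2)             # hypothetical: ignorance prior if the model is wrong

# Total credence mixes the two hypotheses.
p_total = (1 - p_model_wrong) * p_within_model + p_model_wrong * p_if_wrong

# The 2^-10000 term contributes nothing visible: the total is dominated
# entirely by the chance that the model is wrong.
print(float(p_total))  # ≈ 0.05
```

With any non-negligible credence that the model is wrong, the betting odds are set almost entirely by the outside-view terms, which is why the within-model number and the betting odds can legitimately diverge.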
If Ronny had said “there is one half-baked heuristic that claims that the probability is 2^-10000” then I would be sympathetic. That seems very different from what he said, though. In some sense my objection is precisely to people giving half-baked heuristics and intuitions an amount of decision-making weight far disproportionate to their quality, by calling them “models” and claiming that the resulting credences should be taken seriously.
I think that would be a more on-point objection to make on a single-author post, but this is a chat log between two people, optimized for communicating with each other, and as such it generally comes with fewer caveats and takes a bunch of implicit things for granted (this makes it well-suited for some kinds of communication, and not others). I like it in that it helps me get a much better sense of a bunch of underlying intuitions and hunches that are often hard to formalize and so rarely make it into posts, but I also think it is sometimes frustrating because it’s not optimized to be responded to.
I would take bets that Ronny’s position was always something closer to “I had this robust-seeming inside-view argument that claimed the probability was extremely low, though of course my outside view and different levels of uncertainty caused my betting odds to be quite different”.