My recollection is that Habryka seriously considered hypotheses that involved worse and more coordinated behavior than reality, but that this is different from “this was his primary hypothesis that he gave the most probability mass to”. And then he did some empiricism and falsified the hypotheses and I’m glad those hypotheses were considered and investigated.
Here’s an example of him giving 20-25% to a hypothesis about conspiratorial behavior that I believe has turned out to be false.
Yep, that hypothesis seems mostly wrong, though I more feel like I received 1-2 bits of evidence against it. If the board had stabilized with Sam being fired, even given all I know, I would have still thought a merger with Anthropic to be like ~5%-10% likely.
My recollection is that Habryka seriously considered hypotheses that involved worse and more coordinated behavior than reality, but that this is different from “this was his primary hypothesis that he gave the most probability mass to”. And then he did some empiricism and falsified the hypotheses and I’m glad those hypotheses were considered and investigated.
Here’s an example of him giving 20-25% to a hypothesis about conspiratorial behavior that I believe has turned out to be false.
Yep, that hypothesis seems mostly wrong, though I more feel like I received 1-2 bits of evidence against it. If the board had stabilized with Sam being fired, even given all I know, I would have still thought a merger with Anthropic to be like ~5%-10% likely.