Unfortunately (fortunately?) I don’t feel like I have access to any secret truths. Most idiosyncratic things I believe are pretty tentative, and I hang out with a lot of folks who are pretty open to the kinds of weird ideas that might have ended up feeling like Paul-specific secret truths if I hung out with a more normal crowd.
It feels like my biggest disagreement with the people around me is something like: to what extent is it possible to develop an algorithm that really looks, on paper, like it should just work for aligning powerful ML systems? I’m at like 50-50, and I think the consensus estimate of people in my community is more like “Uh, sure doesn’t sound like that’s going to happen, but we’re still excited for you to try.”