Note that this doesn’t mean making the date range very narrow (confident), that’s unrelated.
Fair enough, but I was responding to a pair of tweets where you said:
Eliezer says that nobody knows much about AI timelines. But then keeps saying “I knew [development] would happen sooner than you guys thought”. Every time he does that, he’s conning people.
I know I’m using strong wording. But I’d say the same in any other domain.
He should create a public Metaculus profile. Place a bunch of forecasts.
If he beats the community by the landslide he claims, then I concede.
If he’s mediocre, then he was conning people.
‘It would be convenient if Eliezer would record his prediction on Metaculus, so we know with more precision how strong of an update to make when he publicly says “my median is well before 2050” and Metaculus later updates toward a nearer-term median’ is a totally fair request, but it doesn’t bear much resemblance to ‘if you record any prediction anywhere other than Metaculus (that doesn’t have similarly good tools for representing probability distributions), you’re a con artist’. Seems way too extreme.
Likewise, ‘prove that you’re better than Metaculus on a ton of forecasts or you’re a con artist’ seems like a wild response to ‘Metaculus was slower than me to update about a specific quantity in a single question’. So I’m trying to connect the dots, and I end up generating hypotheses like:
Maybe Jotto is annoyed that Eliezer is confident about hard takeoff, not just that he has a nearer timelines median than Metaculus. And maybe Jotto specifically thinks that there’s no way you can rationally be confident about hard takeoff unless you think you’re better than Metaculus at timing tons of random narrow AI things.
So then it follows that if you’re avoiding testing your mettle vs. Metaculus on a bunch of random narrow AI predictions, then you must not have any rational grounds for confidence in hard takeoff. And moreover this chain of reasoning is obvious, so Eliezer knows he has no grounds for confidence and is deliberately tricking us.
Or:
Maybe Jotto hears Eliezer criticize Paul for endorsing soft takeoff, and hears Eliezer criticize Metaculus for endorsing Ajeya-ish timelines, and Jotto concludes ‘ah, Eliezer must think he’s amazing at predicting AI-ish events in general; this should be easy to test, so since he’s avoiding publicly testing it, he must be trying to trick us’.
In principle you could have an Eliezer-model like that and think that Eliezer has lots of nonstandard beliefs about random AI topics that make him way too confident about things like hard takeoff, and yet that his distributions tend to be wide; but that seems like a pretty weird combination of views to me, so I assumed that you’d also think Eliezer has relatively narrow distributions about everything.
it feels like I’m mostly debating people who think they can predict when Tetlock’s findings don’t apply, and so reliably that it’s unnecessary to forecast properly nor transparently, and it seems like they don’t understand.
Have you read Inadequate Equilibria, or R:AZ? (Or my distinction between ‘rationality as prosthesis’ and ‘rationality as strength training’?)
I think there’s a good amount of overlap between MIRI- and CFAR-ish views of rationality and Tetlock-ish views, but I also don’t think of Tetlock’s tips as the be-all and end-all of learning things about the world, of doing science, etc., and I don’t see his findings as showing that we should give up on inside-view model-building, not-fully-explicit-and-quantified reasoning under uncertainty, or any of the suggestions in When (Not) To Use Probabilities.
(Nor do I think Tetlock would endorse the ‘no future-related knowledge generation except via Metaculus or prediction markets’ policy you seem to be proposing. Maybe if we surveyed them we’d find out that Tetlock thinks Metaculus is 25% cooler than Eliezer does, or something? It’s not obvious to me that it matters.)
Also, I think you said on Twitter that Eliezer’s a liar unless he generates some AI prediction that lets us easily falsify his views in the near future? Which seems to require that he have very narrow confidence intervals about very near-term events in AI.
So I continue to not understand what it is about the claims ‘the median on my AGI timeline is well before 2050’, ‘Metaculus updated away from 2050 after I publicly predicted it was well before 2050’, or ‘hard takeoff is true with very high probability’, that makes you think someone must have very narrow contra-mainstream distributions on near-term narrow-AI events or else they’re lying.
Some more misunderstanding:
‘if you record any prediction anywhere other than Metaculus (that doesn’t have similarly good tools for representing probability distributions), you’re a con artist’. Seems way too extreme.
No, I don’t mean that the distinguishing determinant of whether someone is a con artist is whether the prediction is recorded on Metaculus. Metaculus is mentioned in that tweet because, if you’re going to bother doing it, you might as well go all the way and show a distribution.
But even if he just posted a confidence interval on some site other than Metaculus, that would be a huge upgrade, because then anyone could add it to a spreadsheet of scorable forecasts and reconstruct it without too much effort.
‘if you record any prediction anywhere other than Metaculus (that doesn’t have similarly good tools for representing probability distributions), you’re a con artist’. Seems way too extreme.
No, that’s not what I’m saying. The main thing is that the predictions be scorable. But if someone is going to do it at all, then doing it on Metaculus just makes more sense: the administrative work is already taken care of, and there’s no risk of cherry-picking or omission.
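To make ‘scorable’ concrete, here’s a minimal sketch (my own illustration; the milestone, dates, and numbers are made up): once a question resolves, a posted confidence interval can be graded with the standard interval score, which rewards intervals that are both narrow and correct.

```python
# Minimal sketch (illustration only, not something proposed in this thread):
# grading a publicly posted confidence interval once the question resolves,
# using the standard interval score (lower is better).  The milestone, dates,
# and numbers are hypothetical.

def interval_score(lower: float, upper: float, outcome: float, alpha: float = 0.2) -> float:
    """Score a central (1 - alpha) interval [lower, upper] against the realized outcome."""
    score = upper - lower                         # width penalty: vaguer intervals cost more
    if outcome < lower:
        score += (2 / alpha) * (lower - outcome)  # penalty for missing on the low side
    elif outcome > upper:
        score += (2 / alpha) * (outcome - upper)  # penalty for missing on the high side
    return score

# Hypothetical example: three forecasters post 80% intervals for the year some
# milestone is reached, and the question resolves in 2031.
print(interval_score(2027, 2045, 2031))  # wide but correct      ->  18.0
print(interval_score(2029, 2035, 2031))  # narrower and correct  ->   6.0
print(interval_score(2040, 2060, 2031))  # confident and wrong   -> 110.0
```

The point is just that any recorded interval is gradeable from a spreadsheet; no special tooling is required.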
Also, from another reply you gave:
Also, I think you said on Twitter that Eliezer’s a liar unless he generates some AI prediction that lets us easily falsify his views in the near future? Which seems to require that he have very narrow confidence intervals about very near-term events in AI.
I never used the term “liar”. The thing he’s doing that I think is bad is more like what a pundit does, like the guy who calls recessions, a sort of epistemic conning. “Lying” is different, at least to me.
More importantly, no, he doesn’t necessarily need to have really narrow distributions, and I don’t know why you think this. Only if his distribution were squashed up close against the “Now” side of the chart would it be “narrower”. But if that’s what Eliezer thinks, if he’s saying himself that it’s earlier than x date, then the graph just looks a bit narrower and shifted to the left, and it simply reflects what he believes.
There’s nothing about how we score forecasters that requires him to have “very narrow” confidence intervals about very near-term events in AI in order to measure alpha. To help me understand, can you describe why you think this? Why don’t you think alpha would start being measurable with confidence intervals that are merely slightly narrower than the community’s and centered closer to the actual outcome?
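Here’s a minimal sketch of what I mean (my own illustration; every distribution, bias, and question count below is made up): a forecaster whose distributions are only slightly narrower than the community’s, and centered a bit closer to the truth, already accumulates a measurably better average log score over enough resolved questions.

```python
# Minimal sketch (illustration only; all numbers are made up): comparing the
# average log score of a "forecaster" against a "community" over many
# questions.  The forecaster is only slightly narrower and slightly
# better-centered, yet the edge shows up in the average score.
import math
import random

def normal_log_score(mu: float, sigma: float, outcome: float) -> float:
    """Log density of a Normal(mu, sigma) forecast at the realized outcome (higher is better)."""
    return -0.5 * math.log(2 * math.pi * sigma ** 2) - (outcome - mu) ** 2 / (2 * sigma ** 2)

random.seed(0)
n_questions = 200
true_mean, true_sd = 0.0, 10.0            # hypothetical process generating the outcomes

community  = {"center": 4.0, "sd": 12.0}  # somewhat off-center, somewhat wide
forecaster = {"center": 2.0, "sd": 10.0}  # only slightly better on both counts

edge = 0.0
for _ in range(n_questions):
    outcome = random.gauss(true_mean, true_sd)
    edge += (normal_log_score(forecaster["center"], forecaster["sd"], outcome)
             - normal_log_score(community["center"], community["sd"], outcome))

print(f"average log-score edge per question: {edge / n_questions:.3f}")  # positive = measurable alpha
```

Nothing about this requires “very narrow” intervals or very near-term questions; it only requires enough resolved questions to separate signal from noise.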
EDIT a week later: I have decided that several of your misunderstandings should be considered strawmanning, and I’ve switched from upvoting some of your comments here to downvoting them.