I loved this book. The most surprising thing to me was the answer that people who were there in the heyday give when asked what made Bell Labs so successful: They always say it was the problem, i.e. having an entire organization oriented towards the goal of “make communication reliable and practical between any two places on earth”. When Shannon left the Labs for MIT, people who were there immediately predicted he wouldn’t do anything of the same significance because he’d lose that “compass”. Shannon was obviously a genius, and he accomplished more afterward than most people ever do, but still nothing as significant as what he did while at the Labs.
I thought this was fantastic, very thought-provoking. One possibly easy improvement that I think would be great: links to a few posts that you think have used this strategy with success.
Thanks, I clarified the noise issue. Regarding factor analysis, could you check if I understand everything correctly? Here’s what I think is the situation:
We can write a factor analysis model (with a single factor) as

$x = wz + \epsilon$

where:

$x$ is observed data
$z$ is a random latent variable (with unit variance, say)
$w$ is some vector (a parameter)
$\epsilon$ is a random noise variable
$\Sigma$ is the covariance of the noise (a parameter)

It always holds (assuming $z$ and $\epsilon$ are independent) that

$\mathrm{Cov}[x] = ww^\top + \Sigma.$

In the simplest variant of factor analysis (in the current post) we use $\Sigma = \sigma^2 I$, in which case you get that

$\mathrm{Cov}[x] = ww^\top + \sigma^2 I.$

You can check if this model fits by (1) checking that $x$ is Normal and (2) checking if the covariance of $x$ can be decomposed as in the above equation. (Which is equivalent to $\mathrm{Cov}[x]$ having all singular values the same except one.)

The next slightly-less-simple variant of factor analysis (which I think you’re suggesting) would be to use $\Sigma = \mathrm{diag}(s)$, where $s$ is a vector, in which case you get that

$\mathrm{Cov}[x] = ww^\top + \mathrm{diag}(s).$

You can again check if this model fits by (1) checking that $x$ is Normal and (2) checking if the covariance of $x$ can be decomposed as in the above equation. (The difference is, now this doesn’t reduce to some simple singular value condition.)
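As a sanity check on my own understanding, here’s a tiny simulation of the isotropic variant (the specific numbers are arbitrary, just for illustration):

import numpy as np

rng = np.random.default_rng(0)
d, n = 5, 100_000
w = rng.normal(size=d)                 # the loading vector (a parameter)
sigma = 0.5                            # isotropic noise scale

z = rng.normal(size=n)                 # latent variable, unit variance
eps = sigma * rng.normal(size=(d, n))  # isotropic Normal noise
x = np.outer(w, z) + eps               # the model: x = w z + eps

# Cov[x] should be close to w w^T + sigma^2 I, i.e. all eigenvalues
# equal to sigma^2 except one equal to sigma^2 + ||w||^2.
C = np.cov(x)
print(np.sort(np.linalg.eigvalsh(C)))  # four values near 0.25, one larger
print(sigma**2, sigma**2 + w @ w)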
Do I have all that right?
Thanks for pointing out those papers, which I agree can get at issues that simple correlations can’t. Still, to avoid scope-creep, I’ve taken the less courageous approach of (1) mentioning that the “breadth” of the effects of genes is an active research topic and (2) editing the original paragraph you linked to to be more modest, talking about “does the above data imply” rather than “is it true that”. (I’d rather avoid directly addressing 3 and 4 since I think that doing those claims justice would require more work than I can put in here.) Anyway, thanks again for your comments, it’s useful for me to think of this spectrum of different “notions of g”.
Thanks, very clear! I guess the position I want to take is just that the data in the post gives reasonable evidence for g being at least the convenient summary statistic in 2 (and doesn’t preclude 3 or 4).
What I was really trying to get at in the original quote is that some people seem to consider this to be the canonical position on g:
Factor analysis provides rigorous statistical proof that there is some single underlying event that produces all the correlations between mental tests.
There are lots of articles that (while not explicitly stating the above position) refute it at length, and get passed around as proof that g is a myth. It’s certainly true that position 5 is false (in multiple ways), but I just wanted to say that this doesn’t mean anything for the evidence we have for 2.
Can I check if I understand your point correctly? I suggested we know that g has many causes since so many genes are relevant, and thus if you opened up a brain, you wouldn’t be able to “find” g in any particular place. It’s the product of a whole bunch of different genes, each of which is just coding for some protein, and they all interact in complex ways. If I understand you correctly, you’re pointing out that there could be a “causal bottleneck” of sorts. For example, maybe all the different genes have complex effects, but all that really matters is how they affect neuronal calcium channel efficiency or something. Thus, if you opened up a brain, you could just check how efficient the calcium channels are and you’re done. Is that right?
If this is right, I do agree that I seem to be over-claiming a bit here. There’s nothing that precludes the possibility of a “bottleneck” as far as I know (though it seems sorta implausible in my not-at-all-informed opinion).
I used python/matplotlib. The basic idea is to create a 3d plot like so:
fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')
Then you can add dots with something like this:
ax.scatter(X, Y, Z, alpha=.5, s=20, color='navy', marker='o', linewidth=0)
Then you save it to a movie with something like this:
def update(i, fig, ax):
    ax.view_init(elev=20., azim=i)
    return fig, ax

frames = np.arange(0, 360, 1)
anim = FuncAnimation(fig, update, frames=frames, repeat=True, fargs=(fig, ax))
writer = 'ffmpeg'
anim.save(fname, dpi=80, writer=writer, fps=30)
I’m sure this won’t actually run, but it gives you the basic idea. (The full code is a complete nightmare.)
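If it helps, here’s a minimal self-contained version of the same idea that should actually run (the random data and the output filename are placeholders I made up):

import numpy as np
import matplotlib.pyplot as plt
from matplotlib.animation import FuncAnimation

# Placeholder data -- swap in your real coordinates
rng = np.random.default_rng(0)
X, Y, Z = rng.normal(size=(3, 500))

fig = plt.figure()
ax = fig.add_subplot(111, projection='3d')
ax.scatter(X, Y, Z, alpha=.5, s=20, color='navy', marker='o', linewidth=0)

def update(i, fig, ax):
    ax.view_init(elev=20., azim=i)  # rotate the camera one degree per frame
    return fig, ax

anim = FuncAnimation(fig, update, frames=np.arange(0, 360, 1),
                     repeat=True, fargs=(fig, ax))
anim.save('spin.mp4', dpi=80, writer='ffmpeg', fps=30)  # requires ffmpeg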
Thanks for the reply. I certainly agree that “factor analysis” often doesn’t make that assumption, though it was my impression that it’s commonly made in this context. I suppose the degree of misleading-ness here depends on how often people assume isotropic noise when looking at this kind of data?
In any case, I’ll try to think about how to clarify this without getting too technical. (I actually had some more details about this at one point but was persuaded to remove them for the sake of being more accessible.)
Factors of mental and physical abilities—a statistical analysis
if a trait is 80% heritable and you want to guess whether or not Bob has that trait then you’ll be 80% more accurate if you know whether or not Bob’s parents have the trait than if you didn’t have that information.
I think this is more or less correct for narrow-sense heritability (most commonly used when breeding animals) but not quite right for broad-sense heritability (most commonly used with humans). If you’re talking about broad-sense heritability, the problem is that you’d need to know not just whether the parents have the trait, but also which genes Bob actually inherited from each parent, as well as the effects of dominance, epistatic interactions, etc.
Assuming you’re talking about broad-sense heritability, I think a better way of looking at it would be to say that you’ll be 80% more accurate if Bob has an identical twin raised by a random family and you know if that twin had the trait. This isn’t quite right either, but I think it’s valid if you assume that phenotypic traits are the sum of genetic effects and environmental effects and also that genetic effects are independent of environmental effects.
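To make that concrete, here’s a toy simulation under exactly those two assumptions (the numbers are made up, and I’m reading “80% more accurate” loosely as a twin correlation of 0.8):

import numpy as np

rng = np.random.default_rng(0)
n, h2 = 100_000, 0.8  # h2 = share of phenotypic variance due to genes

g = rng.normal(scale=np.sqrt(h2), size=n)             # shared genetic effect
bob = g + rng.normal(scale=np.sqrt(1 - h2), size=n)   # genes + environment
twin = g + rng.normal(scale=np.sqrt(1 - h2), size=n)  # same genes, independent environment

print(np.corrcoef(bob, twin)[0, 1])  # ~0.8: the twin correlation recovers h2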
Of course, few people have identical twins raised by random families, and most phenotypes probably aren’t additive in genetic and environmental effects, and those effects probably aren’t independent! Which… is a lot of caveats if you want to know practical applications of heritability numbers.
On the other hand, there is some non-applied scientific value in heritability. For example, though religiosity is heritable, the specific religion people join appears to be almost totally un-heritable. I think it’s OK to read this in the straightforward way, i.e. as “genes don’t predispose us to be Christian / Muslim / Shinto / whatever”. I don’t have any particular application for that fact, but it’s certainly interesting.
Similarly, schizophrenia has sky-high heritability (like 80%), meaning that variation in current environments doesn’t have a huge impact on where schizophrenia appears. That’s also interesting, even if not immediately useful.
My view is that people should basically talk about heritability less and interventions more. In most practical circumstances, what we’re interested in is how much potential we have to change a trait. For example, you might want to reduce youth obesity. If that’s your goal, I don’t think heritability helps you much. High heritability doesn’t mean that there aren’t any interventions that can change obesity—it just means that the current environments that people are already exposed to don’t create much variance. Similarly, low heritability means the environment produces a lot of variance, but it doesn’t tell you anything specific you can actually do!
If your goal is to find interventions, all heritability gives you is some kind of vague clue as to how promising it might be to look at natural environmental variation to try to find interventions.
In principle, I guess you could also think about low-tech solutions. For example, people who want to opt out of alcohol might have some slowly dissolving tattoo / dye placed somewhere on their hand or something. This would eliminate the need for any extra ID checks, but has the big disadvantage that it would be visible most of the time.
Thanks. Are you able to determine what the typical daily dose is for implanted disulfiram in Eastern Europe? People who take oral disulfiram typically need something like 0.25g / day to have a significant physiological effect. However, most of the evidence I’ve been able to find (e.g. this paper) suggests that the total amount of disulfiram in implants is around 1g. If that’s dispensed over a year, you’re getting like 1% of the dosage that’s active orally. On top of that, the evidence seems pretty strong that bioavailability from implants is lower than from oral doses, so it’s effectively even less.
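(Spelling out the arithmetic: 1 g spread over 365 days is about 2.7 mg/day, and 2.7 mg / 250 mg ≈ 1%.)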
Of course, there’s nothing stopping someone from implanting a dose 100x as large, and maybe bioavailability can be improved (or isn’t that big a concern). But if not, my impression was that most implants are effectively pure placebo.
Very interesting! Do you know how much disulfiram the implant gives out per day? There are a bunch of papers on implants, but there are usually concerns that (a) the dosage might be much smaller than the typical oral dosage and/or (b) absorption is poor.
I specified (right before the first graph) that I was using the US standard of 14g. (I know the paper uses 10g. There’s no conflict because I use their raw data which is in g, not drinks.)
Ironically, there is no standard for what a “standard drink” is, with different countries defining it to be anything from 8g to 20g of ethanol.
I wasn’t (intentionally?) being ironic. I guess that for underage drinking we have the advantage that you can sort of guess how old someone looks, but still… good point.
Done!