gwern comments on Can I take ducks home from the park?

gwern 14 Sep 2023 21:05 UTC
5 points
1
Conclusions?
- dynomight 14 Sep 2023 21:16 UTC
  15 points
  2
  Parent
  Well, no. But I guess I found these things notable:
  - Alignment remains surprisingly brittle and random. Weird little tricks remain useful.
  - The tricks that work for some models often seem to confuse others.
  - Cobbling together weird little tricks seems to help (Hindi ranger step-by-step)
  - At the same time, the best “trick” is a somewhat plausible story (duck-store).
  - PaLM 2 is the most fun, Pi is the least fun.