Many users of base models have noticed this phenomenon, and my SERI MATS stream is currently working on empirically measuring it / compiling anecdotal evidence / writing up speculation concerning the mechanism.
Do you have any update on this? It goes strongly against my current understanding of how LLMs learn. In particular, during the supervised learning phase, any output text claiming to be an LLM would be penalized unless such statements are included in the training corpus. If such behavior nevertheless arises, though, I would be super excited to analyze it further.
It would definitely move the needle for me if y’all are able to show this behavior arising in base models without forcing, in a reproducible way.
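For concreteness, the reasoning I have in mind (assuming the "supervised learning phase" here means ordinary maximum-likelihood next-token pretraining) is that the model is trained to minimize

$$\mathcal{L}(\theta) = -\sum_{t} \log p_\theta(x_t \mid x_{<t})$$

summed over documents $x$ drawn from the corpus. Gradient updates on this loss only push probability mass toward continuations that actually occur in the training data, so a completion asserting "I am an LLM" should only be reinforced to the extent that such statements, or text that makes them likely, appear in the corpus.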