I get a lot of trailing whitespace when using Claude Code and variants of Claude Sonnet, more than I see in short tests with base models. (Not rigorously tested yet.)
I wonder if the trailing whitespace encodes some information or is just some Constitutional AI/RL artefact.
It’s probably just a difference in tokenizer. Tokenizers often produce tokens with trailing whitespace. I actually once wrote a tokenizer and trained a model to predict “negative whitespace” for the cases where a token shouldn’t carry its usual trailing whitespace. But I don’t know how current tokenizers handle this; probably in different ways.
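For anyone who wants to poke at this, here’s a quick sketch of how to see where a given tokenizer puts its whitespace. It assumes the HuggingFace transformers library and uses the GPT-2 BPE tokenizer as a stand-in, since Claude’s tokenizer isn’t public:

```python
# Inspect how a tokenizer attaches whitespace to tokens.
# Assumes the HuggingFace `transformers` package; GPT-2 BPE is only a
# stand-in here, not Claude's actual tokenizer.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")

text = "return x  \nprint(y)"
print(tok.tokenize(text))
# GPT-2-style BPE marks a *leading* space with "Ġ" (e.g. 'Ġx'), and runs of
# spaces before a newline tend to become their own tokens, so whether
# whitespace reads as "trailing" depends on the tokenizer's convention.
```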
That would be my main guess as well (~75%), but not the overwhelmingly likely option.
Steganography /j
I think it’s possible! If it’s used to encode relevant information, it could be tested by running software engineering benchmarks (e.g. SWE-bench) while stripping any trailing whitespace during generation, and checking whether the score drops.
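A minimal sketch of the stripping side of that test, assuming a Python harness (`generate_patch` is a hypothetical placeholder for whatever actually produces the model output):

```python
# Strip trailing whitespace from each line of a generation before it is
# scored (or fed back into context), then compare benchmark scores against
# the unmodified generations.
def strip_trailing_whitespace(text: str) -> str:
    return "\n".join(line.rstrip() for line in text.splitlines())

def run_task(model, task):
    raw = generate_patch(model, task)          # hypothetical harness call
    stripped = strip_trailing_whitespace(raw)
    return raw, stripped                       # score both variants
```

If trailing whitespace really carries information the model relies on, the stripped variant should score lower.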
If it is encoding relevant info, then this would be the definition of steganography.
I know. I just don’t expect it to.