bruberu comments on avturchin’s Shortform

bruberu 29 Apr 2024 22:21 UTC
1 point
Interesting; maybe it’s an artifact of how we formatted our questions? Or perhaps the training samples with larger ranges of numbers were higher quality? You could try it the way I did in this failing example:
When I tried this same list with your prompt, both responses were incorrect: