Sodium comments on (Maybe) A Bag of Heuristics is All There Is & A Bag of Heuristics is All You Need

Sodium 9 Oct 2024 5:00 UTC
3 points
2
I see, I think that second tweet thread actually made a lot more sense, thanks for sharing!
McCoy’s definitions of heuristics and reasoning is sensible, although I personally would still avoid “reasoning” as a word since people probably have very different interpretations of what it means. I like the ideas of “memorizing solutions” and “generalizing solutions.”

I think where McCoy and I depart is that he’s modeling the entire network computation as a heuristic, while I’m modeling the network as compositions of bags of heuristics, which in aggregate would display behaviors he would call “reasoning.”
The explanation I gave above—heuristics that shifts the letter forward by one with limited composing abilities—is still a heuristics-based explanation. Maybe this set of composing heuristics would fit your definition of an “algorithm.” I don’t think there’s anything inherently wrong with that.
However, the heuristics based explanation gives concrete predictions of what we can look for in the actual network—individual heuristic that increments a to b, b to c, etc., and other parts of the network that compose the outputs.

This is what I meant when I said that this could be a useful framework for interpretability :)
- Noosphere89 9 Oct 2024 12:48 UTC
  4 points
  0
  Parent
  Now I understand.
  
  Though I’d still claim that this is evidence towards the view that there is a generalizing solution that is implemented inside of LLMs, and I wanted people to keep that in mind, since people often treat heuristics as meaning that it doesn’t generalize at all.
  - Sodium 9 Oct 2024 17:13 UTC
    3 points
    3
    Parent
    since people often treat heuristics as meaning that it doesn’t generalize at all.
    Yeah and I think that’s a big issue! I feel like what’s happening is that once you chain a huge number of heuristics together you can get behaviors that look a lot like complex reasoning.