carboniferous_umbraculum comments on On Developing a Mathematical Theory of Interpretability

carboniferous_umbraculum 9 Feb 2023 19:47 UTC
9 points
6
>There is no difference between natural phenomena and DNNs (LLMs, whatever). DNNs are 100% natural
I mean “natural” as opposed to “man made”. i.e. something like “occurs in nature without being built by something or someone else”. So in that sense, DNNs are obviously not natural in the way that the laws of physics are.
I don’t see information and computation as only mathematical; in fact in my analogies I write that the mathematical abstractions we build as being separate from the things that one wants to describe or make predictions about. And this applies to the computations in NNs too.
I don’t want to study AI as mathematics or believe that AI is mathematics. I write that the practice of doing mathematics will only seek out the parts of the problem that are actually amenable to it; and my focus is on interpretability and not other places in AI that one might use mathematics (like, say, decision theory).
You write “As an example, take “A mathematical framework for transformer circuits”: it doesn’t develop new mathematics. It just uses existing mathematics: tensor algebra.:” I don’t think we are using ‘new mathematics’ in the same way and I don’t think the way you are using it commonplace. Yes I am discussing the prospect of developing new mathematics, but this doesn’t only mean something like ‘making new definitions’ or ‘coming up with new objects that haven’t been studied before’. If I write a proof of a theorem that “just” uses “existing” mathematical objects, say like...matrices, or finite sets, then that seems to have little bearing on how ‘new’ the mathematics is. It may well be a new proof, of a new theorem, containing new ideas etc. etc. And it may well need to have been developed carefully over a long period of time.
- Roman Leventov 10 Feb 2023 7:24 UTC
  −1 points
  −3
  Parent
  I feel that you are redefining terms. Writing down mathematical equations (or defining other mathematical structures that are not equations, e.g., automata), describing natural phenomena, and proving some properties of these, i.e., deriving some mathematical conjectures/theorems, -- that’s exactly what physicists do, and they call it “doing physics” or “doing science” rather than “doing mathematics”.
  I mean “natural” as opposed to “man made”. i.e. something like “occurs in nature without being built by something or someone else”. So in that sense, DNNs are obviously not natural in the way that the laws of physics are.
  I wonder how would you draw the boundary between “man-made” and “non-man-made”, the boundary that would have a bearing on such a fundamental qualitative distinction of phenomena as the amenability to mathematical description.
  According to Fields et al.’s theory of semantics and observation (“quantum theory […] is increasingly viewed as a theory of the process of observation itself”), which is also consistent with predictive processing and Seth’s controlled hallucination theory which is a descendant of predictive processing, any observer’s phenomenology is what makes mathematical sense by construction. Also, here Wolfram calls approximately the same thing “coherence”.
  Of course, there are infinite phenomena both in “nature” and “among man-made things” the mathematical description of which would not fit our brains yet, but this also means that we cannot spot these phenomena. We can extend the capacity of our brains (e.g., through cyborgisation, or mind upload), as well as equip ourselves with more powerful theories that allow us to compress reality more efficiently and thus spot patterns that were not spottable before, but this automatically means that these patterns become mathematically describable.
  This, of course, implies that we ought to make our minds stronger (through technical means or developing science) precisely to timely spot the phenomena that are about to “catch us”. This is the central point of Deutsch’s “The Beginning of Infinity”.
  Anyway, there is no point in arguing this point fiercely because I’m kind of on “your side” here, arguing that your worry that developing theories of DL might be doomed is unjustified. I’d just call these theories scientific rather than mathematical :)