In the original article, Dpi claims:
But this doesn’t seem to be present in your description? Unless I missed it.
It is not, per se, Hebb’s rule. Hebb’s rule is very general. I personally see this as belonging to it, that’s all. I give attribution where I think it is deserved.
… and it is in this description:
“The spiking network can adjust the weights of the active connections”
You didn’t miss it. Your quote and the later bit about “The question to ask is not ‘how’ to learn, but ‘when’.” seem to contradict each other. I think your quote is just a general allegory to Hebb’s rule, not meant to be taken as a literal system spec, but I could be wrong. I am confused by the original description.
That actually brings us to the core of it.
The way I phrased that was deliberately ambiguous.
Since 1958, the question the field has been trying to answer is how to transfer the information we get when a sample is presented into the weights, so that the network performs better next time.
BP computes the difference between what would be expected and what is measured, then propagates it to all intermediary weights according to a set of mathematically derived rules (the generalised delta rule). A lot of work has gone into figuring out the best way to do that. This is what I called ‘how’ to learn.
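For concreteness, here is a minimal textbook sketch of the delta rule for a single linear output unit. This is the standard form, not the system described here, and the names are mine:

```python
import numpy as np

# Textbook delta-rule update for one linear output unit (illustrative
# sketch only). 'x' is the input vector, 'target' the expected output,
# 'lr' the learning rate.
def delta_rule_update(weights, x, target, lr=0.01):
    predicted = weights @ x        # what is measured
    error = target - predicted     # expected minus measured
    return weights + lr * error * x

# Backpropagation generalises this: the same error signal is propagated
# to the intermediary weights of deeper layers via the chain rule.
```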
In this system, the method used is just the simplest and most intuitive one possible: INC and DEC of the weights, depending on whether or not the answer is correct.
The quantiliser, then, tells the system to apply that simple rule only under certain conditions (the Δ⊤ and Δ⊥ᵢ limits). That is ‘when’.
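A hypothetical sketch of how the two fit together, combining the ‘how’ from the previous paragraph with the ‘when’. Here `margins_ok` is a stand-in for the Δ⊤ and Δ⊥ᵢ tests, which are defined in the article, not here:

```python
# Hypothetical sketch: the simple INC/DEC rule, gated by a
# quantiliser-style condition. 'margins_ok' stands in for the
# Δ⊤ and Δ⊥ᵢ limit tests from the article.
STEP = 1  # fixed increment, the simplest possible update

def maybe_update(weights, active, correct, margins_ok):
    if not margins_ok:                  # the 'when': learn only inside the limits
        return weights
    delta = STEP if correct else -STEP  # the 'how': INC or DEC
    for i in active:                    # only active connections are adjusted
        weights[i] += delta
    return weights
```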
You can use the delta rule instead of our basic update rule if you want (I tried). The result is not better, and it is less stable, so you have to use small gradients. The problem, as I see it, is that the conditions under which Jessica Taylor’s theorem is valid are no longer met, and you have to ‘fix’ that. I did not investigate that extensively.