there are the Schmidhuber Scholarpedia articles in some cases, but aside from being outdated, they are, well, Schmidhuber.
I hate Schmidhuber with a passion because I can smell everything he touches on Wikipedia, and it is always terrible.
Sometimes when I read pages about AI, I see things that almost certainly came from him, or one of his fans. I struggle to say exactly what impression Schmidhuber’s kind of writing gives, but perhaps this will suffice: “People never give the right credit to anything. Everything of importance is either published by my research group first but miscredited to someone later, or something like that. Deep Learning? It’s done not by Hinton, but Amari, but not Amari, but by Ivakhnenko. The more obscure the originator, the better, because it reveals how bad people are at credit assignment: if they were better at it, the real originators would not have been so obscure.”
For example, LSTM actually originated with Schmidhuber… and actually, it’s also credited to Schmidhuber (… or maybe Hochreiter?). But then GAN should be credited to Schmidhuber, and also Transformers. Currently he (or his fans) keeps trying to put the phrase “internal spotlights of attention” into the Transformer page, and I keep removing it. He wanted the credit so much that he went for argument-by-punning, renaming “fast weight programmer” to “linear transformers”, and quoting “internal spotlights of attention” out of context just to fortify the argument with a pun! I can do puns too! Rosenblatt (1962) even wrote about “back-propagating errors” in an MLP with a hidden layer. So what?
I actually took Schmidhuber’s claim seriously and carefully rewrote the page on Ivakhnenko’s Group method of data handling, giving all the mathematical details, so that one may evaluate it for oneself instead of relying on Schmidhuber’s claim. A few months later someone manually reverted everything I wrote! What does it read like when written by a partisan of Ivakhnenko?
The development of GMDH consists of a synthesis of ideas from different areas of science: the cybernetic concept of “black box” and the principle of successive genetic selection of pairwise features, Godel’s incompleteness theorems and the Gabor’s principle of “freedom of decisions choice”, and the Beer’s principle of external additions. GMDH is the original method for solving problems for structural-parametric identification of models for experimental data under uncertainty… Since 1989 the new algorithms (AC, OCC, PF) for non-parametric modeling of fuzzy objects and SLP for expert systems were developed and investigated. Present stage of GMDH development can be described as blossom out of deep learning neuronets and parallel inductive algorithms for multiprocessor computers.
Well excuse me, “Godel’s incompleteness theorems”? “the original method”? Also, I thought “fuzzy” had stopped being fashionable since the 1980s. I actually once tried to learn fuzzy logic and gave up after not seeing what the big deal was. The passage is filled with such pompous and self-important terminology, as if the lack of substance must be made up for by the heights of spiritual exhortation. Why say “combined” when they could say “consists of a synthesis of ideas from different areas of science”?
As a side note, such turgid prose, filled with long noun phrases, is pretty common among the Soviets. I once read that this kind of massive noun phrase had a political purpose, but I don’t remember what it was.
Edit: I found it. It’s from Yurchak, Alexei. “Soviet hegemony of form: Everything was forever, until it was no more.” Comparative Studies in Society and History 45.3 (2003): 480-510.
The following examples are taken from a 1977 leading article, “The Ideological Conviction of the Soviet Person” (Ideinost’ sovetskogo cheloveka, Pravda, July 1, 1977). For considerations of space, I will limit this analysis to two generative principles of block-writing: the principle of complex modification and that of complex nominalization. The first sentence in the Pravda text reads: “The high level of social consciousness of the toilers of our country, their richest collective experience and political reason, manifest themselves with an exceptional completeness in the days of the all-people discussion of the draft of the Constitution of the USSR.” I have italicized phrases that are nouns with complex modifiers that function as “building blocks” of ideological discourse.
“the high level of consciousness of the toilers,” the double-modifier “high level” conveys not only the claim that the Soviet toilers’ consciousness exists (to be high it must exist), but also that it can be measured comparatively, by different “levels.” The latter claim masks the former one, thereby making it harder to question directly...
during the 1960s and 1970s… nominal structures increased and new long nominal chains were created. This increased the circularity of ideological discourse. In the excerpt from the same 1977 leading article; the italicized phrase (which in English translation is broken into two parts) is a block of multiple nominals: “The spiritual image of the fighter and creator of the citizen of the developed socialist society reveals itself to the world in all its greatness and beauty both in the chiseled lines of the outstanding document of the contemporary times, and in the living existence, in the everyday reality of the communist construction (I v chekannykh strokakh vydaiushchegosia dokumenta sovremennosti, i v zhivoi deistvitel’nosti, v povsednevnykh budniakh kommunisticheskogo stroitel’stva raskryvaetsia pered mirom vo vsem velichii i krasote dukhovnyi obraz bortsa i sozidatelia, grazhdanina razvitogo sotsialisticheskogo obshchestva).”
nominals allow one to render ideological claims implicit, masking them behind other ideas, and therefore rendering them less subject to scrutiny or multiple interpretations. This nominal chain can be deconstructed into several corresponding verbal phrases, each containing one idea: “the citizen of the developed socialist society is a fighter and creator,” “the fighter and creator possesses a spiritual image,” “the spiritual image is great and beautiful,” etcetera. Converting these verbal phrases into one nominal phrase converts claims into presuppositions, presenting ideas as pre-established facts.
the 1970s discourse was special: its sentences contained particularly long nominal chains and only one verb, often simply a copula, with the sole purpose of turning these long chains of nominals into a sentence. This style created a notoriously “wooden” sound, giving ideological discourse its popular slang name, “oak language” (dubovyi iazyk).