The connection to features is that if the answer is no, there is no possible way the network could have arbitrary X-or combos of features that are linearly represented. It must be only representing some small subset of them. (probably the xor’s of 2 or 3 features, but not 100 features.)
Also, your maths description of the question matches what I was trying to express.
The connection to features is that if the answer is no, there is no possible way the network could have arbitrary X-or combos of features that are linearly represented. It must be only representing some small subset of them. (probably the xor’s of 2 or 3 features, but not 100 features.)
Also, your maths description of the question matches what I was trying to express.