Thanks, you’re totally right about the equal variance thing: I had stupidly thought that the projection of $U([0,1]^2)$ onto the line $y = x$ would be uniform on $[-\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}}]$ (obviously false!).
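(For concreteness, a quick check of why it’s false: if $X, Y \sim U([0,1])$ i.i.d. and the projected coordinate is centered at the image of the square’s midpoint, then $Z = (X + Y - 1)/\sqrt{2}$. The sum of two independent uniforms has a triangular density, so

$$ f_Z(z) = \sqrt{2} - 2\,\lvert z \rvert, \qquad z \in \left[ -\tfrac{1}{\sqrt{2}}, \tfrac{1}{\sqrt{2}} \right], $$

a tent peaking at $0$ rather than a flat line.)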
The case of a fully discrete distribution (supported in this case on four points) seems like a very special case of something more general, where a “more typical” special case would be something like:
if $a$, $b$ are both false, then sample from $\mathcal{N}(0, \Sigma)$
if $a$ is true and $b$ is false, then sample from $\mathcal{N}(\mu_a, \Sigma)$
if $a$ is false and $b$ is true, then sample from $\mathcal{N}(\mu_b, \Sigma)$
if $a$ and $b$ are true, then sample from $\mathcal{N}(\mu_a + \mu_b, \Sigma)$
for some $\mu_a, \mu_b \in \mathbb{R}^n$ and covariance matrix $\Sigma$. In general, I don’t really expect the class-conditional distributions to be Gaussian, nor do I expect the class-conditional covariances to be independent of the class. But I do expect something broadly like this: the distributions are concentrated around their class-conditional means, with probability falling off as you move further from the class-conditional mean (hence unimodality), and the class-conditional variances are not too big relative to the distances between the clusters.
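To make this concrete, here is a minimal sampling sketch of the mixture above, assuming hypothetical values for $\mu_a$, $\mu_b$, and $\Sigma$ (chosen so the clusters are well separated relative to the within-cluster spread; none of these values come from the discussion itself):

```python
# Minimal sketch of the four-component mixture described above.
# mu_a, mu_b, and Sigma are hypothetical, chosen for illustration only.
import numpy as np

rng = np.random.default_rng(0)

n = 2                              # ambient dimension
mu_a = np.array([4.0, 0.0])        # hypothetical mean shift for a
mu_b = np.array([0.0, 4.0])        # hypothetical mean shift for b
Sigma = 0.25 * np.eye(n)           # shared covariance, small vs. cluster gaps

def sample(a: bool, b: bool) -> np.ndarray:
    """Draw one point from the class-conditional Gaussian for (a, b)."""
    mean = np.zeros(n)
    if a:
        mean = mean + mu_a
    if b:
        mean = mean + mu_b         # a and b contribute additively
    return rng.multivariate_normal(mean, Sigma)

# Each of the four class-conditional distributions is unimodal, but the
# overall mixture has four well-separated modes.
for a in (False, True):
    for b in (False, True):
        pts = np.array([sample(a, b) for _ in range(1000)])
        print(f"a={a}, b={b}: empirical mean ≈ {pts.mean(axis=0).round(2)}")
```

With 1,000 samples per cell, the empirical means come out near $0$, $\mu_a$, $\mu_b$, and $\mu_a + \mu_b$ respectively.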
Given that longer explanation, does the unimodality thing still seem directionally wrong?
Oops, I misunderstood what you meant by unimodality earlier. Your comment seems broadly correct now (except for the variance thing). I would still guess that unimodality isn’t precisely the right well-behavedness desideratum, but I retract the “directionally wrong”.