So, my main idea is that the principle of maximum entropy (a.k.a. the principle of indifference) suggests a prior of 1/n, where n is the number of possibilities or classes. The usual binary mapping c = 2p − 1, i.e. p = (c + 1) / 2, gives p = 0.5 for c = 0. What I want is for c = 0 to lead to p = 1/n rather than 0.5, so that it works in multiclass cases where n is greater than 2.
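To see the issue concretely, here's a quick Python check (the function name is just for illustration): the binary mapping pins c = 0 to p = 0.5 no matter how many classes there are.

```python
# The binary mapping p = (c + 1) / 2 always puts p = 0.5 at c = 0,
# which matches the maximum-entropy prior 1/n only when n = 2.

def binary_confidence_to_prob(c: float) -> float:
    """Invert c = 2p - 1, i.e. p = (c + 1) / 2."""
    return (c + 1) / 2

print(binary_confidence_to_prob(0.0))  # 0.5 -- the right prior for n = 2
print(1 / 3)                           # 0.333... -- the prior we want when n = 3
```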
What’s the solution?
p = (n^c * (c + 1)) / (2^c * n)
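Here's a minimal Python sketch of that mapping (the function name confidence_to_prob is mine, and I'm assuming c ranges over [−1, 1] as in the binary case), checking that c = 0 gives 1/n, that n = 2 reduces to p = (c + 1) / 2, and that the endpoints behave.

```python
# A minimal sketch of the mapping above, assuming c ranges over [-1, 1]
# (as in the binary case c = 2p - 1); the function name is just for illustration.

def confidence_to_prob(c: float, n: int) -> float:
    """Map a confidence c in [-1, 1] to a probability whose c = 0 value is the 1/n prior."""
    return (n ** c) * (c + 1) / ((2 ** c) * n)

# c = 0 recovers the maximum-entropy prior 1/n for any n.
assert abs(confidence_to_prob(0.0, 5) - 1 / 5) < 1e-12

# n = 2 reduces to the familiar binary mapping p = (c + 1) / 2.
assert abs(confidence_to_prob(0.6, 2) - (0.6 + 1) / 2) < 1e-12

# The endpoints c = -1 and c = 1 map to p = 0 and p = 1 regardless of n.
assert confidence_to_prob(-1.0, 3) == 0.0
assert confidence_to_prob(1.0, 3) == 1.0
```

The c = 0 and n = 2 checks are the ones that matter; the endpoint checks just confirm the output stays in [0, 1].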
As far as I know, this is unpublished in the literature. It's a pretty obscure use case, so that's not surprising. I doubt I'll ever get around to publishing the paper I wanted to write, which uses this in an activation function to replace softmax in neural nets, so it probably doesn't matter much if I show it here.