The beta distribution is the conjugate prior for Bernoulli trials: if you start with a beta prior, the posterior is also beta, which greatly simplifies the calculations. It is also approximately normal for large alpha and beta, and it can be fitted to any mean m in (0, 1) and any variance below m(1 - m), so it’s a good choice.
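As a rough illustration of why conjugacy keeps the arithmetic simple, here is a minimal sketch in Python. The function names are my own, not from any library, and the counts in the usage note are made up. It shows the posterior update (add successes to alpha and failures to beta) and how one might pick alpha and beta to match a chosen mean and variance.

```python
def beta_posterior(alpha, beta, successes, failures):
    """Update a Beta(alpha, beta) prior with observed Bernoulli outcomes."""
    return alpha + successes, beta + failures


def beta_mean_var(alpha, beta):
    """Mean and variance of a Beta(alpha, beta) distribution."""
    n = alpha + beta
    mean = alpha / n
    var = alpha * beta / (n * n * (n + 1))
    return mean, var


def beta_from_mean_var(mean, var):
    """Moment-match a beta distribution to a given mean and variance.

    Only possible when 0 < mean < 1 and 0 < var < mean * (1 - mean).
    """
    if not 0 < mean < 1:
        raise ValueError("mean must lie strictly between 0 and 1")
    if not 0 < var < mean * (1 - mean):
        raise ValueError("variance must be below mean * (1 - mean)")
    k = mean * (1 - mean) / var - 1  # k equals alpha + beta
    return mean * k, (1 - mean) * k
```

For example, starting from a uniform Beta(1, 1) prior and observing 7 successes and 3 failures, `beta_posterior(1, 1, 7, 3)` gives Beta(8, 4), whose mean is 8/12.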
Whatever your target function is, you’ll want the item with the greatest posterior mean of that target. In general this requires the full posterior distribution of p rather than just its mean, because the expected value of a nonlinear function of p is not the function of the expected value. But the distribution only describes what you know about p; it doesn’t itself encode properties such as “controversial”.
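To make this concrete, here is a hedged sketch that estimates the posterior mean of a target function by sampling from each item's beta posterior. The target `controversy(p) = p * (1 - p)` is only a hypothetical stand-in for whatever you mean by “controversial”, and the item names and posterior parameters are invented for illustration.

```python
import random


def posterior_mean_of_target(target, alpha, beta, n_samples=20000, seed=0):
    """Monte Carlo estimate of E[target(p)] where p ~ Beta(alpha, beta)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        total += target(rng.betavariate(alpha, beta))
    return total / n_samples


def controversy(p):
    """Hypothetical target: largest when p is near 0.5."""
    return p * (1.0 - p)


# Hypothetical posterior (alpha, beta) parameters per item.
items = {"A": (51, 51), "B": (9, 3)}

scores = {
    name: posterior_mean_of_target(controversy, a, b)
    for name, (a, b) in items.items()
}
best = max(scores, key=scores.get)
```

The property of interest lives in the target function, not in the distribution: swapping `controversy` for a different function changes the ranking without touching the posteriors. For this particular target a closed form exists, alpha * beta / ((alpha + beta) * (alpha + beta + 1)), so sampling is only needed for targets without one.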