Microsoft TrueSkill (a multiplayer Elo-like rating system, https://www.wikiwand.com/en/TrueSkill)
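TrueSkill itself is a Bayesian system that models each player's skill as a Gaussian (a mean plus an uncertainty) rather than a single number, so the sketch below is not TrueSkill; it is the simpler Elo-style update it generalizes, shown here just to illustrate the basic idea of rating adjustment from game outcomes (the `k=32` factor is a conventional choice, not anything from TrueSkill):

```python
def elo_update(rating_a, rating_b, score_a, k=32):
    """Return updated (rating_a, rating_b) after one game.

    score_a is the result from player A's perspective:
    1 for a win, 0.5 for a draw, 0 for a loss.
    """
    # Expected score for A under the Elo logistic model.
    expected_a = 1 / (1 + 10 ** ((rating_b - rating_a) / 400))
    # A gains exactly what B loses, scaled by how surprising the result was.
    delta = k * (score_a - expected_a)
    return rating_a + delta, rating_b - delta

# An upset (a 1500 player beating a 1700 player) moves ratings more
# than an expected result would.
new_a, new_b = elo_update(1500, 1700, 1)
```

Note the zero-sum property: the total rating in the pool is conserved, which is one of the things TrueSkill's Gaussian treatment relaxes.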
I originally read this EA as “Evolutionary Algorithms” rather than “Effective Altruism”, which made me think of this paper on degenerate solutions to evolutionary algorithms (https://arxiv.org/pdf/1803.03453v1.pdf). One amusing example is shown in a video at https://twitter.com/jeffclune/status/973605950266331138?s=20
Some additional ideas: machine learning uses a large variety of “loss functions” to score the quality of solutions; some of the most popular are listed below. A good overview is at https://medium.com/udacity-pytorch-challengers/a-brief-overview-of-loss-functions-in-pytorch-c0ddb78068f7
* Mean absolute error (a.k.a. L1 loss)
* Mean squared error
* Negative log-likelihood
* Hinge loss
* KL divergence
* BLEU loss for machine translation (https://www.wikiwand.com/en/BLEU)
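Most of the losses above (BLEU aside, which is a corpus-level translation metric) reduce to a few lines of arithmetic. A minimal pure-Python sketch of each, written from their standard definitions rather than any particular library's API:

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error (L1 loss)."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mse(y_true, y_pred):
    """Mean squared error (L2 loss)."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def nll(probs, labels):
    """Negative log-likelihood: average -log of the probability
    assigned to the true class."""
    return -sum(math.log(p[l]) for p, l in zip(probs, labels)) / len(labels)

def hinge(y_true, scores):
    """Hinge loss for binary classification; labels are -1 or +1 and
    scores are raw margins."""
    return sum(max(0.0, 1 - t * s) for t, s in zip(y_true, scores)) / len(y_true)

def kl_divergence(p, q):
    """KL divergence D_KL(P || Q) between two discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```

Notice how the choice of loss encodes what “bad” means: MSE punishes large errors quadratically, hinge only cares about margin violations, and KL divergence compares whole distributions rather than point predictions.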
There’s also a large set of “goodness of fit” measures that evaluate the quality of a model, including simple measures like R² but also more exotic tests that, for example, compare distributions. Wikipedia again has a good overview (https://www.wikiwand.com/en/Goodness_of_fit)
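R² is one of the simplest of these to compute by hand: it compares the model's residual error against the error of a baseline that just predicts the mean. A small sketch from that definition:

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    1.0 means a perfect fit; 0.0 means the model does no better than
    always predicting the mean of y_true (negative values are worse).
    """
    mean_y = sum(y_true) / len(y_true)
    ss_tot = sum((y - mean_y) ** 2 for y in y_true)      # baseline error
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))  # model error
    return 1 - ss_res / ss_tot
```

The more exotic goodness-of-fit tests mentioned above (e.g. Kolmogorov–Smirnov or chi-squared) follow the same spirit but compare entire distributions instead of pointwise residuals.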