This also points out that Arena tells you what model is Model A and what is Model B. That is unfortunate, and potentially taints the statistics.
No, https://chat.lmsys.org/ says this:
Ask any question to two anonymous models (e.g., ChatGPT, Claude, Llama) and vote for the better one!You can chat for multiple turns until you identify a winner.Votes won’t be counted if model identities are revealed during the conversation.
Ask any question to two anonymous models (e.g., ChatGPT, Claude, Llama) and vote for the better one!
You can chat for multiple turns until you identify a winner.
Votes won’t be counted if model identities are revealed during the conversation.
So one can choose to know the names of the models one is talking with, but then one’s votes will not be counted for the statistics.
No, https://chat.lmsys.org/ says this:
So one can choose to know the names of the models one is talking with, but then one’s votes will not be counted for the statistics.