JoshuaZ comments on [Funny] Even Clippy can be blamed on the use of non-Bayesian methods

JoshuaZ 3 Oct 2011 2:31 UTC
6 points
I withdrew my vote after jsalvatier made his comment. It made me think more about it, and the fact that I have a general problem with voting too often for things that are funny instead of things that genuinely help the signal to noise ratio. I also saw the extremely high total vote as worrisome. If the vote had at the time been +10 or +15 or so I might not have felt as much of a need to withdraw my vote.

I suspect that similar thought processes occurred with other people.
- pedanterrific 3 Oct 2011 2:51 UTC
  7 points
  Parent
  Yes, this is what I assumed had happened and was commenting on. Maybe I just pay too much attention to karma because I’m green as grass, but I don’t think I’ve ever cast a vote without thinking through what I find valuable about the post and how that compares to its current total. The fact that apparently a lot of people cast what I would call impulse votes is making me reevaluate exactly what it is that ‘karma’ is measuring.
  
  Edit: Oh, I just realized—the anti-kibitzer hides karma scores as well as usernames. Probably there’s a large subset of voters who don’t and can’t take relative totals into account until someone comments on it.
  
  Of course, if someone considers that a good reason to reverse their vote I don’t know why they would be using the anti-kibitzer in the first place.
  - SarahSrinivasan 3 Oct 2011 7:07 UTC
    6 points
    Parent
    I still don’t think anyone here should feel good about paying attention to current total while deciding whether to upvote or downvote. Share evidence, not conclusions. The net karma a comment ends up at should be the result of aggregating our valuations, not a result of, say, whether those who thought it should be at +100 voted before or after those who thought it should be at +2.
    
    Edit: it’s clear to me now that I don’t have a good solution to my perceived problem.
    - AdeleneDawner 3 Oct 2011 19:14 UTC
      1 point
      Parent
      It seems to me that your suggested policy would result in comment-placement effects being even stronger than they are now. What score should a comment end up with if 50 people consider voting on it and they all think it should have a score of +2?
      - SarahSrinivasan 3 Oct 2011 21:51 UTC
        1 point
        Parent
        I communicated poorly. I don’t think “should have a score of +2” should enter into the decision to upvote, downvote, or not vote. Instead, I’d rather voting algorithms which, when implemented individually, have results which can be meaningfully summed. For example, suppose everyone upvotes exactly when they think a comment is in the top 5% of comments in “everyone should read this” ordering and downvotes for the bottom 5%. Then the sum reflects the number of people who read the comment x (the average percentage of people who thought it was in the top 5% - bottom 5%). That’s something I can understand.
        
        If I think a comment should end up with a score of +2, too bad, I have no direct way of controlling that. The resulting score is a reflection of the community’s votes, not something I try to game by altering my voting decision based on whether the score gets closer to +2.
        
        I mean, do people downvote comments that they would have otherwise not voted on if they think the comment has too many upvotes? If not, why do they decline to upvote when they otherwise would have upvoted? The two look the same from everyone else’s perspective, right?
        AdeleneDawner 3 Oct 2011 23:04 UTC
        6 points
        Parent
        I’m not saying that your proposed algorithm is wrong—not exactly, anyway. I am pointing out something that I think is a flaw.
        
        Putting the same point a different way:
        
        Consider two comments. One is posted early, and is seen by 50 people. It’s slightly good—good enough that each of those people would, by your algorithm, upvote it, but no better than that. The other is posted late, and is only seen by 10 people, but it’s very, very good. According to your algorithm, the first one would get a score of +50 and the second one would get a score of +10. By the methods currently in use, the first one will get a low score—probably +1 or +2 - and the second one will still get +10.
        
        The first comment got many more points than the second, by your algorithm, because its author was able to quickly put together something good enough to be upvoteable, and because they were at the right place at the right time to post it early in the conversation, which implies either luck or lots of time spent lurking on LW. I don’t think these are things we want to incentivise—at least not more than we want to incentivise putting time into crafting well-thought-out comments.
        
        Also:
        
        … do people downvote comments that they would have otherwise not voted on if they think the comment has too many upvotes?
        
        I do this. Not very often, but it happens.
        SarahSrinivasan 4 Oct 2011 0:18 UTC
        4 points
        Parent
        You’re right. Reviewing my feelings on this I discovered that my main “ugh, that’s terrible” feeling comes from the observation that a correlated set of people form a control system that wipes out the contributions of others not in a similar or larger implicit alliance. That doesn’t imply the solution is to vote independently of the total, though, as there are negative side effects like the one you describe.
        JoshuaZ 3 Oct 2011 23:13 UTC
        3 points
        Parent
        
        I mean, do people downvote comments that they would have otherwise not voted on if they think the comment has too many upvotes? If not, why do they decline to upvote when they otherwise would have upvoted?
        
        I often (although) not always will upvote a comment simply if it deserves it. I only very rarely downvote or don’t vote a comment if I think it is too high but should be positive. Declining to upvote a too high comment is something I do much more frequently than downvoting a too high comment. This is a passive rather than active decision. In general declining to upvote creates less negative emotional feelings in me than actively downvoting something which is too high.
        
        I do sometimes upvote comments that have been downvoted if I think they’ve simply been downvoted way too much. That seems for me at least to be the most common form of corrective voting.
        
        I have no idea how representative my behavior is of the general LWian.
        wedrifid 3 Oct 2011 22:48 UTC
        2 points
        Parent
        
        If I think a comment should end up with a score of +2, too bad, I have no direct way of controlling that. The resulting score is a reflection of the community’s votes, not something I try to game by altering my voting decision based on whether the score gets closer to +2.
        
        Ok, but that’s your self handicapping and I want no part of it myself.
        
        My decision to vote shall be determined by whatever vote I predict has the best consequences.
        SarahSrinivasan 4 Oct 2011 0:19 UTC
        1 point
        Parent
        Surely by whatever vote is recommended by the decision procedure you predict has the best consequences. ;)
        wedrifid 4 Oct 2011 0:59 UTC
        1 point
        Parent
        
        Surely by whatever vote is recommended by the decision procedure you predict has the best consequences. ;)
        
        No, I meant what I said.
        pedanterrific 3 Oct 2011 22:12 UTC
        2 points
        Parent
        
        I don’t think “should have a score of +2” should enter into the decision to upvote, downvote, or not vote.
        
        Why not? No, really: what’s wrong with that?
        
        Instead, I’d rather voting algorithms which, when implemented individually, have results which can be meaningfully summed.
        
        The current voting algorithms can be meaningfully summed, they’re just complicated, opaque and nonstandardized. I don’t understand why you think “everyone should use my voting algorithm” is a useful thing to say.
        
        If I think a comment should end up with a score of +2, too bad, I have no direct way of controlling that.
        
        In what situation would you not, given that it is possible to alter your voting decision based on whether the score gets closer to +2? Do you intend to prevent that somehow?
        
        do people downvote comments that they would have otherwise not voted on if they think the comment has too many upvotes?
        
        At least two people do. Why do you ask? (Seriously, I can’t figure out why this is phrased as a rhetorical question.)
        
        Edit: Okay, here’s the thing: I think it would be more useful if karma was the average of our valuations; i.e. if you could, say, input ‘+10’ or ‘-3’ as shorthand for ‘upvote if below this number, downvote if above’ rather than simply ‘upvote’ and ‘downvote’. What do you imagine the problem with this system would be?
        wedrifid 3 Oct 2011 22:46 UTC
        1 point
        Parent
        
        Edit: Okay, here’s the thing: I think it would be more useful if karma was the average of our valuations; i.e. if you could, say, input ‘+10’ or ‘-3’ as shorthand for ‘upvote if below this number, downvote if above’ rather than simply ‘upvote’ and ‘downvote’. What do you imagine the problem with this system would be?
        
        Not exactly a problem but a lotof my votes would either be +1000 or −1000.
  - JoshuaZ 3 Oct 2011 3:01 UTC
    2 points
    Parent
    I think that karma is a useful feedback but only at a very approximate level. If a post is heavily upvoted or heavily downvoted it is likely to be higher quality. But this is extremely approximate. The posts I’ve had most upvoted are rarely what I would consider my highest quality remarks. For example, this comment was relevant but I don’t see any reason why it is at +24 other than some sort of bandwagon effect.
    - pedanterrific 3 Oct 2011 4:11 UTC
      0 points
      Parent
      Pff, that’s nothing. Two of my highest-karma comments (try not to laugh at the totals; I’m green as grass, remember) are utterly derivative, by virtue of being simple restatements of another person’s point in a slightly funnier way. Namely this and this.
      
      It’s embarrassing, frankly.
      - JoshuaZ 3 Oct 2011 13:38 UTC
        0 points
        Parent
        Ok. But the real thing is the discrepancy between them. While that comment I made is at +24, this comment is at +2 where it uses a nearly identical level of sources and analysis about a somewhat similar set of demographic issues.
        
        It isn’t just that some funny comments get voted up a lot. It is that there’s very little general pattern to how far one comment gets up compared to another even when they are very similar comments.
        [deleted] 3 Oct 2011 14:25 UTC
        15 points
        Parent
        Comments get more upvotes, independent of quality, if they:
        
        Are in a high-traffic thread
        Are made while the thread is still new
        Get an early complimentary reply
        Make a point many people agree with and care about (especially if the first to make that point)
        Become the highest-karma comment early on (bandwagon + people may only read/vote on the first few comments, so being the top comment is valuable)
        Are closer to top-level (people don’t read deep into threads unless particularly interested)
        
        I think these effects, in aggregate, are probably much stronger determinants of comment karma than actual quality. Top-level posts, to main or discussion, suffer from fewer of these effects, so their karma is a little more reliable. But I hope no one is taking their comment karma too much to heart.
        pedanterrific 3 Oct 2011 14:29 UTC
        3 points
        Parent
        
        I think that karma is a useful feedback but only at a very approximate level. …
        
        … there’s very little general pattern to how far one comment gets voted up compared to another even when they are very similar comments.
        
        If that’s true, then… what’s the point of karma scores?
        
        How about this: keep track of total votes behind the scenes, but only report whether the karma is [- -] for k<-5, [-] for −4+10.