And somehow nobody seems to care about the potential ethical implications of simulating near-human quantities of neurons.
280 billion parameters is still far fewer than the number of synapses in a human brain. It's closer to a rat's brain, maybe even smaller than that.
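For a sense of scale, here is a rough back-of-the-envelope comparison (a sketch only; the synapse figures are approximate, commonly cited estimates that I'm supplying, not numbers from this thread, and the synapse/parameter analogy is itself contested below):

```python
# Rough order-of-magnitude comparison of Gopher's parameter count with
# approximate published synapse estimates (assumed figures, not exact).
gopher_params  = 2.8e11  # Gopher: 280 billion parameters
rat_synapses   = 4.5e11  # rat brain: very roughly 10^11-10^12 synapses
human_synapses = 1.0e14  # human brain: very roughly 10^14-10^15 synapses

print(f"Gopher / rat:   {gopher_params / rat_synapses:.2f}")    # ~0.6
print(f"Gopher / human: {gopher_params / human_synapses:.4f}")  # ~0.003
```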
Sure, but people do worry about harming rats, too, and, more importantly, by the time we get to actual human level it may already be too late. There is no prepared procedure for stopping the whole scaling process, no robust humanity-meter to tell us when it is safe to proceed, and not even a consensus on the relevant ethics.
DeepMind's recent research pokes some holes in the already shaky analogy between synapses and parameters: RETRO achieved performance comparable to GPT-3 despite having 25x fewer parameters.
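For anyone unfamiliar with how RETRO gets away with fewer parameters, here is a minimal toy sketch of the retrieval idea (my own illustrative stand-in, not DeepMind's implementation; the example database and bag-of-words similarity are made up, whereas RETRO uses a frozen BERT retriever over a trillion-token database):

```python
# Toy retrieval-augmented sketch: instead of memorizing every fact in its
# parameters, the model looks up relevant chunks from an external database
# and conditions on them alongside the prompt.
from collections import Counter
import math

database = [
    "Gopher is a 280-billion-parameter language model from DeepMind.",
    "RETRO pairs a comparatively small transformer with a huge retrieval database.",
    "Rats are commonly used as model organisms in neuroscience.",
]

def embed(text):
    # Crude bag-of-words "embedding" standing in for a learned retriever.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(v * b.get(w, 0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, k=1):
    q = embed(query)
    return sorted(database, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

prompt = "How many parameters does Gopher have?"
# A small model that sees the retrieved chunk plus the prompt can answer
# questions whose facts were never stored in its own weights.
print(retrieve(prompt) + [prompt])
```

Performance then depends on the size and quality of the retrieval database as much as on the parameter count, which is part of why parameter-to-synapse comparisons get murky.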
A human with Google also gets way better performance than a human without Google on "predict the next word of this website," so I'm not sure this undermines the analogy.
We've known for a while that it's possible to get good performance with far fewer parameters than BERT/GPT architectures use, e.g., ALBERT. The key point is that Gopher is much smaller and less capable than the human brain, even if we don't know the appropriate metric by which we should compare such systems.
Agreed. Per Sam Altman's statements, improving performance without scaling is also OpenAI's plan for GPT-4. And Gopher is far less capable than a human brain. It's just the "synapses as parameters" analogy that irks me: I see it everywhere, but it isn't reliable, and (despite disclaimers that the analogy isn't one-to-one) it leads people to even less reliable extrapolations. Hopefully, a better metric will be devised soon.