What do you mean by Scaling Hypothesis? Do you believe extremely large transformer models trained based on autoregressive loss will have superhuman capabilities?
Can’t answer the second question, but see https://www.gwern.net/Scaling-hypothesis for the first.
What do you mean by Scaling Hypothesis? Do you believe extremely large transformer models trained based on autoregressive loss will have superhuman capabilities?
Can’t answer the second question, but see https://www.gwern.net/Scaling-hypothesis for the first.