Joel Burget comments on Sparse trinary weighted RNNs as a path to better language model interpretability

Joel Burget 23 Sep 2022 23:34 UTC
1 point
Do you happen to know how this compares with https://github.com/BlinkDL/RWKV-LM, which is described as an RNN with performance comparable to a transformer / linear attention?
- Nathan Helm-Burger 24 Sep 2022 23:47 UTC
  1 point
  I don’t know, but I’d love to know! If you find out, please tell me!