Hey, any chance you could do this replication eval for open-source models like Llama 2 and/or Falcon 180B? Probably they’ll have negligible performance but it would be interesting if they showed signs of life.
Yeah, I definitely could! It’s on my to-do list. I’ll let you know when I complete it.
Yay! Thanks in advance!
Hey, any chance you could do this replication eval for open-source models like Llama 2 and/or Falcon 180B? Probably they’ll have negligible performance but it would be interesting if they showed signs of life.
Yeah, I definitely could! It’s on my to-do list. I’ll let you know when I complete it.
Yay! Thanks in advance!