Thanks for sharing, I was planning on reading this paper too. My guess coming in was that the results would not hold up with scale, and for many of the reasons you mentioned. Kind of disappointed they didn’t mention in the abstract that they used OPT-125m.
Thanks for sharing, I was planning on reading this paper too. My guess coming in was that the results would not hold up with scale, and for many of the reasons you mentioned. Kind of disappointed they didn’t mention in the abstract that they used OPT-125m.