I wonder if there have been any experiments with feeding transformers just straight binary info. I’m guessing it hasn’t been done in this context due to potential context length limitations?
I wonder if there have been any experiments with feeding transformers just straight binary info. I’m guessing it hasn’t been done in this context due to potential context length limitations?