I admit I was focusing on a fully automated synthetic dataset and fully automated curation, with virtually 0 use of internet data, such that you can entirely make your own private datasets without having to interact with the internet at all, so you could entirely avoid the steganography and Sydney data problems at all.
I admit I was focusing on a fully automated synthetic dataset and fully automated curation, with virtually 0 use of internet data, such that you can entirely make your own private datasets without having to interact with the internet at all, so you could entirely avoid the steganography and Sydney data problems at all.