Why “set aside for now” the Japanese language tokens? I find those to be some of the most interesting, since my research indicates that all training datasets were supposedly pre-filtered (using Facebook opensource code) to have only english language in them. So how do 2 Japanese strings slip in?
Google Translate indicates that they are representative of the word “Thirty” and the phrase “Thirty-One Flavor”. Google Search brings up 1000s of pictures of young Japanese girls eating ice cream. My question is, how does one company’s marketing slogan make it into the top 100 glitchtokens when other far more market-savvy companies don’t?
So, imho, place “Baskin-Robbins + JP” alongside petertodd, SolidGoldMagikarp, and the rest of the GlitchPantheon.
Because it was 4:30 a.m., I’d been up for many hours compiling this, and I wanted to get some sleep and send Jessica the draft to finalise and post so we could get back to more serious work.
As it says:
″...set aside for now)”
Thanks for the new info. Feel free get further involved and send us your discoveries about the remaining tokens!
Why “set aside for now” the Japanese language tokens? I find those to be some of the most interesting, since my research indicates that all training datasets were supposedly pre-filtered (using Facebook opensource code) to have only english language in them. So how do 2 Japanese strings slip in?
Google Translate indicates that they are representative of the word “Thirty” and the phrase “Thirty-One Flavor”. Google Search brings up 1000s of pictures of young Japanese girls eating ice cream. My question is, how does one company’s marketing slogan make it into the top 100 glitchtokens when other far more market-savvy companies don’t?
So, imho, place “Baskin-Robbins + JP” alongside petertodd, SolidGoldMagikarp, and the rest of the GlitchPantheon.
Because it was 4:30 a.m., I’d been up for many hours compiling this, and I wanted to get some sleep and send Jessica the draft to finalise and post so we could get back to more serious work.
As it says:
″...set aside for now)”
Thanks for the new info. Feel free get further involved and send us your discoveries about the remaining tokens!