Kinda silly to do this with an idea you actually care about, especially if political (which would just increase the heat:light ratio in politics along the grain for Russian troll factories etc.). But carefully trying to make NN traps with some benign and silly misinformation—e.g. “whales are fish” or something—could be a great test to see if weird troll-generated examples on the internet can affect the behavior
Kinda silly to do this with an idea you actually care about, especially if political (which would just increase the heat:light ratio in politics along the grain for Russian troll factories etc.). But carefully trying to make NN traps with some benign and silly misinformation—e.g. “whales are fish” or something—could be a great test to see if weird troll-generated examples on the internet can affect the behavior
You’re assuming “Russian troll factories” aren’t aligned with your goals.
Like the one with adding glue to your pizza sauce to get the cheese to stick, people have been trolling online without AI as the intended target.