RomanS comments on Twitter Twitches

RomanS 6 Jul 2023 7:03 UTC
1 point
0
And some of these bots have been through many iterations of detection and counter-detection, and are routing their requests through residential-IP botnets, with fake user-agent strings trying to approximate real web browsers.
As someone who has done scraping a few times, I can confirm that it’s trivial to circumvent protections against it, even for a novice programmer. In most cases, it’s literally less than 10 minutes of googling and trial & error.
And for a major AI / web-search company, it could be a routine task, with teams of dedicated professionals working on it.