If you can successfully argue in court that it's a copyright violation to use data without having acquired the copyright, it would make training AI systems significantly harder.
Otherwise, European citizens whose names are known to an AI system could make GDPR requests asking what data is stored about them, and then ask for that data to be deleted.
This is basically the discussion at https://www.lesswrong.com/posts/vsuMu98Rwde5krxSJ/should-we-push-for-requiring-ai-training-data-to-be-licensed