In the strictest sense, almost no specific (interesting) output is information that has already been generated by any model.
If I tell a model to write me a book summary, that book summary can be specific interesting output without containing any new information.
If I want to know how to build a bomb, there are already plenty of sources out there explaining how. The information is already accessible from those sources. When an LLM synthesizes the existing information in its training data to help someone build a bomb, it's not inventing new information.
Deepfakes, by contrast, aren't about simply repeating information that's already in the training data.
So the argument would be that the lawmaker chose the word "accessible" because they wanted to allow LLMs to synthesize the existing information in their training data and repeat it back to the user. That does not mean the lawmaker intended to allow LLMs to produce new information that gets used to cause harm, even if there are other ways to create that information.