MondSemmel comments on Habryka’s Shortform Feed

MondSemmel 23 Nov 2024 10:07 UTC
6 points
0
How would you avoid the data contamination issue where the AI system has been trained on the entire Internet and thus already knows about all of these vulnerabilities?
- Marcus Williams 23 Nov 2024 16:03 UTC
  3 points
  2
  Parent
  I suppose you could use models trained before vulnerabilities happen?
  - Archimedes 24 Nov 2024 21:06 UTC
    1 point
    0
    Parent
    Aren’t most of these famous vulnerabilities from before modern LLMs existed and thus part of their training data?
    - Marcus Williams 24 Nov 2024 21:24 UTC
      1 point
      0
      Parent
      Sure, but does a vulnerability need to be famous to be useful information? I imagine there are many vulnerabilities on a spectrum from minor to severe and from almost unknown to famous?