My idea is very similar to paragraph vectors: the vectors are trained to be useful labels for predicting the tokens.
To differentiate author-vectors from other types of metadata, the author vectors should be additionally trained to predict author labels, with a heavily-reinforced constraint that the author vectors are identical for documents which have the same author. There’s also the author-vector-to-text-author-attribution network, which should be pre-trained to have a good “prior” over author-names (so we’re not getting a bunch of nonsense strings out). During training, the text author-names are estimated alongside the vectors (where author labels are not available), so that we can penalize different author-vectors which map to the same name. (Some careful thinking should be done about how to handle people who actually share a name; perhaps some system of longer author IDs?)
Other metadata would be handled similarly.
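Concretely, the core of this objective might look something like the following sketch; the module names, sizes, and loss weights are placeholder assumptions, and the author-name attribution network is left out:

```python
# Illustrative sketch only: per-document author vectors trained jointly to (a) help
# predict the document's tokens, (b) predict the author label, and (c) satisfy the
# "same author => same vector" constraint via a heavy tying penalty. All names,
# sizes, and weights are assumptions; the author-name attribution network is omitted.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AuthorConditionedLM(nn.Module):
    def __init__(self, vocab_size=10_000, n_docs=1_000, n_authors=100, d=128):
        super().__init__()
        self.doc_author_vec = nn.Embedding(n_docs, d)     # one learned author vector per document
        self.tok_embed = nn.Embedding(vocab_size, d)
        self.lm_head = nn.Linear(2 * d, vocab_size)       # token prediction conditions on the author vector
        self.author_head = nn.Linear(d, n_authors)        # author-label prediction from the vector alone

    def forward(self, doc_ids, context_tokens):
        a = self.doc_author_vec(doc_ids)                  # (batch, d)
        ctx = self.tok_embed(context_tokens).mean(dim=1)  # (batch, d) crude context summary
        token_logits = self.lm_head(torch.cat([ctx, a], dim=-1))
        return token_logits, self.author_head(a), a

def joint_loss(token_logits, next_tokens, author_logits, author_labels, a,
               same_author_pairs, w_author=1.0, w_tie=10.0):
    lm_loss = F.cross_entropy(token_logits, next_tokens)         # useful for predicting tokens
    author_loss = F.cross_entropy(author_logits, author_labels)  # predicts the author label
    i, j = same_author_pairs                                      # batch index pairs sharing an author
    tie_loss = F.mse_loss(a[i], a[j]) if len(i) else a.new_zeros(())
    return lm_loss + w_author * author_loss + w_tie * tie_loss    # large w_tie ~ "identical vectors"
```

In a real setup the context summary would be an actual language model, and the tying penalty could be replaced by literally sharing one vector per known author; the sketch just makes the three loss terms explicit.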
This seems reasonable, though the efficacy of the learning method is unclear to me.
But:
with a heavily-reinforced constraint that the author vectors are identical for documents which have the same author
This seems wrong. To pick on myself, my peer-reviewed papers, my Substack, my LessWrong posts, my 1990s blog posts, and my Twitter feed are all substantively different in ways that I think the author vector should capture.
My guess is that we want to capture those differences with the date&time metadata instead (and, to some extent, location and other metadata). That way, we can easily query what you-in-particular would say at other periods in your life (such as the future). However, I agree that this is at least not obvious.
Maybe a better way to do it would be to explicitly take both approaches, so that there’s an abstract-you vector which then gets mapped into a particular-you author space via combination with your age (i.e., with date&time). This attempts to explicitly capture the way you change over time (we can watch your vector move through the particular-author space), while still allowing us to query what you would say at times for which we don’t have evidence in the form of writing from you.
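For concreteness, a minimal sketch of that two-level mapping (the module names, shapes, and the raw fractional-year timestamp input are all placeholder assumptions, not a worked-out design):

```python
# Placeholder sketch of the two-level idea: a stable abstract-author vector is combined
# with a date&time embedding to produce a time-specific "particular author" vector used
# for conditioning generation. Everything here is illustrative.
import torch
import torch.nn as nn

class ParticularAuthorMap(nn.Module):
    def __init__(self, n_authors=100, d_author=128, d_time=16, d_out=128):
        super().__init__()
        self.abstract_author = nn.Embedding(n_authors, d_author)  # the abstract-you vector
        self.time_proj = nn.Linear(1, d_time)                     # scalar timestamp (e.g. fractional year)
        self.combine = nn.Sequential(
            nn.Linear(d_author + d_time, d_out),
            nn.Tanh(),
            nn.Linear(d_out, d_out),
        )

    def forward(self, author_ids, timestamps):
        a = self.abstract_author(author_ids)                      # (batch, d_author)
        t = self.time_proj(timestamps.unsqueeze(-1))              # (batch, d_time)
        return self.combine(torch.cat([a, t], dim=-1))            # particular-you vector at that time

# Querying what an author would sound like at a time we have no writing for is then just
# a matter of feeding in that timestamp:
model = ParticularAuthorMap()
future_vec = model(torch.tensor([7]), torch.tensor([2035.0]))     # author id 7, year 2035
```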
Ideally, imagining the most sophisticated version of the setup, the model would be able to make very fine-grained date&time attributions, guessing when specific words were written & constructing a guessed revision history for a document. This complicates things yet further.
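Purely as an illustration of what fine-grained attribution could mean mechanically, one option is a per-token timestamp head over the model’s hidden states (again, every name and shape here is an assumption):

```python
# Sketch (purely illustrative) of fine-grained date&time attribution: a per-token
# regression head guesses a write-time for each token from its hidden state, which
# could then be grouped into a rough guessed revision history.
import torch
import torch.nn as nn

class PerTokenTimestampHead(nn.Module):
    def __init__(self, d_model=128):
        super().__init__()
        self.head = nn.Linear(d_model, 1)            # hidden state -> scalar timestamp (fractional year)

    def forward(self, hidden_states):                # hidden_states: (batch, seq_len, d_model)
        return self.head(hidden_states).squeeze(-1)  # (batch, seq_len) predicted write-times

# Contiguous tokens whose predicted timestamps cluster together would then suggest
# which spans were written (or revised) at the same time.
```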