Bogdan Ionut Cirstea comments on Why Don’t We Just… Shoggoth+Face+Paraphraser?

Bogdan Ionut Cirstea 20 Nov 2024 22:08 UTC
4 points
−2
Here’s a somewhat wild idea to have a ‘canary in a coalmine’ when it comes to steganography and non-human (linguistic) representations: monitor for very sharp drops in BrainScores (linear correlations between LM activations and brain measurements, on the same inputs) - e.g. like those calculated in Scaling laws for language encoding models in fMRI. (Ideally using larger, more diverse, higher-resolution brain data.)