andymatuschak comments on Distillation of Neurotech and Alignment Workshop January 2023

andymatuschak 22 May 2023 14:30 UTC
9 points
4
I’m attending the Foresight workshop with Lisa and Sumner, and I wanted to share a point we just discussed: BCIs for value loading are interesting to consider from the perspective of scalable supervision.

Compared to RLAIF, a relatively coarse signal of disgust/fear from a human may be a more reliable or trustworthy response, particularly if sourced from multiple different humans. Simple EEG might be sufficient; for that matter, galvanic skin response from a ubiquitous device like an Apple Watch might be sufficient. Maybe we can be crowdsourcing value signals through noninvasive methods like these continuously.

The key question, I suppose, is whether these signals prove more valuable or trustworthy than something like RLAIF. But happily, that seems relatively straightforward to evaluate empirically with existing technology and evaluation methods.
- Alex K. Chen (parrot) 28 Jan 2024 23:21 UTC
  2 points
  0
  Parent
  https://stream.thesephist.com/updates/1711563348
  Neurable headphones could be one way of crowdsourcing value signals b/c they’re so wearable
  Hm there are other people like https://soulsyrup.github.io/ and @guillefix and Ogi
  tFUS is a way of accelerating internal alignment (look up PropheticAI). As are the Jhourney jhana people (though people like me have so much DMN noise that tFUS is needed first). Look up