Seth Herd comments on Why Don’t We Just… Shoggoth+Face+Paraphraser?

Seth Herd 20 Nov 2024 1:23 UTC
3 points
0
This might work. Let’s remember the financial incentives. Exposing a non-aligned CoT to all users is pretty likely to generate lots of articles about how your AI is super creepy, which will create a public perception that your AI in particular is not trustworthy relative to your competition.

I agree that it would be better to expose from an alignment perspective, I’m just noting the incentives on AI companies.
- Nathan Helm-Burger 20 Nov 2024 4:24 UTC
  5 points
  0
  Parent
  Hah, true. I wasn’t thinking about the commercial incentives! Yeah, there’s a lot of temptation to make a corpo-clean safety-washed fence-sitting sycophant. As much as Elon annoys me these days, I have to give the Grok team credit for avoiding the worst of the mealy-mouthed corporate trend.