I wouldn’t want to work for Anthropic in any position where I thought I might someday be pressured to do capabilities work, or “alignment research” that had a significant chance of turning out to be capabilities work. If your impression is that there’s a good chance of that happening, or there’s some other legitimization-type effect I’m not considering, then I’ll save myself the trouble of applying.
One piece of data: I haven’t been working at Anthropic for very long so far, but I have easily been able to avoid capability-relevant work, and I haven’t personally felt any pressure to do it. As for the other big labs, my guess is that the same would be true at DeepMind, but not at OpenAI.