LLMs aren’t that useful for alignment experts because it’s a highly specialized field and there isn’t much relevant training data.
Seems plausibly true for the alignment-specific philosophy/conceptual work, but many people attempting to improve safety also end up doing large amounts of relatively normal work in other domains (ML, math, etc.).
The post is, of course, more centrally about the very alignment-specific use cases.