IMO, the outer alignment problem is still the biggest problem in (technical) AI Alignment. We don’t know how to write down—or learn—good specifications, and people making strong AIs that optimize for proxies is still what’s most likely to get us all killed.
IMO, the outer alignment problem is still the biggest problem in (technical) AI Alignment. We don’t know how to write down—or learn—good specifications, and people making strong AIs that optimize for proxies is still what’s most likely to get us all killed.