Multimodal language models. We can already study narrow RL agents, but the intersection with alignment is not a hot area.
Multimodal language models. We can already study narrow RL agents, but the intersection with alignment is not a hot area.