The causal incentives working group should get mentioned, it’s directly on AI safety: though it’s a bit older I gained a lot of clarity about AI safety concepts via “Modeling AGI Safety Frameworks with Causal Influence Diagrams”, which is quite accessible even if you don’t have a ton of training in causality.
The causal incentives working group should get mentioned, it’s directly on AI safety: though it’s a bit older I gained a lot of clarity about AI safety concepts via “Modeling AGI Safety Frameworks with Causal Influence Diagrams”, which is quite accessible even if you don’t have a ton of training in causality.