I have an intuition that while impact measures as a way of avoiding negative side effects might work well in toy models, it will be hard or impossible to get them to work in the real world, because what counts as a negative side effect in the real world seems too complex to easily capture.
Although it is a far cry from “[avoiding side effects] in the real world”, Avoiding Side Effects in Complex Environments is another piece of evidence to update on.