I was imagining humans deliberately designing and testing a training process that did this automatically. I haven’t thought about how to define, much less automatically detect, unhelpful abstractions or heuristics, though.