One possible failure mode I can imagine with this approach:
Suppose some important part of the conventional wisdom among you and/or your collaborators is wrong. It is likely that this flaw, if it exists, becomes more difficult for a researcher to discover if they have already heard a plausible-sounding explanation for the discrepancy. So by providing an “accelerated” path for new alignment researchers, you may reduce the likelihood that the error will be discovered.
This risk may be justifiable if enough smart researchers continue to work towards an understanding of alignment without taking this “accelerated track”. But my own observations in the context of high-prestige internships suggest that programs like this will be seen as a “fast track” to success in the field, and the most talented students will compete for entry.
It sounds like you are picturing an implementation which is not actually what I’d recommend. I believe this comment already responds to basically the same concern, but let me know if you’re saying something not covered by that.
Nope, that pretty much covers it.