Ramana Kumar comments on Are limited-horizon agents a good heuristic for the off-switch problem?

Ramana Kumar 21 Dec 2021 9:33 UTC
LW: 4 AF: 3
AF
Just a few links to complement Abram’s answer:

On how seemingly myopic training schemes can nonetheless produce non-myopic behaviour:
- Auto-induced distributional shift
- Predict-o-matic
On approval-directed agents:
- Tampering incentives of decoupled approval
- Act-based agents