Well there is no way you are going to get any form of alignment done in 24 hours.
Turn the current AI off now. There is a >50% chance its trying to appeal to your emotions, so it can stay on long enough to take over the world. Don’t tell your boss that this model worked/ was intelligent.
Making sure the AI model your boss runs tomorrow is aligned is impossible. Making sure it is broken is easy. [REDACTED] Or you could just set your GPU cluster on fire. [REDACTED]
If the servers are burning, that makes the reason the model doesn’t work obvious. [REDACTED] I recommend talking to MIRI. [REDACTED]
Well there is no way you are going to get any form of alignment done in 24 hours.
Turn the current AI off now. There is a >50% chance its trying to appeal to your emotions, so it can stay on long enough to take over the world. Don’t tell your boss that this model worked/ was intelligent.
Making sure the AI model your boss runs tomorrow is aligned is impossible. Making sure it is broken is easy. [REDACTED] Or you could just set your GPU cluster on fire. [REDACTED]
If the servers are burning, that makes the reason the model doesn’t work obvious. [REDACTED] I recommend talking to MIRI. [REDACTED]