I was probably going to make it a top level post, but it seems like this post covers the main points well, so I’ll just link my own CPP post here (Julian let me know if you mind, and I’ll move it):
It’s specifically about “the blackout strategy” that MrCheeze mentions below, in a greater degree of detail. Basically, I argue that:
You’re gonna get some type of degenerate equilibrium ~always, and more scaffolding will just as likely hurt as help (outside of obviously cheating sorts of scaffold)
The blackout strategy isn’t misalignment, just a classic local hill-climbing getting stuck situation
I also describe how the blackout strategy came to be in a little bit of detail. Probably not worth reading for anyone who only wanted a primer and by reading this post has gotten one, but if you can’t get enough Claudetent or are curious about the blackout strategy, please enjoy.
This is useful for me; I am not quite sure where to draw the line with crossposts, as I blog every week and don’t want to flood LW, but do want to crosspost where it’d definitely be relevant/useful!
Good good strategy might be to cross post post and see what reception they get on Less wrong as far as up votes go. If a post would stay in the single digits, don’t cross post other posts like that. If it gets 50+ karma, people on Less wrong wants to see more like it.
I was probably going to make it a top level post, but it seems like this post covers the main points well, so I’ll just link my own CPP post here (Julian let me know if you mind, and I’ll move it):
https://justismills.substack.com/p/the-blackout-strategy
It’s specifically about “the blackout strategy” that MrCheeze mentions below, in a greater degree of detail. Basically, I argue that:
You’re gonna get some type of degenerate equilibrium ~always, and more scaffolding will just as likely hurt as help (outside of obviously cheating sorts of scaffold)
The blackout strategy isn’t misalignment, just a classic local hill-climbing getting stuck situation
I also describe how the blackout strategy came to be in a little bit of detail. Probably not worth reading for anyone who only wanted a primer and by reading this post has gotten one, but if you can’t get enough Claudetent or are curious about the blackout strategy, please enjoy.
Amazingly, Claude managed to escape the blackout strategy somehow. Exited Mt. Moon at ~68 hours.
IMO this would be a great top-level post (as would many other of the posts on your Substack I just discovered!)
This is useful for me; I am not quite sure where to draw the line with crossposts, as I blog every week and don’t want to flood LW, but do want to crosspost where it’d definitely be relevant/useful!
Good good strategy might be to cross post post and see what reception they get on Less wrong as far as up votes go. If a post would stay in the single digits, don’t cross post other posts like that. If it gets 50+ karma, people on Less wrong wants to see more like it.