There are extensions like adnauseum which try to poison your data trace. Though it’s dubious whether they help much. You could have some kind of crawler thingy which would pretend to be like 100 normal users so you get lost in the noise. But even that could probably be filtered out if someone really wanted to—it would be hard to accurately simulate a human (I also dimly recall reading an article about it?). Maybe something that records other peoples sessions and plays them back? Or a LLM doing it (hehe)? But even that wouldn’t help in the case of being logged in to various services, and I’m guessing that most people don’t automatically log out of gmail whenever they change tabs?
One source of hope is that data gets stale quickly. People can change their minds (even if they don’t), so just because you know what I thought a year ago doesn’t mean that you know what I think now. Then again, most people don’t care either way and it would be pretty simple to remove the small number of outliers who suddenly go dark. One possible way of cleaning up would be to spend a couple of months posting more and more radically strange posts (e.g. going all in on flat earthism) before going private in order to mislead any analysis. This is hard and requires passing an ITT.
Tor + cleaning cookies + logging out of everything after using them + separate user profiles goes a long way. But it’s very inconvenient.
TOR is way too slow and google hates serving content to TOR users. I2P might be faster than TOR but the current adoption is way too low. Additionally, it doesn’t help that identity persistence is a regulatory requirement in most jurisdictions because it helps traceability against identity theft, financial theft, fraud, etc… Cookie cleaning means they have to log in every time which for most people is too annoying.
I acknowledge that there are ways to technically poison existing data. The core problem though is finding things that both normal people and also technically adept (alignment researchers/engineers/...) would actually be willing to do.
The general vibe I see right now is - * shrug shoulders * they already know so I might as well just make my life convenient and continue giving them everything...
Honestly, I don’t really even think it should be the responsibility of the average consumer to have to think about this at all. Should it be your responsibility to check every part of the engine in your car when you want to drive to make sure it is not going to blow up and kill you? Of course not, that responsibility should be on the manufacturer. Similarly, the responsibility for mitigating the adverse effects of data gathering should be on the developing companies not the consumers.
Tor + cleaning cookies + logging out of everything after using them + separate user profiles goes a long way. But it’s very inconvenient.
Uh, I’ve done this since forever and it doesn’t feel so inconvenient to me. I generally use Firefox in private browsing, configured to always throw away all cookies at the end of every session. Ten years ago it wasn’t even a privacy concern, I simply hate to exit from a webpage without proper logout, it feels like not closing the door when leaving your house...
It requires you to actively manage long lived sessions which would otherwise be handled by the site you’re using. You can often get back to where you were by just logging in again, but there are many places (especially for travel or official places) where that pretty much resets the whole flow.
There are also a lot more popups, captchas and other hoops to jump through when you don’t have a cookies trail.
The average user is lazy and doesn’t think about these things, so the web as a whole is moving in the direction of making things easier (but not simpler). This is usually viewed as a good thing by those who then only need to click a single button. Though it’s at the cost of those who want to have more control.
It might not be inconvenient to you, especially as it’s your basic flow. It’s inconvenient for me, but worth the cost, but basically unusable for most of the people I know (compared to the default flow).
There are extensions like adnauseum which try to poison your data trace. Though it’s dubious whether they help much. You could have some kind of crawler thingy which would pretend to be like 100 normal users so you get lost in the noise. But even that could probably be filtered out if someone really wanted to—it would be hard to accurately simulate a human (I also dimly recall reading an article about it?). Maybe something that records other peoples sessions and plays them back? Or a LLM doing it (hehe)? But even that wouldn’t help in the case of being logged in to various services, and I’m guessing that most people don’t automatically log out of gmail whenever they change tabs?
One source of hope is that data gets stale quickly. People can change their minds (even if they don’t), so just because you know what I thought a year ago doesn’t mean that you know what I think now. Then again, most people don’t care either way and it would be pretty simple to remove the small number of outliers who suddenly go dark. One possible way of cleaning up would be to spend a couple of months posting more and more radically strange posts (e.g. going all in on flat earthism) before going private in order to mislead any analysis. This is hard and requires passing an ITT.
Tor + cleaning cookies + logging out of everything after using them + separate user profiles goes a long way. But it’s very inconvenient.
TOR is way too slow and google hates serving content to TOR users. I2P might be faster than TOR but the current adoption is way too low. Additionally, it doesn’t help that identity persistence is a regulatory requirement in most jurisdictions because it helps traceability against identity theft, financial theft, fraud, etc… Cookie cleaning means they have to log in every time which for most people is too annoying.
I acknowledge that there are ways to technically poison existing data. The core problem though is finding things that both normal people and also technically adept (alignment researchers/engineers/...) would actually be willing to do.
The general vibe I see right now is - * shrug shoulders * they already know so I might as well just make my life convenient and continue giving them everything...
Honestly, I don’t really even think it should be the responsibility of the average consumer to have to think about this at all. Should it be your responsibility to check every part of the engine in your car when you want to drive to make sure it is not going to blow up and kill you? Of course not, that responsibility should be on the manufacturer. Similarly, the responsibility for mitigating the adverse effects of data gathering should be on the developing companies not the consumers.
Uh, I’ve done this since forever and it doesn’t feel so inconvenient to me. I generally use Firefox in private browsing, configured to always throw away all cookies at the end of every session. Ten years ago it wasn’t even a privacy concern, I simply hate to exit from a webpage without proper logout, it feels like not closing the door when leaving your house...
It requires you to actively manage long lived sessions which would otherwise be handled by the site you’re using. You can often get back to where you were by just logging in again, but there are many places (especially for travel or official places) where that pretty much resets the whole flow.
There are also a lot more popups, captchas and other hoops to jump through when you don’t have a cookies trail.
The average user is lazy and doesn’t think about these things, so the web as a whole is moving in the direction of making things easier (but not simpler). This is usually viewed as a good thing by those who then only need to click a single button. Though it’s at the cost of those who want to have more control.
It might not be inconvenient to you, especially as it’s your basic flow. It’s inconvenient for me, but worth the cost, but basically unusable for most of the people I know (compared to the default flow).