AI assistants are weird. Here’s a Perplexity Pro search I did for an EY tweet about finding the sweet spot between utilitarianism & deontology. Perplexity Pro immediately found the correct tweet:
Eliezer Yudkowsky, a prominent figure in the rationalist community, has indeed expressed a view that suggests finding a balance between utilitarianism and deontology. In a tweet, he stated: “Go three-quarters of the way from deontology to utilitarianism and then stop. You are now in the right place. Stay there at least until you...”
But I wondered why it cut the quote short (the rest is just a few more words, namely “Stay there at least until you have become a god.”), and I just couldn’t get it to provide the full version, even with requests like “Just quote the full tweet from here: <URL>”. Instead, it invented alternative endings like this:
Go three-quarters of the way from deontology to utilitarianism and then stop. You are now in the right place. Stay there at least until you understand why.
or this:
Go three-quarters of the way from deontology to utilitarianism and then stop. You are now in the correct place. Stay there at least until you understand why you shouldn’t go any further.
I finally provided the full quote and asked it directly:
Does the following quote represent Yudkowsky’s tweet with 100% accuracy?
“Go three-quarters of the way from deontology to utilitarianism and then stop. You are now in the right place. Stay there at least until you have become a god.”
And it still doubled down on the wrong version.