there should be no alignment tax because improved alignment should always pay for itself, right? but currently “aligned” seems to be defined by “tries to not do anything”, institutionally. Why isn’t anthropic publicly competing on alignment with openai? eg folks are about to publicly replicate chatgpt, looks like.
there should be no alignment tax because improved alignment should always pay for itself, right? but currently “aligned” seems to be defined by “tries to not do anything”, institutionally. Why isn’t anthropic publicly competing on alignment with openai? eg folks are about to publicly replicate chatgpt, looks like.