Could you maybe add a paragraph (or comment) how exactly you define “conceptual” alignment research? What would be an example of alignment research that is not conceptual?
Basically the distinction is relevant because there are definitely more and more people working on alignment, but the vast majority of the increase actually doesn’t focus on formulating solution or deconfusing the main notions; instead they mostly work on (often relevant) experiments and empirical questions related to alignment.
I see many incorrect assumptions about what it takes to be a good conceptual researcher floating around [...] you can pick up the relevant part [of ML] and just work on approaches different to pure prosaic alignment
This seemed to imply that you might be a conceptual alignment researcher, but also work on pure prosaic alignment, which was the point were I thought: Ok, maybe I don’t know what “conceptual alignment research” means. But the link definitely clears it up, thank you!
Yeah, I see how it can be confusing. To give an example, Paul Christiano focuses on prosaic alignment (he even coined the term) yet his work is mostly on the conceptual side. So I don’t see the two as in conflict.
Could you maybe add a paragraph (or comment) how exactly you define “conceptual” alignment research? What would be an example of alignment research that is not conceptual?
Maybe I should have added this link. ;)
Basically the distinction is relevant because there are definitely more and more people working on alignment, but the vast majority of the increase actually doesn’t focus on formulating solution or deconfusing the main notions; instead they mostly work on (often relevant) experiments and empirical questions related to alignment.
This seemed to imply that you might be a conceptual alignment researcher, but also work on pure prosaic alignment, which was the point were I thought: Ok, maybe I don’t know what “conceptual alignment research” means. But the link definitely clears it up, thank you!
Yeah, I see how it can be confusing. To give an example, Paul Christiano focuses on prosaic alignment (he even coined the term) yet his work is mostly on the conceptual side. So I don’t see the two as in conflict.