Team shard is now accepting applications for summer MATS. SERI MATS is now accepting applications for their 4.0 program this summer. In particular, consider applying to the shard theory stream, especially if you have the following interests:
Steering language models via editing of forward passes,
Feel free to apply if you’re interested in shard theory more generally, although I expect to mostly supervise empirical work. Feel free to message me if you have questions!
Team shard is now accepting applications for summer MATS. SERI MATS is now accepting applications for their 4.0 program this summer. In particular, consider applying to the shard theory stream, especially if you have the following interests:
Mechanistic interpretability on RL agents,
In particular, how and why algebraic value editing works
Steering language models via editing of forward passes,
Feel free to apply if you’re interested in shard theory more generally, although I expect to mostly supervise empirical work. Feel free to message me if you have questions!