Thane Ruthenis comments on instruction tuning and autoregressive distribution shift