RSS

7vik

Karma: 115

I research intelligence and it’s emergence and expression in neural networks to ensure advanced AI is safe and beneficial.

Current interests: neural network interpretability, alignment/​safety, unsupervised learning, and deep learning theory.

For more, check out my scholar profile and personal website.

In­tri­ca­cies of Fea­ture Geom­e­try in Large Lan­guage Models

7 Dec 2024 18:10 UTC
59 points
0 comments12 min readLW link

The Geom­e­try of Feel­ings and Non­sense in Large Lan­guage Models

27 Sep 2024 17:49 UTC
58 points
10 comments4 min readLW link