
Jaehyuk Lim

Karma: 37

Inner-aligning AI and trying to ask better questions.

Jailbreaking ChatGPT and Claude using Web API Context Injection

Jaehyuk Lim · Oct 21, 2024, 9:34 PM
4 points
0 comments · 3 min read · LW link

HDBSCAN is Surprisingly Effective at Finding Interpretable Clusters of the SAE Decoder Matrix

Oct 11, 2024, 11:06 PM
8 points
2 comments · 10 min read · LW link

Biasing VLM Response with Visual Stimuli

Jaehyuk Lim · Oct 3, 2024, 6:04 PM
5 points
0 comments · 8 min read · LW link

[Question] SAE sparse feature graph using only residual layers

Jaehyuk Lim · May 23, 2024, 1:32 PM
0 points
3 comments · 1 min read · LW link

Identifying Micro-friction in the Context of the Anterior Mid-Cingulate Cortex (aMCC)

Jaehyuk Lim · Mar 29, 2024, 10:11 PM
3 points
0 comments · 3 min read · LW link

Language Models Don’t Learn the Physical Manifestation of Language

Feb 22, 2024, 6:52 PM
39 points
23 comments · 1 min read · LW link
(arxiv.org)