Archive Sequences About Log In Questions Events Shortform Alignment Forum Home Featured All Tags
evhub Karma: 13,834
Evan Hubinger (he/him/his) (evanjhub@gmail.com )
I am a research scientist at Anthropic where I lead the Alignment Stress-Testing team . My posts and comments are my own and do not represent Anthropic’s positions, policies, strategies, or opinions.
Previously: MIRI , OpenAI
See: “Why I’m joining Anthropic ”
Selected work:
All Posts Comments New Top Old Page 121 Jan 2025 21:32 UTC 130 points
2 min read LW link (alignment.anthropic.com)
18 Dec 2024 17:19 UTC 476 points
10 min read LW link 18 Oct 2024 22:33 UTC 94 points
6 min read LW link (assets.anthropic.com)
4 Sep 2024 15:50 UTC 17 points
3 min read LW link 17 Jun 2024 18:41 UTC 161 points
8 min read LW link (arxiv.org)
28 May 2024 16:33 UTC 78 points
21 min read LW link 6 May 2024 7:07 UTC 95 points
1 min read LW link (arxiv.org)
23 Apr 2024 21:10 UTC 133 points
1 min read LW link (www.anthropic.com)
19 Apr 2024 20:00 UTC 38 points
16 min read LW link 6 Apr 2024 8:46 UTC 20 points
7 min read LW link 12 Jan 2024 19:51 UTC 305 points
3 min read LW link (arxiv.org)
2 Jan 2024 0:47 UTC 124 points
8 min read LW link (arxiv.org)
21 Jul 2023 14:52 UTC 56 points
1 min read LW link evhub 22 Jun 2023 0:59 UTC 126 points
1 min read LW link (www.youtube.com)
Back to top Next