Logan Riggs

Karma: 3,074

[Simulators seminar sequence] #1 Background & shared assumptions

Jan, Charlie Steiner, Logan Riggs, janus, jacquesthibs, metasemi, Michael Oesterle, Lucas Teixeira, peligrietzer and remember

Jan 2, 2023, 11:48 PM

50 points

4 comments3 min readLW link

Results from a survey on tool use and workflows in alignment research

jacquesthibs, Jan, janus and Logan Riggs

Dec 19, 2022, 3:19 PM

79 points

2 comments19 min readLW link

A descriptive, not prescriptive, overview of current AI Alignment Research

Jan, Logan Riggs, jacquesthibs and janus

Jun 6, 2022, 9:59 PM

139 points

21 comments7 min readLW link

Frame for Take-Off Speeds to inform compute governance & scaling alignment

Logan RiggsMay 13, 2022, 10:23 PM

15 points

2 comments2 min readLW link

Alignment as Constraints

Logan RiggsMay 13, 2022, 10:07 PM

10 points

0 comments2 min readLW link

Make a Movie Showing Alignment Failures

Logan RiggsApr 13, 2022, 9:54 PM

75 points

11 comments2 min readLW link

Convincing People of Alignment with Street Epistemology

Logan RiggsApr 12, 2022, 11:43 PM

54 points

4 comments3 min readLW link

Roam Research Mobile is Out!

Logan RiggsApr 8, 2022, 7:05 PM

12 points

0 comments1 min readLW link

Convincing All Capability Researchers

Logan RiggsApr 8, 2022, 5:40 PM

120 points

70 comments3 min readLW link

Language Model Tools for Alignment Research

Logan RiggsApr 8, 2022, 5:32 PM

28 points

0 comments2 min readLW link

5-Minute Advice for EA Global

Logan RiggsApr 5, 2022, 10:33 PM

16 points

2 comments2 min readLW link

A survey of tool use and workflows in alignment research

Logan Riggs, Jan, janus and jacquesthibs

Mar 23, 2022, 11:44 PM

45 points

4 comments1 min readLW link

Some (potentially) fundable AI Safety Ideas

Logan RiggsMar 16, 2022, 12:48 PM

22 points

5 comments5 min readLW link

Solving Interpretability Week

Logan RiggsDec 13, 2021, 5:09 PM

11 points

5 comments1 min readLW link

Solve Corrigibility Week

Logan RiggsNov 28, 2021, 5:00 PM

39 points

21 comments1 min readLW link

[Question] What Heuristics Do You Use to Think About Alignment Topics?

Logan RiggsSep 29, 2021, 2:31 AM

5 points

3 comments1 min readLW link

Wanting to Succeed on Every Metric Presented

Logan Riggs12 Apr 2021 20:43 UTC

72 points

25 comments3 min readLW link

Using GPT-N to Solve Interpretability of Neural Networks: A Research Agenda

Logan Riggs and Gurkenglas

3 Sep 2020 18:27 UTC

68 points

11 comments2 min readLW link

[Question] What’s a Decomposable Alignment Topic?

Logan Riggs21 Aug 2020 22:57 UTC

26 points

16 comments1 min readLW link

Mapping Out Alignment

Logan Riggs, adamShimi, Gurkenglas, AlexMennen and Gyrodiot

15 Aug 2020 1:02 UTC

43 points

0 comments5 min readLW link