RSS

Re­spon­si­ble Scal­ing Policies

TagLast edit: Oct 27, 2023, 7:43 PM by elifland

As proposed by ARC Evals, and with a version implemented by Anthropic

Vaniver’s thoughts on An­thropic’s RSP

VaniverOct 28, 2023, 9:06 PM
46 points
4 comments3 min readLW link

AI #35: Re­spon­si­ble Scal­ing Policies

ZviOct 26, 2023, 1:30 PM
66 points
10 comments55 min readLW link
(thezvi.wordpress.com)

RSPs are pauses done right

evhubOct 14, 2023, 4:06 AM
164 points
73 comments7 min readLW link1 review

What’s up with “Re­spon­si­ble Scal­ing Poli­cies”?

Oct 29, 2023, 4:17 AM
99 points
9 comments20 min readLW link1 review

Re­spon­si­ble Scal­ing Poli­cies Are Risk Man­age­ment Done Wrong

simeon_cOct 25, 2023, 11:46 PM
123 points
35 comments22 min readLW link1 review
(www.navigatingrisks.ai)

Thoughts on re­spon­si­ble scal­ing poli­cies and regulation

paulfchristianoOct 24, 2023, 10:21 PM
221 points
33 comments6 min readLW link

We’re Not Ready: thoughts on “paus­ing” and re­spon­si­ble scal­ing policies

HoldenKarnofskyOct 27, 2023, 3:19 PM
200 points
33 comments8 min readLW link

On ‘Re­spon­si­ble Scal­ing Poli­cies’ (RSPs)

ZviDec 5, 2023, 4:10 PM
48 points
3 comments37 min readLW link
(thezvi.wordpress.com)

ARC Evals: Re­spon­si­ble Scal­ing Policies

Zach Stein-PerlmanSep 28, 2023, 4:30 AM
40 points
10 comments2 min readLW link1 review
(evals.alignment.org)

An­thropic: Reflec­tions on our Re­spon­si­ble Scal­ing Policy

Zac Hatfield-DoddsMay 20, 2024, 4:14 AM
30 points
21 comments10 min readLW link
(www.anthropic.com)

OMMC An­nounces RIP

Apr 1, 2024, 11:20 PM
189 points
5 comments2 min readLW link

Dario Amodei’s pre­pared re­marks from the UK AI Safety Sum­mit, on An­thropic’s Re­spon­si­ble Scal­ing Policy

Zac Hatfield-DoddsNov 1, 2023, 6:10 PM
85 points
1 comment4 min readLW link
(www.anthropic.com)

Paul Chris­ti­ano on Dwarkesh Podcast

ESRogsNov 3, 2023, 10:13 PM
19 points
0 comments1 min readLW link
(www.dwarkeshpatel.com)

An­thropic’s up­dated Re­spon­si­ble Scal­ing Policy

Zac Hatfield-DoddsOct 15, 2024, 4:46 PM
52 points
3 comments3 min readLW link
(www.anthropic.com)

An­thropic rewrote its RSP

Zach Stein-PerlmanOct 15, 2024, 2:25 PM
46 points
19 comments6 min readLW link

OpenAI’s Pre­pared­ness Frame­work: Praise & Recommendations

AkashJan 2, 2024, 4:20 PM
66 points
1 comment7 min readLW link

Meta: Fron­tier AI Framework

Zach Stein-PerlmanFeb 3, 2025, 10:00 PM
33 points
2 comments1 min readLW link
(ai.meta.com)

On OpenAI’s Pre­pared­ness Framework

ZviDec 21, 2023, 2:00 PM
51 points
4 comments21 min readLW link
(thezvi.wordpress.com)

OpenAI: Pre­pared­ness framework

Zach Stein-PerlmanDec 18, 2023, 6:30 PM
70 points
23 comments4 min readLW link
(openai.com)

An­thropic’s Re­spon­si­ble Scal­ing Policy & Long-Term Benefit Trust

Zac Hatfield-DoddsSep 19, 2023, 3:09 PM
83 points
26 comments3 min readLW link1 review
(www.anthropic.com)

A call for a quan­ti­ta­tive re­port card for AI bioter­ror­ism threat models

JunoDec 4, 2023, 6:35 AM
12 points
0 comments10 min readLW link

An­thropic—The case for tar­geted regulation

anagumaNov 5, 2024, 7:07 AM
11 points
0 comments2 min readLW link
(www.anthropic.com)

How are vol­un­tary com­mit­ments on vuln­er­a­bil­ity re­port­ing go­ing?

Adam JonesFeb 22, 2024, 8:43 AM
23 points
1 comment1 min readLW link
(adamjones.me)
No comments.