Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
porby
Karma:
1,879
All
Posts
Comments
New
Top
Old
Soft Prompts for Evaluation: Measuring Conditional Distance of Capabilities
porby
Feb 2, 2024, 5:49 AM
47
points
1
comment
4
min read
LW
link
(1drv.ms)
FAQ: What the heck is goal agnosticism?
porby
Oct 8, 2023, 7:11 PM
66
points
38
comments
28
min read
LW
link
A plea for more funding shortfall transparency
porby
Aug 7, 2023, 9:33 PM
73
points
4
comments
2
min read
LW
link
Using predictors in corrigible systems
porby
Jul 19, 2023, 10:29 PM
19
points
6
comments
27
min read
LW
link
One path to coherence: conditionalization
porby
Jun 29, 2023, 1:08 AM
28
points
4
comments
4
min read
LW
link
One implementation of regulatory GPU restrictions
porby
Jun 4, 2023, 8:34 PM
42
points
6
comments
5
min read
LW
link
porby’s Shortform
porby
May 24, 2023, 9:34 PM
6
points
20
comments
LW
link
Implied “utilities” of simulators are broad, dense, and shallow
porby
Mar 1, 2023, 3:23 AM
45
points
7
comments
3
min read
LW
link
Instrumentality makes agents agenty
porby
Feb 21, 2023, 4:28 AM
20
points
7
comments
6
min read
LW
link
[Question]
How would you use video gamey tech to help with AI safety?
porby
Feb 9, 2023, 12:20 AM
9
points
5
comments
1
min read
LW
link
Against Boltzmann mesaoptimizers
porby
Jan 30, 2023, 2:55 AM
77
points
6
comments
4
min read
LW
link
FFMI Gains: A List of Vitalities
porby
Jan 12, 2023, 4:48 AM
26
points
3
comments
7
min read
LW
link
Simulators, constraints, and goal agnosticism: porbynotes vol. 1
porby
Nov 23, 2022, 4:22 AM
37
points
2
comments
35
min read
LW
link
Am I secretly excited for AI getting weird?
porby
Oct 29, 2022, 10:16 PM
116
points
4
comments
4
min read
LW
link
Why I think strong general AI is coming soon
porby
Sep 28, 2022, 5:40 AM
337
points
141
comments
34
min read
LW
link
1
review
Private alignment research sharing and coordination
porby
Sep 4, 2022, 12:01 AM
62
points
13
comments
5
min read
LW
link
Back to top