Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
SoerenMind
Karma:
1,181
All
Posts
Comments
New
Top
Old
Page
1
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
JanB
,
Owain_Evans
and
SoerenMind
28 Sep 2023 18:53 UTC
187
points
39
comments
3
min read
LW
link
1
review
Wikipedia as an introduction to the alignment problem
SoerenMind
29 May 2023 18:43 UTC
83
points
10
comments
1
min read
LW
link
(en.wikipedia.org)
The Alignment Problem from a Deep Learning Perspective (major rewrite)
SoerenMind
,
Richard_Ngo
and
LawrenceC
10 Jan 2023 16:06 UTC
84
points
8
comments
39
min read
LW
link
(arxiv.org)
[Question]
How much to optimize for the short-timelines scenario?
SoerenMind
21 Jul 2022 10:47 UTC
20
points
3
comments
1
min read
LW
link
Inference cost limits the impact of ever larger models
SoerenMind
23 Oct 2021 10:51 UTC
42
points
29
comments
2
min read
LW
link
SoerenMind’s Shortform
SoerenMind
11 Jun 2021 20:19 UTC
5
points
2
comments
1
min read
LW
link
FHI paper published in Science: interventions against COVID-19
SoerenMind
16 Dec 2020 21:19 UTC
119
points
0
comments
3
min read
LW
link
How to do remote co-working
SoerenMind
8 May 2020 19:38 UTC
25
points
11
comments
1
min read
LW
link
[Question]
How important are model sizes to your timeline predictions?
SoerenMind
5 Sep 2019 17:34 UTC
11
points
1
comment
1
min read
LW
link
[Question]
What are some good examples of gaming that is hard to detect?
SoerenMind
16 May 2019 16:10 UTC
5
points
3
comments
1
min read
LW
link
[Question]
Any rebuttals of Christiano and AI Impacts on takeoff speeds?
SoerenMind
21 Apr 2019 20:39 UTC
67
points
26
comments
1
min read
LW
link
Some intuition on why consciousness seems subjective
SoerenMind
27 Jul 2018 22:37 UTC
20
points
10
comments
7
min read
LW
link
Updating towards the simulation hypothesis because you think about AI
SoerenMind
5 Mar 2016 22:23 UTC
11
points
21
comments
3
min read
LW
link
Working at MIRI: An interview with Malo Bourgon
SoerenMind
1 Nov 2015 12:54 UTC
13
points
2
comments
4
min read
LW
link
Meetup : ‘The Most Good Good You Can Do’ (Effective Altruism meetup)
SoerenMind
14 May 2015 18:32 UTC
2
points
0
comments
1
min read
LW
link
Meetup : Utrecht- Brainstorm and ethics discussion at the Film Café
SoerenMind
19 May 2014 20:49 UTC
2
points
2
comments
1
min read
LW
link
Meetup : Utrecht—Social discussion at the Film Café
SoerenMind
12 May 2014 13:10 UTC
2
points
0
comments
1
min read
LW
link
Meetup : Utrecht
SoerenMind
20 Apr 2014 10:14 UTC
3
points
2
comments
1
min read
LW
link
Meetup : Utrecht: Behavioural economics, game theory...
SoerenMind
7 Apr 2014 13:54 UTC
5
points
1
comment
1
min read
LW
link
Meetup : Utrecht: More on effective altruism
SoerenMind
27 Mar 2014 0:40 UTC
3
points
3
comments
1
min read
LW
link
Back to top
Next