Pierre Peigné
Karma: 119
Investing in Robust Safety Mechanisms is critical for reducing Systemic Risks
Tom DAVID, Pierre Peigné, Quentin FEUILLADE--MONTIXI, Kay Kozaronek, and Miailhe Nicolas
Dec 11, 2024, 1:37 PM
8 points, 3 comments, 2 min read
Workshop Report: Why current benchmarks approaches are not sufficient for safety?
Tom DAVID and Pierre Peigné
Nov 26, 2024, 5:20 PM
3 points, 1 comment, 3 min read
The Stochastic Parrot Hypothesis is debatable for the last generation of LLMs
Quentin FEUILLADE--MONTIXI and Pierre Peigné
Nov 7, 2023, 4:12 PM
52 points, 21 comments, 6 min read
Taking features out of superposition with sparse autoencoders more quickly with informed initialization
Pierre Peigné
Sep 23, 2023, 4:21 PM
30 points, 8 comments, 5 min read
Clarifying mesa-optimization
Marius Hobbhahn and Pierre Peigné
Mar 21, 2023, 3:53 PM
38 points, 6 comments, 10 min read
Pierre Peigné's Shortform
Pierre Peigné
Feb 4, 2023, 3:22 AM
1 point, 1 comment