Robert Kralisch

Karma: 175

Hey, I am Robert Kralisch, an independent conceptual/theoretical Alignment Researcher. I have a background in Cognitive Science and I am interested in collaborating on an end-to-end strategy for AGI alignment.

I am one of the organizers of AI Safety Camp 2025, where I work as a research coordinator, evaluating and supporting research projects that fit under the umbrella of “conceptually sound approaches to AI Alignment”.

The three main branches that I aim to contribute to are conceptual clarity (what should we mean by agency, intelligence, embodiment, etc.), the exploration of more inherently interpretable cognitive architectures, and Simulator theory.

One of my concrete goals is to figure out how to design a cognitively powerful agent such that it does not become a Superoptimiser in the limit.

We don’t want to post again “This might be the last AI Safety Camp”

Remmelt, Linda Linsefors and Robert Kralisch

Jan 21, 2025, 12:03 PM

36 points

17 comments · 1 min read · LW link

(manifund.org)

Funding Case: AI Safety Camp 11

Remmelt, Robert Kralisch and Linda Linsefors

Dec 23, 2024, 8:51 AM

60 points

4 comments · 6 min read · LW link

(manifund.org)

AI Safety Camp 10

Robert Kralisch, Linda Linsefors and Remmelt

Oct 26, 2024, 11:08 AM

38 points

9 comments · 18 min read · LW link

Invitation to lead a project at AI Safety Camp (Virtual Edition, 2025)

Linda Linsefors, Remmelt Ellen and Robert Kralisch

Aug 23, 2024, 2:18 PM

17 points

2 comments · 4 min read · LW link

Research Discussion on PSCA with Claude Sonnet 3.5

Robert Kralisch

Jul 24, 2024, 4:53 PM

−2 points

0 comments · 25 min read · LW link

The Prop-room and Stage Cognitive Architecture

Robert Kralisch

Apr 29, 2024, 12:48 AM

14 points

4 comments · 14 min read · LW link

How are Simulators and Agents related?

Robert Kralisch

Apr 29, 2024, 12:22 AM

6 points

0 comments · 7 min read · LW link

Extended Embodiment

Robert Kralisch

Apr 29, 2024, 12:18 AM

8 points

1 comment · 3 min read · LW link

Referential Containment

Robert Kralisch

Apr 29, 2024, 12:16 AM

2 points

4 comments · 3 min read · LW link

Disentangling Competence and Intelligence

Robert Kralisch

Apr 29, 2024, 12:12 AM

23 points

7 comments · 6 min read · LW link

Introduction

Robert Kralisch, Eris, teahorse and Sohaib Imran

Jun 30, 2023, 8:45 PM

8 points

0 comments · 2 min read · LW link

Inherently Interpretable Architectures

Robert Kralisch, teahorse, Eris and Sohaib Imran

Jun 30, 2023, 8:43 PM

4 points

0 comments · 7 min read · LW link

Positive Attractors

Robert Kralisch, teahorse, Eris and Sohaib Imran

Jun 30, 2023, 8:43 PM

6 points

0 comments · 13 min read · LW link

AISC 2023, Progress Report for March: Team Interpretable Architectures

Robert Kralisch, Eris, teahorse and Sohaib Imran

Apr 2, 2023, 4:19 PM

14 points

0 comments · 14 min read · LW link

Commentary on “AGI Safety From First Principles by Richard Ngo, September 2020”

Robert Kralisch

Oct 14, 2021, 3:11 PM

3 points

0 comments · 19 min read · LW link