RSS

Gunnar_Zarncke

Karma: 10,953

Software engineering, parenting, cognition, meditation, other
Linkedin, Facebook, Admonymous (anonymous feedback)

Pa­ram­e­ters of Me­tacog­ni­tion—The Anes­the­sia Patient

Gunnar_Zarncke9 Jan 2026 1:20 UTC
21 points
0 comments8 min readLW link

Un­su­per­vised Agent Discovery

Gunnar_Zarncke22 Dec 2025 22:01 UTC
22 points
0 comments6 min readLW link

HERMES: Towards Effi­cient and Ver­ifi­able Math­e­mat­i­cal Rea­son­ing in LLMs

Gunnar_Zarncke1 Dec 2025 10:07 UTC
8 points
0 comments1 min readLW link
(arxiv.org)

AI Safety Interventions

Gunnar_Zarncke24 Nov 2025 22:28 UTC
29 points
0 comments10 min readLW link

Thou art rain­bow: Con­scious­ness as a Self-Refer­en­tial Phys­i­cal Process

Gunnar_Zarncke24 Nov 2025 22:23 UTC
28 points
13 comments7 min readLW link

Vic­tor Taelin’s notes on Gem­ini 3

Gunnar_Zarncke18 Nov 2025 18:30 UTC
32 points
1 comment3 min readLW link
(x.com)

[Linkpost] Com­pet­ing Mo­ti­va­tions: When More In­cen­tives Lead To Less Effort

Gunnar_Zarncke4 Nov 2025 23:02 UTC
11 points
0 comments1 min readLW link
(x.com)

When “HDMI-1” Lies To You

Gunnar_Zarncke30 Oct 2025 12:23 UTC
18 points
0 comments1 min readLW link

[Question] Is there a safe ver­sion of the com­mon crawl?

Gunnar_Zarncke12 Aug 2025 14:56 UTC
22 points
6 comments1 min readLW link

[Linkpost] How Am I Get­ting Along with AI?

Gunnar_Zarncke18 Jul 2025 22:26 UTC
11 points
0 comments1 min readLW link
(jessiefischbein.substack.com)

Hy­brid model re­veals peo­ple act less ra­tio­nally in com­plex games, more pre­dictably in sim­ple ones

Gunnar_Zarncke9 Jul 2025 10:15 UTC
9 points
0 comments1 min readLW link
(arxiv.org)

Pro­ject Vend: Can Claude run a small shop?

Gunnar_Zarncke30 Jun 2025 15:22 UTC
53 points
8 comments1 min readLW link
(www.anthropic.com)

[Linkpost] The lethal trifecta for AI agents: pri­vate data, un­trusted con­tent, and ex­ter­nal communication

Gunnar_Zarncke17 Jun 2025 16:09 UTC
13 points
3 comments1 min readLW link
(simonwillison.net)

Un­ex­pected Con­scious Entities

Gunnar_Zarncke5 May 2025 22:14 UTC
34 points
7 comments6 min readLW link

[Linkpost] The value of ini­ti­at­ing a pur­suit in tem­po­ral de­ci­sion-making

Gunnar_Zarncke27 Mar 2025 21:47 UTC
13 points
0 comments2 min readLW link

Mis­tral Large 2 (123B) seems to ex­hibit al­ign­ment faking

27 Mar 2025 15:39 UTC
81 points
4 comments13 min readLW link

Ham­burg – ACX Mee­tups Every­where Spring 2025

Gunnar_Zarncke25 Mar 2025 23:48 UTC
9 points
0 comments1 min readLW link

Re­duc­ing LLM de­cep­tion at scale with self-other over­lap fine-tuning

13 Mar 2025 19:09 UTC
162 points
46 comments6 min readLW link

RL, but don’t do any­thing I wouldn’t do

Gunnar_Zarncke7 Dec 2024 22:54 UTC
63 points
5 comments1 min readLW link
(arxiv.org)

[Linkpost] Build­ing Altru­is­tic and Mo­ral AI Agent with Brain-in­spired Affec­tive Em­pa­thy Mechanisms

Gunnar_Zarncke4 Nov 2024 10:15 UTC
13 points
0 comments1 min readLW link
(arxiv.org)