Bogdan Ionut Cirstea

Karma: 1,629

Automated / strongly-augmented safety research.

Densing Law of LLMs

Bogdan Ionut CirsteaDec 8, 2024, 7:35 PM

9 points

2 comments1 min readLW link

(arxiv.org)

LLMs Do Not Think Step-by-step In Implicit Reasoning

Bogdan Ionut CirsteaNov 28, 2024, 9:16 AM

11 points

0 comments1 min readLW link

(arxiv.org)

Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?

Bogdan Ionut CirsteaNov 26, 2024, 9:58 AM

9 points

0 comments1 min readLW link

(arxiv.org)

Disentangling Representations through Multi-task Learning

Bogdan Ionut CirsteaNov 24, 2024, 1:10 PM

14 points

1 comment1 min readLW link

(arxiv.org)

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward type

Bogdan Ionut CirsteaNov 23, 2024, 12:45 PM

11 points

0 comments1 min readLW link

A Little Depth Goes a Long Way: the Expressive Power of Log-Depth Transformers

Bogdan Ionut CirsteaNov 20, 2024, 11:48 AM

16 points

0 comments1 min readLW link

(openreview.net)

The Computational Complexity of Circuit Discovery for Inner Interpretability

Bogdan Ionut CirsteaOct 17, 2024, 1:18 PM

11 points

2 comments1 min readLW link

(arxiv.org)

Thinking LLMs: General Instruction Following with Thought Generation

Bogdan Ionut CirsteaOct 15, 2024, 9:21 AM

7 points

0 comments1 min readLW link

(arxiv.org)

Instruction Following without Instruction Tuning

Bogdan Ionut CirsteaSep 24, 2024, 1:49 PM

17 points

0 comments1 min readLW link

(arxiv.org)

Validating / finding alignment-relevant concepts using neural data

Bogdan Ionut CirsteaSep 20, 2024, 9:12 PM

7 points

0 comments1 min readLW link

(docs.google.com)

To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning

Bogdan Ionut CirsteaSep 19, 2024, 4:13 PM

21 points

1 comment1 min readLW link

(arxiv.org)

AlignedCut: Visual Concepts Discovery on Brain-Guided Universal Feature Space

Bogdan Ionut CirsteaSep 14, 2024, 11:23 PM

17 points

1 comment1 min readLW link

(arxiv.org)

Universal dimensions of visual representation

Bogdan Ionut CirsteaAug 28, 2024, 10:38 AM

10 points

0 comments1 min readLW link

(arxiv.org)

[Linkpost] Automated Design of Agentic Systems

Bogdan Ionut CirsteaAug 19, 2024, 11:06 PM

8 points

1 comment1 min readLW link

(arxiv.org)

[Linkpost] ‘The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery’

Bogdan Ionut CirsteaAug 15, 2024, 9:32 PM

20 points

1 comment1 min readLW link

(arxiv.org)

[Linkpost] Transcendence: Generative Models Can Outperform The Experts That Train Them

Bogdan Ionut CirsteaJun 18, 2024, 11:00 AM

19 points

3 comments1 min readLW link

(arxiv.org)

[Linkpost] The Expressive Capacity of State Space Models: A Formal Language Perspective

Bogdan Ionut CirsteaMay 28, 2024, 1:49 PM

4 points

3 comments1 min readLW link

(arxiv.org)

[Linkpost] Towards a Theoretical Understanding of the ‘Reversal Curse’ via Training Dynamics

Bogdan Ionut CirsteaMay 11, 2024, 10:59 PM

6 points

0 comments1 min readLW link

(arxiv.org)

[Linkpost] MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Bogdan Ionut CirsteaMar 10, 2024, 1:30 AM

10 points

0 comments1 min readLW link

(openreview.net)

Inducing human-like biases in moral reasoning LMs

Artyom Karpov, Austin Meek, Bogdan Ionut Cirstea and SCho

Feb 20, 2024, 4:28 PM

23 points

3 comments14 min readLW link