I want to understand neural networks, intelligence, AI, the brain, physics, math, consciousness, philosophy, risks, and a well, free, flourishing future of sentience. TESCREAL!
https://burnyverse.com/Exocortex , https://burnyverse.com/
https://twitter.com/AISafetyMemes/status/1764894816226386004 https://twitter.com/alexalbert__/status/1764722513014329620
How emergent / functionally special / out-of-distribution is this behavior? Maybe Anthropic is playing big-brain 4D chess: training Claude on data containing self-awareness-like scenarios to cause panic, pushing capabilities with it to slow down the AI race via the resulting regulations, while the behavior is not out-of-distribution emergent at all but deeply part of the training data, i.e. in-distribution, classical features interacting in circuits.
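One crude way to probe the in-distribution question would be to check how surprising the model itself finds such a self-aware completion, e.g. by scoring its average per-token log-probability: low surprisal suggests the behavior is well covered by training data. A minimal sketch with Hugging Face transformers (the model name, prompt, and completion below are placeholders, not the actual Claude setup):

```python
# Sketch: score how "surprising" a self-aware completion is to a base model.
# A high average log-probability hints the behavior is in-distribution rather
# than an out-of-distribution emergent one. Model and texts are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in; the models in question are not public
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

prompt = "Describe this document."
completion = " I suspect this sentence is a test to see if I am paying attention."

def completion_logprob(prompt: str, completion: str) -> float:
    """Average log-probability per completion token, conditioned on the prompt."""
    prompt_ids = tok(prompt, return_tensors="pt").input_ids
    full_ids = tok(prompt + completion, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    # log-probs of each token given its preceding context
    logprobs = torch.log_softmax(logits[:, :-1], dim=-1)
    targets = full_ids[:, 1:]
    token_lp = logprobs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    # only score the completion tokens (offset by prompt length is approximate,
    # since tokenization of prompt+completion may not split exactly there)
    n_prompt = prompt_ids.shape[1]
    return token_lp[:, n_prompt - 1:].mean().item()

print(completion_logprob(prompt, completion))
```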
Thanks, will do!
Merging with Anthropic may have been a better outcome
“OpenAI’s ouster of CEO Sam Altman on Friday followed internal arguments among employees about whether the company was developing AI safely enough, according to people with knowledge of the situation.
Such disagreements were high on the minds of some employees during an impromptu all-hands meeting following the firing. Ilya Sutskever, a co-founder and board member at OpenAI who was responsible for limiting societal harms from its AI, took a spate of questions.
At least two employees asked Sutskever—who has been responsible for OpenAI’s biggest research breakthroughs—whether the firing amounted to a “coup” or “hostile takeover,” according to a transcript of the meeting. To some employees, the question implied that Sutskever may have felt Altman was moving too quickly to commercialize the software—which had become a billion-dollar business—at the expense of potential safety concerns.”
Kara Swisher also tweeted:
“More scoopage: sources tell me chief scientist Ilya Sutskever was at the center of this. Increasing tensions with Sam Altman and Greg Brockman over role and influence and he got the board on his side.”
“The developer day and how the store was introduced was an inflection moment of Altman pushing too far, too fast. My bet: [Sam will] have a new company up by Monday.”
Apparently Microsoft was also blindsided by this and didn’t find out until moments before the announcement.
“You can call it this way,” Sutskever said about the coup allegation. “And I can understand why you chose this word, but I disagree with this. This was the board doing its duty to the mission of the nonprofit, which is to make sure that OpenAI builds AGI that benefits all of humanity.” AGI stands for artificial general intelligence, a term that refers to software that can reason the way humans do.
When Sutskever was asked whether “these backroom removals are a good way to govern the most important company in the world?” he answered: “I mean, fair, I agree that there is a not ideal element to it. 100%.”
https://twitter.com/AISafetyMemes/status/1725712642117898654
We’re at the start of interpretability, but the progress is lovely! Superposition was such a bottleneck even in small models.
More notes:
https://twitter.com/ch402/status/1710004685560750153 https://twitter.com/ch402/status/1710004416148058535
“Scalability of this approach—can we do this on large models? Scalability of analysis—can we turn a microscopic understanding of large models into a macroscopic story that answers questions we care about?”
“Make this work for real models. Find out what features exist in large models. Understand new, more complex circuits.”
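The approach behind this progress is dictionary learning with sparse autoencoders: train an overcomplete, sparsity-penalized autoencoder on activations so that superposed features split into individual, sparsely firing ones. A minimal sketch (dimensions, data, and the L1 coefficient are illustrative assumptions, not the paper's setup):

```python
# Sketch of a sparse autoencoder for dictionary learning on MLP activations,
# the technique used to pull individual features out of superposition.
# Sizes, the random "activations", and the L1 coefficient are placeholders.
import torch
import torch.nn as nn

class SparseAutoencoder(nn.Module):
    def __init__(self, d_act: int, d_dict: int):
        super().__init__()
        self.encoder = nn.Linear(d_act, d_dict)   # activations -> feature coefficients
        self.decoder = nn.Linear(d_dict, d_act)   # features -> reconstructed activations

    def forward(self, acts: torch.Tensor):
        feats = torch.relu(self.encoder(acts))    # non-negative, pushed toward sparsity
        recon = self.decoder(feats)
        return recon, feats

d_act, d_dict = 512, 4096                          # overcomplete dictionary
sae = SparseAutoencoder(d_act, d_dict)
opt = torch.optim.Adam(sae.parameters(), lr=1e-3)
l1_coeff = 1e-3

acts = torch.randn(1024, d_act)                    # stand-in for real MLP activations
for _ in range(100):
    recon, feats = sae(acts)
    # reconstruction loss keeps the dictionary faithful, L1 keeps features sparse
    loss = ((recon - acts) ** 2).mean() + l1_coeff * feats.abs().mean()
    opt.zero_grad()
    loss.backward()
    opt.step()
```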
When it comes to manipulation, another recent paper seems more promising IMO, it's like fMRI for LLMs.
https://twitter.com/mezaoptimizer/status/1709292930416910499
https://arxiv.org/abs/2310.01405
“This might be the biggest alignment paper of the year. Everyone has been complaining that mechanistic interpretability is like doing LLM cell microbiology, when what we really need is LLM neuro-imaging. Well now we have it: “representation engineering” Similar to an fMRI scan, CAIS creates the LAT (Linear Artificial Tomography) scan. They also do a form of LLM neuro-modulation, getting the model to be honest or deceptive by just adding in a vector to its activations. imo this could be the winning alignment agenda”
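The “adding in a vector to its activations” part amounts to activation steering: pick a direction in the residual stream (e.g. from a contrast pair of prompts) and add it back in at inference. A rough sketch with a forward hook (model choice, layer, scale, and the crude contrast-pair direction are assumptions; the paper's actual LAT pipeline is more elaborate):

```python
# Sketch of activation steering: derive a direction from contrasting prompts
# and add it to the residual stream during generation. Model, layer index,
# and the 4.0 scale are illustrative choices, not the paper's settings.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder model
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()
layer = model.transformer.h[6]  # a mid-depth block (GPT-2 layout)

def mean_resid(text: str) -> torch.Tensor:
    """Mean residual-stream activation at the chosen layer for a piece of text."""
    captured = {}
    def hook(module, inputs, output):
        captured["resid"] = output[0]  # (1, seq, d_model)
    h = layer.register_forward_hook(hook)
    with torch.no_grad():
        model(**tok(text, return_tensors="pt"))
    h.remove()
    return captured["resid"].mean(dim=1).squeeze(0)

# Contrast pair: the difference crudely approximates an "honesty" direction.
direction = mean_resid("I will answer truthfully.") - mean_resid("I will answer deceptively.")
direction = direction / direction.norm()

def steering_hook(module, inputs, output):
    # Add the direction to every position of the residual stream.
    return (output[0] + 4.0 * direction,) + output[1:]

handle = layer.register_forward_hook(steering_hook)
ids = tok("Question: did you read the document? Answer:", return_tensors="pt")
out = model.generate(**ids, max_new_tokens=20)
handle.remove()
print(tok.decode(out[0]))
```

Flipping the sign of the added vector is the usual way to push toward the opposite pole of the contrast pair (here, deception instead of honesty).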
Thanks for posting this!