I write software at survivalandflourishing.com. Previously MATS, Google, Khan Academy.
Joel Burget
Quick Thoughts on Scaling Monosemanticity
[Question] How is GPT-4o Related to GPT-4?
Measuring the composition of fryer oil at different times certainly seems like a good way to test both the original hypothesis and the effect of altitude.
You’re right, my original wording was too strong. I edited it to say that it agrees with so many diets instead of explains why they work.
One thing I like about the PUFA breakdown theory is that it agrees with aspects of so many different diets.
- Keto avoids fried food because the food being fried is usually carbs
- Carnivore avoids vegetable oils because they're not meat
- Paleo avoids vegetable oils because they weren't available in the ancestral environment
- Vegans tend to emphasize raw food, and fried foods often have meat or cheese in them
- Low-fat diets avoid fat of all kinds
Ray Peat was perhaps the closest to the mark in emphasizing that saturated fats are more stable (he probably talked about PUFA breakdown specifically, I’m not sure).
Edit: I originally wrote “neatly explains why so many different diets are reported to work”
If this was true, how could we tell? In other words, is this a testable hypothesis?
What reason do we have to believe this might be true? Because we’re in a world where it looks like we’re going to develop superintelligence, so it would be a useful world to simulate?
[Question] How to Model the Future of Open-Source LLMs?
From the latest Conversations with Tyler interview of Peter Thiel
I feel like Thiel misrepresents Bostrom here. Bostrom doesn't really want a centralized world government, nor does he think that's "a set of things that make sense and that are good". He's forced into world surveillance not because it's good but because it's the only alternative he sees to dangerous ASI being deployed.
I wouldn’t say he’s optimistic about human nature. In fact it’s almost the very opposite. He thinks that we’re doomed by our nature to create that which will destroy us.
Paul Christiano named as US AI Safety Institute Head of AI Safety
Three questions:
What format do you upload SAEs in?
What data do you run the SAEs over to generate the activations / samples?
How long of a delay is there between uploading an SAE and it being available to view?
This is fantastic. Thank you.
Thanks! I added a note about LeCun’s 100,000 claim and just dropped the Chollet reference since it was misleading.
Thanks for the correction! I’ve updated the post.
Highlights from Lex Fridman’s interview of Yann LeCun
I assume the 44,000 ppm CO2 in exhaled air is the product of respiration (i.e., the lungs have processed it), whereas the air used in mouth-to-mouth is quickly inhaled and exhaled.
What’s your best guess for what percentage of cells (in the brain) receive edits?
Are edits somehow targeted at brain cells in particular or do they run throughout the body?
I don’t have a well-reasoned opinion here but I’m interested in hearing from those who disagree.
How would you distinguish between weak and strong methods?
Re Na:K: potassium chloride is used as a salt substitute (which tastes surprisingly like regular salt). This makes it really easy to tweak the Na:K ratio (if it turns out to be important). OTOH, the fact that no one seems to have noticed people losing weight after switching to it is some evidence that the ratio isn't important.
To me the strongest evidence that fine-tuning is based on LoRA or similar is that pricing covers only training and input/output tokens, with no charge for storing your fine-tuned models. Llama-3-8b-instruct is ~16GB (I think this ought to be roughly comparable, at least in the same ballpark). You'd almost surely care if you were storing that much data for each fine-tune.
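The back-of-envelope arithmetic behind this can be sketched as follows. The full-model figure follows from ~8B parameters at 2 bytes each (fp16); the LoRA configuration (rank, targeted matrices, layer dimensions) is a hypothetical illustration, not the provider's actual setup.

```python
# Storage for a full fine-tune vs. a LoRA adapter, rough sketch.
# Assumes fp16 (2 bytes/param). The LoRA config below (rank 16 on the
# four attention projections of 32 roughly-4096x4096 layers) is a
# made-up but plausible example, not a known production configuration.

bytes_per_param = 2                     # fp16
full_params = 8e9                       # ~8B parameters
full_gb = full_params * bytes_per_param / 1e9
print(f"full fine-tune: ~{full_gb:.0f} GB per model")

rank, dim, layers, matrices = 16, 4096, 32, 4
# Each adapted matrix adds two low-rank factors: (dim x rank) + (rank x dim).
lora_params = layers * matrices * 2 * dim * rank
lora_mb = lora_params * bytes_per_param / 1e6
print(f"LoRA adapter: ~{lora_mb:.0f} MB per fine-tune")
```

On these assumptions a LoRA adapter is roughly three orders of magnitude smaller than the full weights, which would make per-fine-tune storage cheap enough to absorb into training costs.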