Tao Lin

Karma: 875

Tao Lin Jun 12, 2025, 3:30 PM
LW: 8 AF: 6
4
AF
on: When is it important that open-weight models aren’t released? My thoughts on the benefits and dangers of open-weight models in response to developments in CBRN capabilities.
This overestimates the impact of large models on external safety research. My impression is that the AI safety community has barely used deepseek r1 and v3 open source weights at all. I checked again and still see little evidence of v3/r1 weights in safety research. People use r1 distill 8b, and qwq 32b, but the decision to open source the most capable small model is different than the decision to open source the frontier. So then it matters when 8b or 32b models can assist with bioterrorism, which happens a bit later, and we get most of the benefits of open source until then. It’s also cheaper to filter virology or even all biology data out of training for a small models pre training data because it wouldn’t cause customers to switch providers (customers prefer large model anyway) and small models are more often narrowly focused on math or coding.

Tao Lin May 27, 2025, 10:53 PM
18 points
0
on: Season Recap of the Village: Agents raise $2,000
What are your API costs, and how do they compare to the $ raised?

Tao Lin May 20, 2025, 4:48 PM
1 point
0
in reply to: Steven Byrnes’s comment on: steve2152′s Shortform
I can somewhat see where you’re coming from about a new method being orders of magnitude more data efficient in RL, but I very strongly bet on transformers being core even after such a paradigm shift. I’m curious whether you think the transformer architecture and text input/output need to go, or whether the new training procedure / architecture fits in with transformers because transformers are just the best information mixing architecture.

Tao Lin May 20, 2025, 1:30 AM
2 points
0
in reply to: Warty’s comment on: Warty’s Shortform
Calibration is a super important signal of quality because it means you can actually act on the given probabilities! Even if someone is gaming calibration by betting given ratios on certain outcomes, you can still bet on their predictions and not lose money (often). That is far better than other news sources such as tweets or NYT or whatever. If a calibrated predictor and a random other source are both talking about the same thing, the fact that the predictor is calibrated is enough to make them the #1 source on that topic.

Tao Lin May 20, 2025, 1:23 AM
10 points
18
on: Semen and Semantics: Understanding Porn with Language Embeddings
Incest is not a subcategory of sexual violence, and it’s unethical for unrelated reasons. Then again I see the appeal of sexual violence porn but not incest porn, and maybe incest appeals to other people because they conflate it with violence?

Tao Lin May 17, 2025, 2:45 PM
12 points
7
on: How Fast Can Algorithms Advance Capabilities? | Epoch Gradient Update
Some compute dependent advancements are easier to extrapolate from small scale than others. For instance, I strongly suspect that small scale experiments + naively extrapolating memory usage is sufficient to discover (and be confident in) GQA. Note that the gpt-4 paper predicted the performance of gpt-4 from 1000x scaled down experiments! The gpt-4 scaling law extrapolation, and similar scaling laws work, is proof that a lot of advances can be extrapolated from much smaller compute scale.

Tao Lin May 17, 2025, 4:51 AM
10 points
6
on: Generating the Funniest Joke with RL (according to GPT-4.1)
Gpt-4.1 is an expecially soulless model. It’s intended for API use only, whereas chatgpt-latest is meant to chat with humans. It’s not as bad as o1-mini—that model is extremely autistic and has no concept of emotion. This would work much better with ~pretrained models. Likely you can get gpt-4-base or llama 405b base to do much better with just prompting and no RL.

Tao Lin Apr 24, 2025, 4:55 PM
17 points
8
in reply to: MichaelDickens’s comment on: MichaelDickens’s Shortform
Note that any competent capital holder has significant conflict of interest with AI, AI is already a significant fraction of the stock market and a pause would bring down most capital, not just private lab equity

Tao Lin Apr 16, 2025, 12:09 AM
15 points
1
on: Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study
I agree frontier models severely lack spatial reasoning on images, which I attribute to a lack of in-depth spatial discussion of images on the internet. My model of frontier models’ vision capabilities is that they have very deep knowledge of aspects of images that relate to text that happens to be immediately before or after it in web text, and only a very small fraction of images on the internet have accompanying in-depth spatial discussion. The models are very good at for instance guessing the location of where photos were taken, vastly better than most humans, because locations are more often mentioned around photos. I expect that if labs want to, they can construct enough semi-synthetic data to fix this.

Tao Lin Apr 5, 2025, 1:36 AM
2 points
0
in reply to: cubefox’s comment on: Show, not tell: GPT-4o is more opinionated in images than in text
Yeah they may be the same weights. The above quote does not absolutely imply the same weights generate the text and images IMO, just that it’s based on the 4o and sees the whole prompt. OpenAI’s audio generation is also ‘native’, but it’s served as a separate model on the API with different release dates, and you can’t mix audio and some function calling in chatgpt in a way that’s consistent with them not actually being the same weights.

Tao Lin Apr 3, 2025, 9:29 PM
1 point
0
on: Show, not tell: GPT-4o is more opinionated in images than in text
Note that the weights of ‘gpt-4o image generation’ may not be the same—they may be separate finetuned models! The main 4o chat llm calls a tool start generating an image, which may use the same weights but may just use different weights that have different post training

Tao Lin Mar 31, 2025, 4:42 PM
0 points
−1
in reply to: Davidmanheim’s comment on: Why do many people who care about AI Safety not clearly endorse PauseAI?
EU AI Code of Practice is better, a little closer to stopping ai development

Tao Lin Mar 24, 2025, 4:19 PM
1 point
0
in reply to: Neel Nanda’s comment on: Good Research Takes are Not Sufficient for Good Strategic Takes
yeah there’s generalization, but I do thing that eg (AGI technical alignment strategy, AGI lab and government strategy, AI welfare, AGI capabilities strategy) are sufficiently different that experts at one will be significantly behind experts on the others

Tao Lin Mar 24, 2025, 2:20 AM
7 points
0
on: Good Research Takes are Not Sufficient for Good Strategic Takes
Also, if you’re asking a panel of people, even those skilled at strategic thinking will still be useless unless they’ve thought deeply about the particular question or adjacent ones. And skilled strategic thinkers can get outdated quickly if they haven’t thought seriously about the problem in awhile.

Tao Lin Mar 6, 2025, 6:52 PM
3 points
0
in reply to: Alexander Gietelink Oldenziel’s comment on: Daniel Kokotajlo’s Shortform
The fact that they have a short lifecycle with only 1 lifetime breeding cycle is though. A lot of intelligent animals, like humans, chimps, elephants, dolphins, orcas, have long lives with many breeding cycles and grandparent roles. Ideally we want an animal that starts breeding in 1 year AND lives for 5+ breeding cycles to be able to learn enough to be useful over its lifetime. It takes so long for humans to learn enough to be useful!

Tao Lin Mar 4, 2025, 6:18 PM
12 points
6
on: How Much Are LLMs Actually Boosting Real-World Programmer Productivity?
Empirically, we likewise don’t seem to be living in the world where the whole software industry is suddenly 5-10 times more productive. It’ll have been the case for 1-2 years now, and I, at least, have felt approximately zero impact. I don’t see 5-10x more useful features in the software I use, or 5-10x more software that’s useful to me, or that the software I’m using is suddenly working 5-10x better, etc.
Diminishing returns! Scaling laws! One concrete version of “5x productivity” is “as much productivity as 5 copies of me in parallel”, and we know that usually 5x-ing most inputs, like training compute and data, # of employees, etc, more often scales logarithmically instead of linearly

Tao Lin Mar 4, 2025, 2:34 AM
1 point
0
in reply to: Fabien Roger’s comment on: Fabien’s Shortform
I was actually just making some tree search scaffolding, and i had the choice between honestly telling each agent would be terminated if it failed or not. I ended up telling them relatively gently that they would be terminated if they failed. Your results are maybe useful to me lol

Tao Lin Feb 20, 2025, 7:27 PM
10 points
4
in reply to: Daniel Kokotajlo’s comment on: Daniel Kokotajlo’s Shortform
Maybe, you could define it that way. I think R1, which uses ~naive policy gradient, is evidence that long generations are different and much easier than long eposides with environment interaction—GRPO (pretty much naive policy gradient) does no attribution to steps or parts of the trajectory, it just trains on the whole trajectory. Naive policy gradient is known to completely fail at more traditional long horizon tasks like real time video games. R1 is more like brainstorming lots of random stuff that doesn’t matter and then selecting the good stuff at the end than taking actions that actually have to be good before the final output

Tao Lin Feb 20, 2025, 6:11 AM
5 points
4
in reply to: Daniel Kokotajlo’s comment on: Daniel Kokotajlo’s Shortform
If by “new thing” you mean reasoning models, that is not long-horizon RL. That’s many generation steps with a very small number of environment interaction steps per eposide, whereas I think “long-horizon RL” means lots of environment interaction steps

Tao Lin Jan 31, 2025, 6:32 PM
14 points
0
on: Catastrophe through Chaos
I agree with this so much! Like you I very much expect benefits to be much greater than harms pre superintelligence. If people are following the default algorithm “Deploy all AI which is individually net positive for humanity in the near term” (which is very reasonable from many perspectives), they will deploy TEDAI and not slow down until it’s too late.

I expect AI to get better at research slightly sooner than you expect.