lol Paul is a very non-disparaging person. He always makes his criticism constructive; I don't know if there's any public evidence of him disparaging anyone, regardless of NDAs
Tao Lin
I’ve recently gotten into partner dancing and I think it’s a pretty superior activity
One lesson you could take away from this is “pay attention to the data, not the process”—this happened because the data had longer successes than failures. If successes were more numerous than failures, many algorithms would have imitated those as well with null reward.
I think the “fraction of training compute” going towards agency vs non-agency will be lower in video models than LLMs, and LLMs will likely continue to be bigger, so video models will stay behind LLMs in overall agency
Helpfulness finetuning might make these models more capable when they're on the correct side of the debate. Sometimes RLHF-like models simply perform worse on tasks they're finetuned to avoid, even when they don't refuse or give up. It would be nice to try base-model debaters
A core advantage of bandwidth limiting over other cybersecurity interventions is that it's a simple system we can make stronger arguments about, implemented on a simple processor, without the complexity and uncertainty of modern processors and OSes
No, clock speed stays the same, but the clock-cycle latency of communication between regions increases, just like CPUs require more clock cycles to access memory than they used to.
Do we have any reason to believe that particular election won't be close?
I'd expect artificial sweeteners are already very cheap, and most people want better-tested chemicals.
There's an Effective Altruism VR Discord group. It used to have regular VRChat meetups around 2021, but I don't think it has much activity now
I'd be interested in experiments with more diverse data. Maybe this only works because the passages are very short, simple, and uniform, and use very superposition-y information that wouldn't exist in longer and more diverse text
I thought about this for a minute and landed on no, accounting for the Lorentz factor. Things hitting the side have about the same relative velocity as things hitting from the front. Because they're hitting the side, they could either bounce off or dump all their tangential kinetic energy into each other; since all the relative velocity is tangential, they could in principle interact without exchanging significant energy. But the side impacts are probably just as dangerous, and might be more dangerous because you have less armor on the side
Probes probably want a very skinny aspect ratio. If cosmic dust travels at 20 km/s, that's 15,000 times slower than the probe is travelling, so maybe the probe should be eg 10 cm wide and 1.5 km long
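A minimal sketch of the arithmetic behind that guess. The near-light-speed probe velocity is an assumption implied by the 15,000x figure, not stated directly:

```python
# Aspect-ratio estimate for a fast probe facing side-impacts from cosmic
# dust. Probe speed (~c) is an assumption implied by the 15,000x ratio.
probe_speed_km_s = 3e5   # ~speed of light, assumed
dust_speed_km_s = 20     # cosmic dust speed from the comment
speed_ratio = probe_speed_km_s / dust_speed_km_s  # ~15,000

# While the probe's full length sweeps past a dust grain, the grain drifts
# sideways by length / speed_ratio. Matching the probe's width to that
# drift gives width / length = dust_speed / probe_speed.
length_m = 1500.0        # 1.5 km
width_m = length_m / speed_ratio
print(speed_ratio, width_m)  # 15000.0 0.1
```

So a 1.5 km long probe would be ~10 cm wide under these assumptions, matching the figures above.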
Important to note that GPT-4 is more like a 300x scale equivalent of GPT-3, not 100x, based on GPT-4 being trained with (rumored) 2e25 FLOPs vs contemporary GPT-3-level models (LLaMA 2 7B) being trained on 8e22 FLOPs (250x the compute for that particular pair)
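The 250x figure is just the ratio of the two (rumored) FLOP counts:

```python
# Compute ratio behind the "more like 300x, not 100x" framing,
# using the rumored figures from the comment.
gpt4_flops = 2e25          # rumored GPT-4 training compute
gpt3_level_flops = 8e22    # LLaMA 2 7B, a contemporary GPT-3-level model
ratio = gpt4_flops / gpt3_level_flops
print(ratio)  # ~250
```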
Some months before release they had an RLHF-ed model that was significantly worse on most dimensions than the model they finally released. This early RLHF-ed model was mentioned in eg Sparks of AGI.
Send us example gnarly bugs
If AI does change the offence-defence balance, it could be because defending an AI (that doesn't need to protect humans) is fundamentally different from defending humans, allowing the AI to spend much less on defence
Video can get extremely expensive without specific architectural support. Eg a folder of images takes up >10x the space of the equivalent video, and using eg 1000 tokens per frame at 30 frames/second is a lot of compute
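A quick sense of scale for that token count, using the figures above (the one-minute clip length is my own illustrative choice):

```python
# Token arithmetic for naive per-frame video tokenization, using the
# comment's figures (1000 tokens/frame, 30 fps). The 60-second clip
# length is an illustrative assumption.
tokens_per_frame = 1000
frames_per_second = 30
tokens_per_second = tokens_per_frame * frames_per_second
tokens_per_minute = tokens_per_second * 60
print(tokens_per_second, tokens_per_minute)  # 30000 1800000
```

At 30k tokens per second of video, even short clips dwarf typical text contexts, which is the point about compute cost.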
Looks slightly behind GPT-4-base on benchmarks. On the tasks where Gemini uses chain-of-thought best-of-32 with optimized prompts it beats GPT-4-base, but on the ones where it doesn't, it's the same or behind
I don’t think slaughtering billions of people would be very useful. As a reference point, wars between countries almost never result in slaughtering that large a fraction of people