In an ideal world (perhaps not reasonable given your scale), you would have some sort of permissions and logging around sensitive types of queries on DM metadata. (E.g., perhaps you would let any Lighthaven team member see an aggregate dashboard number like "rate of DMs from accounts <1 month old compared to historic baseline," but "how many DMs has Bob (an account over 90 days old) sent to Alice?" would require more guardrails.)
Edit: to be clear, I am comfortable with you doing this without such logging at your current scale and think it is reasonable to do so.
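A minimal sketch of what that tiering could look like, in Python; the query names, policy registry, and audit log below are all hypothetical illustrations, not anything from the actual codebase:

```python
# Minimal sketch of tiered access to DM-metadata queries, assuming a
# hypothetical query registry and an in-memory audit log. Names are
# invented for illustration only.
from dataclasses import dataclass
from datetime import datetime, timezone

@dataclass
class QueryPolicy:
    tier: str                # "aggregate" (dashboard-safe) or "individual"
    requires_signoff: bool   # individual-level queries need a second approver

# Hypothetical query names mapped to policies.
POLICIES = {
    "new_account_dm_rate_vs_baseline": QueryPolicy("aggregate", False),
    "dm_count_between_users": QueryPolicy("individual", True),
}

AUDIT_LOG: list[dict] = []

def run_metadata_query(name, requester, approved_by=None, **params):
    """Gate a DM-metadata query on its policy tier and log every access."""
    policy = POLICIES[name]
    if policy.requires_signoff and approved_by is None:
        raise PermissionError(f"'{name}' is individual-level; second sign-off required")
    AUDIT_LOG.append({
        "query": name,
        "requester": requester,
        "approved_by": approved_by,
        "params": params,
        "at": datetime.now(timezone.utc).isoformat(),
    })
    # ...dispatch to the actual query implementation here...
```

Under this scheme the aggregate dashboard query just gets logged, while `run_metadata_query("dm_count_between_users", "mod_bob", sender="Bob", recipient="Alice")` raises unless a second approver is recorded.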
I have a few weeks off coming up shortly, and I’m planning on spending some of it monkeying around with AI and code stuff. I can think of two obvious tacks: 1. Go do some fundamentals learning on technical stuff I don’t have hands-on experience with, or 2. Go build some new fun stuff.
Does anyone have particular lists of learning topics / syllabi / similar resources that would be a good fit for a “fairly familiar with the broad policy/technical space, but his largest shipped chunk of code is a few hundred lines of Python” person like me?
Note also that this work isn’t just papers; e.g., as a matter of public record MIRI has submitted formal comments to regulators to inform draft regulation based on this work.
(For those less familiar: yes, such comments are indeed weirdly impactful in the American regulatory system.)
In a hypothetical, bad future where we have to do VaccinateCA 2.0 against e.g. bird flu, I personally wonder if “aggressively help people source air filters” would be a pre-vaccine-distribution-time step we would consider. (Not canon! Might be very wrong! Just idle musing)
Also, I would generally volunteer to help with selling Lighthaven as an event venue to boring consultant things that will give you piles of money, and IIRC Patrick Ward is interested in this as well, so please let us know how we can help.
I am excited for this on the grounds of “we deserve to have nice things,” though for boring financial planning reasons I am not sure whether I will donate additional funds prior to calendar year end or in calendar year 2025.
(Note that I made a similar statement in the past and then donated $100 to Lighthaven very shortly thereafter, so, like, don’t attempt to reverse-engineer my financial status from this or whatever.)
I think I’m also learning that people are way more interested in this detail than I expected!
I debated changing it to “203X” when posting to avoid this becoming the focus of the discussion but figured, “eh, keep it as I actually wrote it in the workshop” for good epistemic hygiene.
Oh, it very possibly is the wrongest part of the piece! I put it in the original workshop draft as I was running out of time and wanted to provoke debate.
A brief gesture at a sketch of the intuition: imagine a different, crueler world with orders of magnitude more nation-states but, at the start, only a few nuclear powers, like in our world, with a 1950s-level tech base. If the few nuclear powers want to keep control, they’ll have to divert huge chunks of their breeder reactors’ output to pre-emptively nuking any site in the many, many non-nuclear-club states that could host an arms program, to prevent breakouts; as a result, any of the nuclear powers would have to wait a fairly long time to assemble an arms stockpile sufficient to launch a Project Orion into space.
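To make that tradeoff concrete, here’s a toy back-of-the-envelope in Python; every number below is an assumption invented purely for illustration, not anything from the thought experiment itself:

```python
# Toy Fermi sketch of the crueler-world tradeoff. All numbers are invented
# assumptions for illustration only.
warheads_per_year = 600   # assumed annual fissile output of one nuclear power
candidate_sites = 5_000   # assumed suspect sites across the many non-club states
refresh_years = 10        # assumed years before a pre-empted site must be re-hit
orion_stockpile = 1_000   # assumed warheads needed to launch a Project Orion

suppression_per_year = candidate_sites / refresh_years  # warheads/yr on pre-emption
surplus_per_year = warheads_per_year - suppression_per_year
years_to_orion = (
    orion_stockpile / surplus_per_year if surplus_per_year > 0 else float("inf")
)
print(f"Suppression consumes {suppression_per_year:.0f} warheads/yr; "
      f"Orion stockpile ready in {years_to_orion:.0f} years")
```

With these made-up numbers, suppression eats most of the annual output, leaving a surplus so thin that the Orion stockpile takes a decade to accumulate; tilt the parameters slightly and it never accumulates at all.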
Dave Kasten’s AGI-by-2027 vignette
As you know, I have huge respect for USG natsec folks. But there are (at least!) two flavors of them: 1) the cautious, measure-twice-cut-once sort who have carefully managed deterrence for decades, and 2) the “fuck you, I’m doing Iran-Contra” folks. Which do you expect will end up in control of such a program? It’s not immediately clear to me which ones would.
I think this is a (c) leaning (b), especially given that we’re doing it in public. Remember, the Manhattan Project was a highly-classified effort and we know it by an innocuous name given to it to avoid attention.
Saying publicly, “yo, China, we view this as an all-costs priority, hbu” is a great way to trigger a race with China...
But if it turned out that we knew from ironclad intel with perfect sourcing that China was already racing (I don’t expect this to be the case), then I would lean back more towards (c).
I’ll be in Berkeley Weds evening through next Monday, would love to chat with, well, basically anyone who wants to chat. (I’ll be at The Curve Fri-Sun, so if you’re already gonna be there, come find me there between the raindrops!)
Thanks, looking forward to it! Please do let us folks who worked on A Narrow Path (especially me, @Tolga, and @Andrea_Miotti) know if we can be helpful in bouncing around ideas as you work on the treaty proposal!
Is there a longer-form version with draft treaty language (even an outline)? I’d be curious to read it.
I think people opposing this have a belief that the counterfactual is “USG doesn’t have LLMs” instead of “USG spins up its own LLM development effort using the NSA’s no-doubt-substantial GPU clusters”.
Needless to say, I think the latter is far more likely.
I think the thing that you’re not considering is that when tunnels are more prevalent and more densely packed, the incentive to use the defensive strategy of “dig a tunnel, then set off a very big bomb in it that collapses many tunnels” gets far higher. It wouldn’t always be infantry combat; it would often be a subterranean equivalent of indirect fires.
Ok, so Anthropic’s new policy post (explicitly NOT linkposting it properly since I assume @Zac Hatfield-Dodds or @Evan Hubinger or someone else from Anthropic will, and figure the main convo should happen there, and don’t want to incentivize fragmenting of conversation) seems to have a very obvious implication.
Unrelated, I just slammed a big AGI-by-2028 order on Manifold Markets.
Yup. The fact that the profession that writes the news sees “I should resign in protest” as their own responsibility in this circumstance really reveals something.
At LessOnline, there was a big discussion one night around the picnic tables with @Eliezer_Yudkowsky, @habryka, and some interlocutors from the frontier labs (you’ll momentarily see why I’m being vague on the latter names).
One question was: “does DC actually listen to whistleblowers?” and I contributed that, in fact, DC does indeed have a script for this, and resigning in protest is a key part of it, especially ever since the Nixon years.
Here is a usefully publicly-shareable anecdote on how strongly this norm is embedded in national security decision-making, from the New Yorker article “The U.S. Spies Who Sound the Alarm About Election Interference” by David Kirkpatrick, Oct 21, 2024 (https://archive.ph/8Nkx5):
The experts’ chair insisted that in this cycle the intelligence agencies had not withheld information “that met all five of the criteria”—and did not risk exposing sources and methods. Nor had the leaders’ group ever overruled a recommendation by the career experts. And if they did? It would be the job of the chair of the experts’ group to stand up or speak out, she told me: “That is why we pick a career civil servant who is retirement-eligible.” In other words, she can resign in protest.
FWIW re: the Dario 2025 comment, Anthropic very recently posted a few job openings for recruiters focused on policy and comms specifically, which I assume is a leading indicator for hiring. One plausible rationale there is that someone on the executive team smashed the “we need more people working on this, make it happen” button.