What is prospective memory training?
Joseph Miller
I think there’s a spectrum between great man theory and structural forces theory and I would classify your view as much closer to the structural forces view, rather than a combination of the two.
The strongest counter-example might be Mao. It seems like one man’s idiosyncratic whims really did set the trajectory for hundreds of millions of people. Of course, as soon as he died most of the power vanished, but surely China and the world would be extremely different today without him.
The Duke of Wellington said that Napoleon’s presence on a battlefield “was worth forty thousand men”.
That would be about 4% of the size of France’s military in 1812.
I first encountered it in chapter 18 of The Looming Tower by Lawrence Wright.
But here’s an easily linkable online source: https://ctc.westpoint.edu/revisiting-al-qaidas-anthrax-program/
“Despite their extreme danger, we only became aware of them when the enemy drew our attention to them by repeatedly expressing concerns that they can be produced simply with easily available materials.”
Ayman al-Zawahiri, former leader of Al-Qaeda, on chemical/biological weapons.

I don’t think this is a knock-down argument against discussing CBRN risks from AI, but it seems worth considering.
This is great, thanks. I think these could be very helpful for interpretability.
Thanks, I enjoyed this.
The main thing that seems wrong to me, as with some of your other recent posts, is that AI progress mysteriously decelerates around 2030. I predict that things will look much more sci-fi after that point than in your story (if we’re still alive).
xAI claims to have a cluster of 200k GPUs, presumably H100s, online for long enough to train Grok 3.
I think this is faster datacenter scaling than any prediction I’ve heard.
Source: https://x.com/xai/status/1891699715298730482
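For scale, a rough back-of-the-envelope sketch (all numbers here are my own assumptions for illustration, not figures from the announcement: ~1e15 dense BF16 FLOP/s per H100, 40% utilization, a 100-day run):

```python
# Rough peak-compute estimate for a 200k-GPU cluster. All numbers are
# my assumptions for illustration, not figures from xAI.
n_gpus = 200_000
flops_per_gpu = 1e15                  # ~989 TFLOP/s dense BF16, rounded
peak = n_gpus * flops_per_gpu
print(f"peak: {peak:.0e} FLOP/s")     # ~2e20 FLOP/s

# Hypothetical training run: 40% utilization for 100 days.
total = peak * 0.4 * 100 * 86_400
print(f"total: {total:.0e} FLOP")     # ~7e26 FLOP
```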
DM’d
In that case I would consider applying to EA Funds, if you are willing to do the work professionally, or setting up a charity to do it. I think you could make a strong case that it meets the highest bar for important, neglected, and tractable work.
How long does it take you to save one life on average? GiveWell’s top charities save a life for about $5000. If you can get close to that there should be many EA philanthropists willing to fund you or a charity you create.
And I think they should be willing to go up to at least $10-20k, because murders are probably especially bad deaths in terms of their effects on the world.
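A toy version of that arithmetic (illustrative numbers only; the 2-4x “badness multiplier” is my own assumption, not a researched figure):

```python
# If GiveWell's top charities save a life for ~$5,000, and a prevented
# murder is weighted 2-4x a typical death averted (my assumption),
# this implies a funding bar per murder prevented of:
givewell_cost_per_life = 5_000
badness_multiplier = (2, 4)

low, high = (m * givewell_cost_per_life for m in badness_multiplier)
print(f"competitive up to roughly ${low:,}-${high:,} per murder prevented")
```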
I just found the paper “BERT’s output layer recognizes all hidden layers? Some Intriguing Phenomena and a simple way to boost BERT”, which precedes this post by a few months and invents essentially the same technique as the logit lens.
So consider also citing that paper when citing this post.
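For readers unfamiliar with the technique, here is a minimal sketch of the shared idea (my own illustrative code, assuming a GPT-2-style model from Hugging Face transformers): decode every intermediate hidden state through the model’s final LayerNorm and unembedding, so each layer yields its own next-token distribution.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2").eval()

inputs = tokenizer("The Eiffel Tower is in", return_tensors="pt")
with torch.no_grad():
    out = model(**inputs, output_hidden_states=True)

# out.hidden_states: the embedding output plus one hidden state per layer.
for layer, h in enumerate(out.hidden_states):
    logits = model.lm_head(model.transformer.ln_f(h))  # decode mid-stream
    top = int(logits[0, -1].argmax())
    print(f"layer {layer:2d}: {tokenizer.decode([top])!r}")
```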
As an aside, I would guess that this is the most cited LessWrong post in the academic literature, but it would be cool if anyone had stats on that.
Yeah I guess, but actually the more I think about it, the more impractical it seems.
I think the solution would be something like adopting a security mindset with respect to preventing community members going off the rails.
The costs would be high, because everyone would be under suspicion by default, but maybe it would be worth it.
The next international PauseAI protest is taking place in one week, in London, New York, Stockholm (Sun 9 Feb), Paris (Mon 10 Feb), and many other cities around the world.
We are calling for AI Safety to be the focus of the upcoming Paris AI Action Summit. If you’re on the fence, take a look at “Why I’m doing PauseAI”.
For those in Europe, Tomorrow Biostasis makes the process a lot easier and they have people who will talk you through step by step.
A good example of surprising detail I just read.
It turns out that the UI for a simple handheld calculator is a large design space with no easy solutions.
https://lcamtuf.substack.com/p/ui-is-hell-four-function-calculators
Following OpenAI Twitter freakouts is a colossal, utterly pointless waste of your time and you shouldn’t do it ever.
I feel like, for the same reasons, this shortform is kind of an engaging waste of my time. One reason I read LessWrong is to avoid Twitter garbage.
“we thought that forecasting AI trends was important to be able to have us taken seriously”
This might be the most dramatic example ever of forecasting affecting the outcome.
Similarly, I’m concerned that a lot of alignment people are putting work into evals and benchmarks, which may be having some accelerating effect on the AI capabilities they are trying to understand.
“That which is measured improves. That which is measured and reported improves exponentially.”
You can see what he’s referring to in the pictures Webb published of the scene.