Robert Miles

Karma: 1,234

Robert Miles Apr 29, 2025, 8:44 PM
8 points
3
in reply to: James_Miller’s comment on: Our Reality: A Simulation Run by a Paperclip Maximizer
I disagree with the insistence on “paperclip maximiser”. As an emerging ASI you want to know about other ASIs you’ll meet, especially grabby ones. But there are aligned grabby ASIs. You’d want an accurate prior, so I don’t think this updates me on probability of alignment, or even much on grabbiness, since it’s hard to know ahead of time, that’s why you’d run a simulation in the first place.
I don’t take it very seriously because (1) it is a big pile of assumptions and I don’t trust anthropic reasoning much at the best of times, it’s very confusing and hard to think about (2) the simulation is most useful if it’s using the same physics as base reality, so it kind of doesn’t matter if we’re simulated or not or both, our actions should be the same. So it’s not really decision-relevant, I’d follow ~the same policy either way

Robert Miles Apr 28, 2025, 3:03 AM
10 points
0
on: Our Reality: A Simulation Run by a Paperclip Maximizer
I tweeted about something a lot like this
https://xcancel.com/robertskmiles/status/1877486270143934881

Robert Miles Mar 14, 2025, 10:06 PM
3 points
0
on: Eliezer’s Lost Alignment Articles / The Arbital Sequence
FYI, Relative URLs don’t work in emails, the email version I received has all the links going to http://w/<post-title> and thus broken

Robert Miles Jan 17, 2025, 8:23 PM
13 points
11
on: Everywhere I Look, I See Kat Woods
You understand you can just block her on Reddit and Facebook, and move on with your life?

Robert Miles Dec 28, 2024, 8:52 AM
2 points
0
on: Bay Winter Solstice 2024: Speech Auditions
Dang, I missed this. Here’s my audition for 500 Million though, I guess for next year

https://m.youtube.com/watch?v=ljmifo4Klss

Robert Miles Oct 3, 2024, 8:47 PM
22 points
2
on: Three Subtle Examples of Data Leakage
Very interesting! I think this is one of the rare times where I feel like a post would benefit from an up-front Definition. What actually is Leakage, by intensional definition?

Robert Miles Sep 16, 2024, 9:10 PM
59 points
4
in reply to: gwern’s comment on: Did Christopher Hitchens change his mind about waterboarding?
This is one technical point that younger people are often amazed to hear, that for a long time the overwhelming majority of TV broadcast was perfectly ephemeral, producing no records at all. Not just that the original copies were lost or never digitised or impossible to track down or whatever, but that nothing of the sort ever existed. The technology for capturing, broadcasting, and displaying a TV signal is so much easier than the tech for recording one, that there were several decades when the only recordings of TV came from someone setting up a literal film camera pointed at a TV and capturing the screen on photographic film, and that didn’t happen much.

(This also meant that old TV was amazingly low latency. The camera sensor scanned through, producing the signal, which went through some analog circuitry and straight onto the air, into the circuitry of your TV and right onto the screen. The scanning of the electron beam across the screen was synchronised with the scanning of the camera sensor. At no point was even a single frame stored—I need to check the numbers but I think if you were close to the TV station, you’re looking at the top of the frame before the bottom of the frame is even captured by the camera)

Robert Miles Sep 15, 2024, 9:27 AM
2 points
0
on: The Incredible Fentanyl-Detecting Machine
What about NMR or XRF? XRF can non-destructively tell you the elemental composition of a sample, which (if the sample is pure) can often pin down the compound, and NMR spectroscopy is also non destructive and can give you some info about chemical structure too

Robert Miles Aug 22, 2024, 12:04 PM
6 points
2
on: Medical Image Registration: The obscure field where Deep Mesaoptimizers are already at the top of the benchmarks. (post + colab notebook)
This is an interesting post!

I’m new to alignment research—any tips on how to prove what the inner goal actually is?

Haha! haaaaa 😢

Robert Miles May 24, 2024, 4:57 PM
5 points
0
in reply to: Eigil Rischel’s comment on: Transportation as a Constraint
Not least being the military implications. If you have widely available tech that lets you quickly and cheaply accelerate something car-sized to a velocity of Mach Fuck (they’re meant to circle the earth in 4.2 hours, making them 2 or 3 times faster than a rifle bullet), that’s certainly a dual use technology.

AI Safety Chatbot

markov and Robert Miles

Dec 21, 2023, 2:06 PM

61 points

11 comments4 min readLW link

Robert Miles Oct 29, 2023, 12:46 AM
3 points
0
in reply to: Richard_Ngo’s comment on: Holly Elmore and Rob Miles dialogue on AI Safety Advocacy
Covid was a big learning experience for me, but I’d like to think about more than one example. Covid is interesting because, compared to my examples of birth control and animal-free meat, it seems like with covid humanity smashed the technical problem out of the park, but still overall failed by my lights because of the political situation.
How likely does it seem that we could get full marks on solving alignment but still fail due to politics? I tend to think of building a properly aligned AGI as a straightforward win condition, but that’s not a very deeply considered view. I guess we could solve it on a whiteboard somewhere but for political reasons it doesn’t get implemented in time?

Holly Elmore and Rob Miles dialogue on AI Safety Advocacy

Bird Concept, Robert Miles and Holly_Elmore

Oct 20, 2023, 9:04 PM

162 points

30 comments27 min readLW link

Stampy’s AI Safety Info soft launch

steven0461 and Robert Miles

Oct 5, 2023, 10:13 PM

120 points

9 comments2 min readLW link

Robert Miles Sep 17, 2023, 10:57 AM
4 points
on: (Confusion Phrases) AKA: Things You Might Say or Think When You’re Confused to Use as Triggers for Internal TAPs
I think almost all of these are things that I’d only think after I’d already noticed confusion, and most are things I’d never say in my head anyway. A little way into the list I thought “Wait, did he just ask ChatGPT for different ways to say “I’m confused”?”.

I expect there are things that pop up in my inner monologue when I’m confused about something, that I wouldn’t notice, and it would be very useful to have a list of such phrases, but your list contains ~none of them.

Edit: Actually the last three are reasonable. Are they human written?

Robert Miles Sep 10, 2023, 4:32 PM
6 points
2
in reply to: lunatic_at_large’s comment on: Probabilistic argument relationships and an invitation to the argument mapping community
One way of framing the difficulty with the lanternflies thing is that the question straddles the is-ought gap. It decomposes pretty cleanly into two questions: “What states of the universe are likely to result from me killing vs not killing lanternflies” (about which Bayes Rule fully applies and is enormously useful), and “Which states of the universe do I prefer?”, where the only evidence you have will come from things like introspection about your own moral intuitions and values. Your values are also a fact about the universe, because you are part of the universe, so Bayes still applies I guess, but it’s quite a different question to think about.
If you have well defined values, for example some function from states (or histories) of the universe to real numbers, such that larger numbers represent universe states that you would always prefer over smaller numbers, then every “should I do X or Y” question has an answer in terms of those values. In practice we’ll never have that, but still it’s worth thinking separately about “What are the expected consequences of the proposed policy?” and “What consequences do I want”, which a ‘should’ question implicitly mixes together.

Robert Miles Jun 18, 2023, 11:12 PM
7 points
3
on: Solomonoff induction still works if the universe is uncomputable, and its usefulness doesn’t require knowing Occam’s razor
I’ve always thought of it like, it doesn’t rely on the universe being computable, just on the universe having a computable approximation. So if the universe is computable, SI does perfectly, if it’s not, SI does as well as any algorithm could hope to.

Robert Miles Jun 15, 2023, 11:30 PM
5 points
0
on: Why libertarians are advocating for regulation on AI
A slightly surreal experience to read a post saying something I was just tweeting about, written by a username that could plausibly be mine.

Robert Miles Jun 7, 2023, 10:35 AM
7 points
2
in reply to: TurnTrout’s comment on: The Sharp Right Turn: sudden deceptive alignment as a convergent goal
Do we even need a whole new term for this? Why not “Sudden Deceptive Alignment”?

Robert Miles Jun 5, 2023, 9:15 AM
15 points
6
on: Meta-conversation shouldn’t be taboo
I think in some significant subset of such situations, almost everyone present is aware of the problem, so you don’t always have to describe the problem yourself or explicitly propose solutions (which can seem weird from a power dynamics perspective). Sometimes just drawing the group’s attention to the meta level at all, initiating a meta-discussion, is sufficient to allow the group to fix the problem.

Robert Miles

AI Safety Chatbot

Holly El­more and Rob Miles di­alogue on AI Safety Advocacy

Stampy’s AI Safety Info soft launch

Holly Elmore and Rob Miles dialogue on AI Safety Advocacy