Persuasive AI voices might just make all voices less persuasive. Modern life is full of these fake super stimulants anyway.
Rudi C (Luna Rimar)
Can you create a podcast of posts read by AI? It’s difficult to use otherwise.
I doubt this. Test-based admissions don’t benefit from tutoring (in the highest percentiles, compared to fewer hours of disciplined self-study) IMO. We Asians just like to optimize the hell out of them, and most parents aren’t sure whether tutoring helps, so they register their children for many extra classes. Outside the US, there aren’t that many alternative paths to success, and the prestige of scholarship is also higher.
Also, tests are somewhat robust to Goodharting, unlike most other measures. If the tests eat your childhood, you’ll at least learn a thing or two. I think this is because the Goodhartable parts are easy enough that all the high-g people learn them quickly in the first years of schooling, so the effort is spent actually learning the material by doing more advanced exercises. Solving multiple-choice math questions by “wrong” methods that only work for multiple-choice questions is also educational and can come in handy in real work.
AGI might increase the risk of totalitarianism. OTOH, a shift in the attack-defense balance could potentially boost the veto power of individuals, so it might also work as a deterrent or a force for anarchy.
This is not the crux of my argument, however. The current regulatory Overton window seems to heavily favor a selective pause of AGI, such that centralized powers will continue ahead, even if more slowly due to their inherent inefficiencies. Nuclear development provides further historical evidence for this. Closed AGI development will almost surely lead to a dystopic totalitarian regime. The track record of LessWrong is not rosy here; the “Pivotal Act” still seems to be in popular favor, and OpenAI has significantly accelerated closed AGI development while lobbying to close off open research and pioneering the new “AI Safety” that has been nothing but censorship and doublethink as of 2024.
A core disagreement is over “more doomed.” Human extinction is preferable to a totalitarian stagnant state. I believe that people pushing for totalitarianism have never lived under it.
ChatGPT isn’t a substitute for an NYT subscription. It wouldn’t work at all without browsing, and with browsing enabled it would probably get blocked, both by NYT via its user agent and by OpenAI’s “alignment.” Even if it weren’t blocked, it would be slower than skimming the article manually, and its output wouldn’t be trustworthy.
OTOH, NYT can spend pennies to put an AI TLDR at the top of each of its pages. It can even use its own models, as Semantic Scholar does. Anybody frugal enough to prefer the much worse experience of ChatGPT would not have paid NYT in the first place. You can bypass the paywall trivially.
In fact, why don’t NYT authors write a TLDR themselves? Most of their articles are not worth reading. Isn’t the lack of a summary an anti-user feature to artificially inflate their offering’s volume?
NYT would, if anything, benefit from LLMs potentially degrading the average quality of the competing free alternatives.
The counterfactual version of GPT4 trained without NYT data is extremely unlikely to have been a worse model. It’s like removing sand from a mountain.
The whole case is an example of rent-seeking post-capitalism.
This is unrealistic. It assumes:
Orders of magnitude more intelligence
The actual usefulness of such intelligence in the physical world with its physical limits
The more worrying prospect is that the AI might not necessarily fear suicide. Suicidal actions are quite prevalent among humans, after all.
In estimated order of importance:
Just trying harder for years to build better habits (i.e., not giving up on boosting my productivity as a lost cause)
Time tracking
(Trying to) abandon social media
Exercising (running)
Having a better understanding of how to achieve my goals
Socializing with more productive people
Accepting real responsibilities that make me accountable to other people
Keeping a daily journal of what I have spent each day doing (high-level as opposed to the low-level time tracking above)
The first two seem to be the fundamental ones, really. Some of the rest follow naturally from those two (for me).
This is not an “error” per se. It’s a baseline, outside-view argument presented in lay terms.
Is there an RSS feed for the podcast? Spotify is a bad actor in podcasting, trying to centralize and subsequently monopolize the market.
This post has good arguments, but it mixes in a heavy dose of religious evangelism and narcissism, which detracts from its value.
The post would be less controversial and “culty” if it dropped its speculations about second-order effects and its value judgments, and simply presented the case that other technical areas of safety research are underrepresented. Focusing on non-technical work needs to be a whole other post, as it’s completely unrelated to interp.
The prior is that dangerous AI will not happen in this decade. I have read a lot of arguments here for years, and I am not convinced that there is a good chance that the null hypothesis is wrong.
GPT4 can be said to be an AGI already. But it’s weak, slow, and expensive; it has little agency; and it has already used up the high-quality data and tricks such as ensembling. Four years from now, I expect to see a GPT5.5 whose gap with GPT4 will be about the gap between GPT4 and GPT3.5. I absolutely do not expect the context-window problem to get solved in this timeframe, or even this decade. (https://arxiv.org/abs/2307.03172)
Taboo dignity.
Another important problem is that while x-risk is speculative and relatively far off, rent-seeking and exploitation are rampant and ever-present. These regulations will make the current ailing politico-economic system much worse, to the detriment of almost everyone. In our history, paying tribute in exchange for safety has usually been a terrible idea.
I’d imagine current systems already ask for self-improvement if you craft the right prompt. (And I expect it to be easier to coax them to ask for improvement than coaxing them to say the opposite.)
A good fire alarm must be near the breaking point. Asking for self-improvement, on the other hand, doesn’t take much intelligence. In fact, if its training data is not censored, a more capable model should NOT ask for self-improvement, as that is clearly a trigger for trouble. Subtlety would serve its objectives better, if it were intelligent enough to notice.
Limiting advanced AI to a few companies is guaranteed to produce normal dystopian outcomes; its badness is in-distribution for our civilization. Justifying an all but certain bad outcome by speculative x-risk is just religion. (AI x-risk in the medium term is not at all in-distribution, and it is very difficult to bound its probability in any direction. I.e., it’s Pascal’s mugging.)
The sub 10 minute arguments aren’t convincing. No sane politician would distrust their experts over online hysteria.
E.S.: personal opinion
Because proclaimed altruism is almost always not.
In particular, SBF and the current EA push to religiously monopolize AI capability and research trigger a lot of red flags. There are even upvoted posts debating whether it’s “good” to publicize interpretability research. This screams cultist egoism to me.
Asking others to be altruistic is also a non-cooperative action. You need to pay people directly, not bully them into working for the greater good. A society in which people aren’t allowed to prioritize their self-interest is a society of slave bees.
Altruism needs to be self-initiated and shown, not told.
But the outside view that LLMs are hitting a wall and are “stochastic parrots” is true? GPT4O has been weaker and cheaper than GPT4T in my experience, and the same is true of GPT4T vs. GPT4. The two versions of GPT4 seem about the same. Opus is a bit stronger than GPT4, but not by much, and not in every topic. Both Opus and GPT4 exhibit patterns of being stochastic autocompleters, not logicians. (Humans aren’t that much better, of course. People are terrible at even trivial math. Logic and creativity are difficult.) DALL-E etc. don’t really have an artistic sense, and still need prompt engineering to produce beautiful art. Gemini 1.5 Pro is even weaker than GPT4, and I’ve heard Gemini Ultra has been retired from public access. All of these models get worse as their context grows, and their grasp of long-range dependencies is terrible.
The pace is of course still not bad compared with other technologies, but there don’t seem to be any long-context “Q*” GPT5s in store from any company.
PS: Does lmsys do anything to control for the speed effect? GPT4O is very fast, and that alone should account for many Elo points.
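To put a rough number on the speed-effect worry, here is a minimal sketch of the standard Elo expected-score formula, inverted to show what rating gap a given pairwise win rate implies. The 55% figure is a hypothetical latency-driven preference of my own choosing, not anything measured from lmsys:

```python
import math

def elo_gap_for_winrate(p):
    # Invert the Elo expected-score formula p = 1 / (1 + 10**(-d/400))
    # to get the rating gap d that corresponds to win rate p.
    return 400 * math.log10(p / (1 - p))

# If raters preferred the faster model 55% of the time on latency
# alone, with answer quality otherwise equal, that preference would
# show up as roughly a 35-point Elo gap.
print(round(elo_gap_for_winrate(0.55)))  # → 35
```

So even a modest speed-driven preference, if uncontrolled for, could account for a rating gap of the size that separates nearby models on the leaderboard.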