Making a research platform for AI Alignment at https://ai-plans.com/
Come critique AI Alignment plans and get feedback on your alignment plan!
A challenge for folks interested: spend 2 weeks without media based entertainment.
“CESI’s Artificial Intelligence Standardization White Paper released in 2018 states
that “AI systems that have a direct impact on the safety of humanity and the safety of life,
and may constitute threats to humans” must be regulated and assessed, suggesting a broad
threat perception (Section 4.5.7). In addition, a TC260 white paper released in 2019 on AI
safety/security worries that “emergence” (涌现性) by AI algorithms can exacerbate the
black box effect and “autonomy” can lead to algorithmic “self-improvement” (Section
3.2.1.3).”
From https://concordia-consulting.com/wp-content/uploads/2023/10/State-of-AI-Safety-in-China.pdf
I disagree with this paragraph today: “A lot of what AI does currently, that is visible to the general public seems like it could be replicated without AI”
I was talking about what a farmer could do. A consumer can get their eggs/milk from such a farmer and fund or invest in such a farm, if they’re able to.
Or talk to a local farm about setting aside some chickens, pay for them to be given extra space, better treatment, etc.
I don’t really know what you mean about the EA reducetarian stuff.
Also, if you as an individual want to be healthy, not contribute to harming animals, and have the time, space, money, willingness, etc. to raise some chickens, why not?
Exercise in general is pretty great, yes. Especially if done outdoors, imo.
Could a solution to some of this be to raise some chickens for eggs, treat them nicely, give them space to roam, etc?
Obviously the best would be to raise cows as well, treat them well, not kill the male calves, etc.- but that’s much less of an option for most.
This is great! Thank you for doing this! Might add some of these to ai-plans.com!
Yes, winning is fun!
I think this kind of thing makes people feel like you’re pushing a message, to which the automatic response is to push back.
What I’ve found works is to be agreeable and inviting, meet them at their own values, and present it as a hard problem to solve which isn’t being competently tackled by some other dumb group (not us, we wouldn’t do this).
That kind of thing. Had a 100% success rate so far.
I’m simplifying my approach, since I’m not spending a lot of time on this- but if you assume I’m not a dumbass and think about what kind of approach like this could work well, without being dumb in the sense of failing to actually address the problem, you’ll probably get what I mean.
I’m generally disincentivized to post or put effort into a post from the system where someone can just heavily downvote my post, without even giving a reason.
A simple way to improve this system would be to require someone to comment or give a reason when heavily upvoting or downvoting something.
“In the ancestral environment, politics was a matter of life and death.”—this is a pretty strong statement to make with no evidence to back it up.
What about orgs such as ai-plans.com, which aim to be exponentially useful for AI Safety?
I think your ideas are some of the most promising I’ve seen- I’d love to see them pursued further, though I’m concerned about the air-gapping.
Hi Ruby! Thanks for the great feedback!! Sorry for the late reply, I’ve been working on the site!
So, we’re not doing just criticisms anymore- we’re ranking plans by Total Strength score minus Total Vulnerabilities score. Quite a few researchers have been posting their plans on the site!
Going to do a full rebuild soon, to make the site look nicer and be even faster to work on.
We’re also holding regular critique-a-thons. The last one went very well!
We had 40+ submissions and produced what I think is really great work!
We also made a Broad List of Vulnerabilities in the first two days! https://docs.google.com/document/d/1tCMrvJEueePNgb2_nOEUMc_UGce7TxKdqI5rOJ1G7C0/edit?usp=sharing
On not getting all of a plan’s details without talking to the person a lot- I think this is a vulnerability in communication.
A serious plan, with the intention of actually solving the problem, should have the effort put into it to make it clear to a reader what it actually is, what problems it aims to solve, why it aims to solve them and how it seeks to do so.
A failure to do so is silly for any serious strategy.
The good thing is, that if such a vulnerability is pointed out, on AI-Plans.com, the poster can see the vulnerability and iterate on it!
This was really great. Thanks for making it.
I was curious why Trump was dropping some of the best takes!
Yeah, I think you’re right- at least about the sequences.
I think something more specific about attitudes would be more accurate and useful.
When I say media, I mean social media, movies, videos, books, etc.- any type of recording or similar thing that you’re using as entertainment.
I’m trying this myself. I’ve done single days before, sometimes 2 or 3 days, but failed to keep it consistent. I did find that when I did it, my work output was far higher and of greater quality, I had a much better sleep schedule, and I was generally in a much more enjoyable mood.
I also ended up spending more time with friends and family, meeting new people, trying interesting things, spending time outdoors, etc.
This time I’m building up to it- starting with 1 media free hour a day, then 2 hours, then 3, etc.
I think building up to it will let me build new habits which will stick more.