For lack of a better name =)
The idea is to use current AI technologies, such as language models, to build an impartial AI that understands ethics as humans do, and possibly even better than we do.
You heard me right: just as an AI can be smarter than a human, we should also accept the fact that we are not morally perfect creatures, and that it is possible to create an AI that is better than us at, for example, spotting injustice. See Free agents for more details.
If you are familiar with philosophical language: my objective is a philosopher AI that works out epistemology and ethics on its own, and then communicates its beliefs.
If you look at reality through the lens of AI alignment: I am saying that going for ‘safe’ or ‘aligned’ is kind of lame, and that aiming for ‘moral’ is better. Instead of trying to limit the side effects of, or fix, agents that are morally clueless, I’d like to see more people working on agents that perceive and interpret the world from a human-like point of view.
If you are looking for a place to start, I suggest that you have a look at Free agents and decide where to go from there. Although it touches on some technical subjects, I tried to write that post for a relatively broad audience.
This sequence is simply a collection of posts on the same topic, in chronological order. The next post will probably be somewhat mathematical in nature; after that, I expect the posts to become more algorithmic and, eventually, to cover practical experiments run on hardware.
This sequence is also available on the EA Forum.