RSS

AI Robustness

TagLast edit: Oct 24, 2022, 10:37 PM by markov

AI Robustness is an agents ability to maintain its goal and its capabilities when exposed to different data distributions or environments.

Ro­bust­ness to Scale

Scott GarrabrantFeb 21, 2018, 10:55 PM
130 points
23 comments2 min readLW link1 review

AI Safety in a World of Vuln­er­a­ble Ma­chine Learn­ing Systems

Mar 8, 2023, 2:40 AM
70 points
28 comments29 min readLW link
(far.ai)

Desider­ata for an AI

Nathan Helm-BurgerJul 19, 2023, 4:18 PM
9 points
0 comments4 min readLW link

Ran­dom Ob­ser­va­tion on AI goals

FTPickleApr 8, 2023, 7:28 PM
−11 points
2 comments1 min readLW link

Ro­bust­ness to Scal­ing Down: More Im­por­tant Than I Thought

adamShimiJul 23, 2022, 11:40 AM
38 points
5 comments3 min readLW link

AXRP Epi­sode 17 - Train­ing for Very High Reli­a­bil­ity with Daniel Ziegler

DanielFilanAug 21, 2022, 11:50 PM
16 points
0 comments35 min readLW link

[Question] Do the Safety Prop­er­ties of Pow­er­ful AI Sys­tems Need to be Ad­ver­sar­i­ally Ro­bust? Why?

DragonGodFeb 9, 2023, 1:36 PM
22 points
42 comments2 min readLW link

Squeez­ing foun­da­tions re­search as­sis­tance out of for­mal logic nar­row AI.

Donald HobsonMar 8, 2023, 9:38 AM
16 points
1 comment2 min readLW link

What’s new at FAR AI

Dec 4, 2023, 9:18 PM
41 points
0 comments5 min readLW link
(far.ai)

2023 Align­ment Re­search Up­dates from FAR AI

Dec 4, 2023, 10:32 PM
18 points
0 comments8 min readLW link
(far.ai)

Ar­gu­ments for Ro­bust­ness in AI Alignment

Fabian SchimpfJan 19, 2024, 10:24 AM
2 points
1 comment1 min readLW link

On In­ter­pretabil­ity’s Robustness

WCargoOct 18, 2023, 1:18 PM
11 points
0 comments4 min readLW link

Is there a ML agent that aban­dons it’s util­ity func­tion out-of-dis­tri­bu­tion with­out los­ing ca­pa­bil­ities?

Christopher KingFeb 22, 2023, 4:49 PM
1 point
7 comments1 min readLW link

Beyond the Board: Ex­plor­ing AI Ro­bust­ness Through Go

AdamGleaveJun 19, 2024, 4:40 PM
41 points
2 comments1 min readLW link
(far.ai)

Ro­bust­ness & Evolu­tion [MLAISU W02]

Esben KranJan 13, 2023, 3:47 PM
10 points
0 comments3 min readLW link
(newsletter.apartresearch.com)

Does ro­bust­ness im­prove with scale?

Jul 25, 2024, 8:55 PM
14 points
0 comments1 min readLW link
(far.ai)

Why Re­cur­sive Self-Im­prove­ment Might Not Be the Ex­is­ten­tial Risk We Fear

Nassim_ANov 24, 2024, 5:17 PM
1 point
0 comments9 min readLW link

Work­shop Re­port: Why cur­rent bench­marks ap­proaches are not suffi­cient for safety?

Nov 26, 2024, 5:20 PM
3 points
1 comment3 min readLW link

Gra­di­ent Anatomy’s—Hal­lu­ci­na­tion Ro­bust­ness in Med­i­cal Q&A

DieSabFeb 12, 2025, 7:16 PM
2 points
0 comments10 min readLW link

In­vest­ing in Ro­bust Safety Mechanisms is crit­i­cal for re­duc­ing Sys­temic Risks

Dec 11, 2024, 1:37 PM
8 points
3 comments2 min readLW link
No comments.