Formal Proof

TagLast edit: Sep 26, 2021, 10:04 PM by Pablo

A Formal Proof is a finite sequence of steps from axiom(s) or previous derived proof(s) which strictly follow the allowed rules of inference of the mathematical system in which it exists. They are used to establish statements as true within a mathematical framework in a way which can be independently verified with extremely high certainty, with the most reliable flavor of proof being machine-checked proofs generated by proof assistants since they have even less room for human error.

Proofs, Implications, and Models

Eliezer YudkowskyOct 30, 2012, 1:02 PM

131 points

218 comments12 min readLW link

Compact Proofs of Model Performance via Mechanistic Interpretability

LawrenceC, rajashree, Adrià Garriga-alonso and Jason Gross

Jun 24, 2024, 7:27 PM

96 points

4 comments8 min readLW link

(arxiv.org)

A List of things I might do with a Proof Oracle

Logan ZoellnerFeb 5, 2023, 6:14 PM

−14 points

13 comments3 min readLW link

Most Minds are Irrational

DavidmanheimDec 10, 2024, 9:36 AM

17 points

4 comments10 min readLW link

[Question] What Programming Language Characteristics Would Allow Provably Safe AI?

DavidmanheimAug 28, 2019, 10:46 AM

4 points

9 comments1 min readLW link

AXRP Episode 40 - Jason Gross on Compact Proofs and Interpretability

DanielFilanMar 28, 2025, 6:40 PM

22 points

0 comments89 min readLW link

Squeezing foundations research assistance out of formal logic narrow AI.

Donald HobsonMar 8, 2023, 9:38 AM

16 points

1 comment2 min readLW link

Eleuther releases Llemma: An Open Language Model For Mathematics

mako yassOct 17, 2023, 8:03 PM

22 points

0 comments1 min readLW link

(blog.eleuther.ai)

Davidad’s Bold Plan for Alignment: An In-Depth Explanation

Charbel-Raphaël and Gabin

Apr 19, 2023, 4:09 PM

168 points

40 comments21 min readLW link 2 reviews

Formal Proof: O(n) Is a Cognitive Illusion

Daniil StrizhovMar 28, 2025, 6:26 PM

0 points

0 comments38 min readLW link

Roadmap for a collaborative prototype of an Open Agency Architecture

Deger TuranMay 10, 2023, 5:41 PM

31 points

0 comments12 min readLW link

I bet $500 on AI winning the IMO gold medal by 2026

azsantoskMay 11, 2023, 2:46 PM

37 points

29 comments1 min readLW link

Fundamentals of Formalisation Level 5: Formal Proof

philip_bJul 9, 2018, 8:55 PM

13 points

0 comments1 min readLW link

Infra-Domain proofs 1

DiffractorMar 28, 2021, 9:16 AM

13 points

0 comments23 min readLW link

Infra-Domain Proofs 2

DiffractorMar 28, 2021, 9:15 AM

13 points

0 comments21 min readLW link

Allowing a formal proof system to self improve while avoiding Lobian obstacles.

Donald HobsonJan 23, 2019, 11:04 PM

6 points

4 comments2 min readLW link

[Math] Towards Proof Writing as a Skill In Itself

Andrew QuinnJun 13, 2018, 4:39 AM

25 points

8 comments2 min readLW link

The value of learning mathematical proof

JonahSJun 2, 2015, 3:15 AM

8 points

42 comments1 min readLW link

An Illustrated Proof of the No Free Lunch Theorem

lifelonglearnerJun 8, 2020, 1:54 AM

19 points

0 comments1 min readLW link

(mlu.red)

An example of self-fulfilling spurious proofs in UDT

cousin_itMar 25, 2012, 11:47 AM

33 points

43 comments2 min readLW link

Weak HCH accesses EXP

evhubJul 22, 2020, 10:36 PM

16 points

0 comments3 min readLW link

Alignment proposals and complexity classes

evhubJul 16, 2020, 12:27 AM

40 points

26 comments13 min readLW link

LBIT Proofs 5: Propositions 29-38

DiffractorDec 16, 2020, 3:35 AM

8 points

0 comments21 min readLW link

LBIT Proofs 1: Propositions 1-9

DiffractorDec 16, 2020, 3:48 AM

7 points

0 comments25 min readLW link

LBIT Proofs 6: Propositions 39-47

DiffractorDec 16, 2020, 3:33 AM

7 points

0 comments23 min readLW link

LBIT Proofs 2: Propositions 10-18

DiffractorDec 16, 2020, 3:45 AM

7 points

0 comments20 min readLW link

Proofs Section 2.3 (Updates, Decision Theory)

DiffractorAug 27, 2020, 7:49 AM

8 points

0 comments31 min readLW link

Proofs Section 2.2 (Isomorphism to Expectations)

DiffractorAug 27, 2020, 7:52 AM

8 points

0 comments46 min readLW link

A proof of Löb’s theorem in Haskell

cousin_itSep 19, 2014, 1:01 PM

52 points

8 comments3 min readLW link

Counterfactual Induction (Algorithm Sketch, Fixpoint proof)

DiffractorDec 17, 2019, 5:04 AM

5 points

2 comments7 min readLW link

Logical inductor limits are dense under pointwise convergence

SamEisenstatOct 6, 2016, 8:07 AM

5 points

0 comments6 min readLW link

Formalized math: dream vs reality

cousin_itJul 9, 2009, 8:51 PM

19 points

10 comments2 min readLW link

Progress on automated mathematical theorem proving?

JonahSJul 3, 2013, 6:40 PM

26 points

65 comments1 min readLW link

Proofs Section 1.1 (Initial results to LF-duality)

DiffractorAug 27, 2020, 7:59 AM

8 points

0 comments20 min readLW link

Proofs Section 1.2 (Mixtures, Updates, Pushforwards)

DiffractorAug 27, 2020, 7:57 AM

8 points

0 comments14 min readLW link

Proofs Section 2.1 (Theorem 1, Lemmas)

DiffractorAug 27, 2020, 7:54 AM

8 points

0 comments36 min readLW link

LBIT Proofs 4: Propositions 22-28

DiffractorDec 16, 2020, 3:38 AM

7 points

0 comments17 min readLW link

LBIT Proofs 7: Propositions 48-52

DiffractorDec 16, 2020, 3:31 AM

7 points

0 comments20 min readLW link

LBIT Proofs 8: Propositions 53-58

DiffractorDec 16, 2020, 3:29 AM

7 points

0 comments18 min readLW link

LBIT Proofs 3: Propositions 19-22

DiffractorDec 16, 2020, 3:40 AM

8 points

0 comments17 min readLW link

Social Choice Theory and Logical Handshakes

StrivingForLegibilityDec 29, 2023, 3:49 AM

17 points

0 comments4 min readLW link

Interview Daniel Murfet on Universal Phenomena in Learning Machines

Alexander Gietelink OldenzielFeb 6, 2023, 12:00 AM

50 points

1 comment16 min readLW link

Speedrunning 4 mistakes you make when your alignment strategy is based on formal proof

QuinnFeb 16, 2023, 1:13 AM

63 points

18 comments2 min readLW link

Question/Issue with the 5/10 Problem

acgtNov 29, 2021, 10:45 AM

6 points

3 comments3 min readLW link

Planning to build a cryptographic box with perfect secrecy

Lysandre TerrisseDec 31, 2023, 9:31 AM

40 points

6 comments11 min readLW link

Limitations on Formal Verification for AI Safety

Andrew DicksonAug 19, 2024, 11:03 PM

134 points

60 comments23 min readLW link

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Joar SkalseMay 17, 2024, 7:13 PM

67 points

10 comments2 min readLW link

A list of core AI safety problems and how I hope to solve them

davidadAug 26, 2023, 3:12 PM

165 points

29 comments5 min readLW link

An Opinionated Look at Inference Rules

Gianluca CalcagniSep 3, 2024, 1:32 PM

−5 points

2 comments13 min readLW link

Measuring Nonlinear Feature Interactions in Sparse Crosscoders [Project Proposal]

Jason Gross and rajashree

Jan 6, 2025, 4:22 AM

19 points

0 comments12 min readLW link

Video Intro to Guaranteed Safe AI

Mike Vaiana, Diogo de Lucena and AE Studio

Jul 11, 2024, 5:53 PM

27 points

0 comments1 min readLW link

(youtu.be)

[Question] Searching for Impossibility Results or No-Go Theorems for provable safety.

MaelstromSep 27, 2024, 8:12 PM

2 points

1 comment1 min readLW link

The Fundamental Circularity Theorem: Why Some Mathematical Behaviours Are Inherently Unprovable

Alister MundayJan 22, 2025, 6:20 PM

−11 points

2 comments4 min readLW link

No comments.

For­mal Proof

Formal Proof