This is very interesting!
My main question, which I didn’t see an answer to on my first read, is: If the agent proves that action a leads to the goal G being achieved, are there any conditions under which this implies that the goal actually is achieved? The procrastination paradox shows that this isn’t true in general. Is there a class of sentences (e.g., all Π1 sentences, though I don’t expect that to be true in this case) such that if PA⋆⊢φ then N⊨φ? In other words, do we have some guarantee that we do better at actually achieving G than an agent which uses the system BAD:=PA+Con(BAD)?
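To make the baseline concrete, here is a short sketch (a standard argument, not from the paper) of why BAD is such a low bar: by the diagonal lemma and Gödel's second incompleteness theorem, BAD is outright inconsistent, so it "proves" that every action achieves G with no guarantee about the actual world.

```latex
% Sketch (standard argument, not from the paper): BAD := PA + Con(BAD) is inconsistent.
% 1. By the diagonal lemma, fix a theory BAD with BAD = PA + Con(BAD).
% 2. If BAD were consistent, Goedel's second incompleteness theorem would give
%    BAD \nvdash Con(BAD); but Con(BAD) is an axiom of BAD. Contradiction.
% 3. Hence BAD is inconsistent and proves everything, in particular every
%    sentence of the form (a-bar -> G), regardless of what is true in N.
\begin{align*}
  \mathrm{BAD} &:= \mathrm{PA} + \mathrm{Con}(\mathrm{BAD}) \\
  \mathrm{BAD}\ \text{consistent} &\;\Rightarrow\; \mathrm{BAD} \nvdash \mathrm{Con}(\mathrm{BAD}) && \text{(G\"odel II)} \\
  \mathrm{BAD} &\vdash \mathrm{Con}(\mathrm{BAD}) && \text{(axiom)} \\
  \therefore\ \mathrm{BAD} &\vdash \varphi \quad \text{for every } \varphi && \text{(inconsistency)}
\end{align*}
```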
A more technical point: Given that you claim that Theorem 3.2 (reflectively coherent quantified belief) follows “trivially” from Theorem 2.4, you presumably mean Theorem 2.4 to state that PA⋆⊢∀x.□┌φ(x̄)┐→φ(x), rather than PA⋆⊢∀x.□┌φ┐→φ. (Even if you consider φ to implicitly contain an occurrence of x, it is essential that in □┌φ(x̄)┐ the free variable is replaced by the numeral corresponding to the outer value of x.)
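To spell out the difference (my own illustration, not from the paper): in the first statement the quoted sentence varies with x, while in the second the quoted term is a single fixed Gödel number of an open formula, so the antecedent does not depend on x at all.

```latex
% Illustration (not from the paper) of why the numeral matters.
% Intended (uniform) reflection: the numeral for the outer x is substituted
% inside the quotes, so the antecedent is about the sentence phi(n-bar) for each n.
% In the second form, \ulcorner\varphi\urcorner denotes one fixed code of an
% open formula, so the antecedent is the same provability claim for every x.
\begin{align*}
  \text{intended:} \quad & PA^{\star} \vdash \forall x\,\bigl(\Box\ulcorner\varphi(\bar{x})\urcorner \to \varphi(x)\bigr) \\
  \text{literal reading:} \quad & PA^{\star} \vdash \forall x\,\bigl(\Box\ulcorner\varphi\urcorner \to \varphi(x)\bigr)
\end{align*}
```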
Nice work!
Minor note: in equation 1, I think the i should be an n.
I’m not all that familiar with paraconsistent logic, so many of the details are still opaque to me. However, I do have some intuitions about where there might be gremlins:
Solution 4.1 reads, “The agent could, upon realizing the contradiction, …” You’ve got to be a bit careful here: the formalism you’re using doesn’t contain a reasoner that does something like “realize the contradiction.” As stated, the agent is simply constructed to execute an action a if it can prove ┌ā→G┐; it is not constructed to also reason about whether that proof was contradictory.
You could perhaps construct a system with an action condition of ┌(ā→G)∧¬(ā→¬G)┐, but I expect that this will re-introduce many of the difficulties faced in a consistent logic (because this basically says “execute a if a consistently achieves G,” and my current guess is that it’s pretty hard to say “consistently” in a paraconsistent logic).
Or, in other words, I pretty strongly suspect that if you attempt to formalize a solution such as solution 4.1, you’ll find lots of gremlins.
For similar reasons, I also expect solution 4.2 to be very difficult to formalize. What precisely is the action condition of an agent that “notices” when both ā→G and ā→¬G are provable? I don’t know paraconsistent logic well enough yet to know how the obvious agent (with the action condition from two paragraphs above) behaves, but I’m guessing it’s going to be a little difficult to work with.
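To make the two candidate conditions concrete, here is a minimal Python sketch; `proves` is a hypothetical stand-in for the agent’s proof search in PA⋆, and nothing here is from the paper.

```python
from typing import Callable, Optional

# Hypothetical stand-in for proof search in PA* (not from the paper):
# proves(sentence) returns True iff the agent finds a proof of `sentence`.
Prover = Callable[[str], bool]


def naive_condition(proves: Prover, a: str, G: str) -> bool:
    """Execute a if the agent proves 'a -> G' (the action condition as stated)."""
    return proves(f"({a} -> {G})")


def strengthened_condition(proves: Prover, a: str, G: str) -> bool:
    """Execute a only if the agent proves '(a -> G) & ~(a -> ~G)',
    i.e. a achieves G and does not also provably achieve ~G."""
    return proves(f"(({a} -> {G}) & ~({a} -> ~{G}))")


def choose_action(proves: Prover, actions: list, G: str,
                  condition=strengthened_condition) -> Optional[str]:
    """Return the first action whose condition holds, else None (take no action)."""
    for a in actions:
        if condition(proves, a, G):
            return a
    return None
```

Whether the strengthened condition behaves well inside a paraconsistent proof system is exactly the open question here; the sketch deliberately treats the prover as a black box.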
Regardless, there do seem to be some promising aspects to the paraconsistent approach, and I’m glad you’re looking into it!
Section 4.1 is fairly unpolished. I’m still looking for better ways of handling the problems it brings up; solutions 4.1 and 4.2 are very preliminary stabs in that direction.
The action condition you mention might work. I don’t think it would re-introduce Löbian or similar difficulties, as it merely requires that ā imply that G is “true only” (one of LP’s truth values), rather than both true and false. Furthermore, we still have our internally provable T-schema, which does not depend on the action condition, from which we can derive that if the child can prove (ā→G)∧¬(ā→¬G), then so can the parent. It is important to note that “most” of PA⋆ (almost everything we are interested in) is consistent without problem.
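For readers unfamiliar with LP, here is a brief reminder of the standard LP semantics behind “true only” versus “both” (this is textbook LP, not specific to the paper; exactly how the combined action condition behaves depends on which conditional is used).

```latex
% LP has three truth values: t ("true only"), b ("both true and false"),
% f ("false only"); t and b are designated (count as holding).
% Negation, and conjunction/disjunction as min/max under the order t > b > f:
\[
\begin{array}{c|ccc}
  v(A)        & t & b & f \\ \hline
  v(\neg A)   & f & b & t
\end{array}
\qquad
v(A \wedge B) = \min\bigl(v(A), v(B)\bigr), \qquad
v(A \vee B) = \max\bigl(v(A), v(B)\bigr)
\]
```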
Now that I think about it, your action condition should be a requirement for paraconsistent agents, as otherwise they will be willing to do things that they can prove will not accomplish G. There may yet be a situation which breaks this, but I have not come across it.
Thanks for writing this up! A couple questions:
Does LP⊃ from this paper also work instead of LP? If not, do you have a link to a presentation of LP’s axioms and rules of inference?
I’m curious what step in Löb’s theorem fails in PA⋆. Obviously the proof uses modus ponens many times, but I don’t have a good intuition for which are allowed and which aren’t in PA⋆.
Any extension of LP will work as well, unless it leads to triviality; however, we cannot in general validate the T-schema for an arbitrary conditional. For example, any conditional → that satisfies the inference of contraction (α→(α→β) ⊩ α→β) will succumb to Curry’s paradox and lead to triviality (consider the fixed-point formula γ of the form T(┌γ┐)→⊥).
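For concreteness, here is the standard Curry derivation (textbook material, not from the paper) showing how contraction plus a transparent T-schema yields triviality; step 1 also uses transitivity of the conditional.

```latex
% Curry's paradox: let \gamma be a fixed point with
%   \gamma \leftrightarrow (T\ulcorner\gamma\urcorner \to \bot),
% and assume the T-schema T\ulcorner\gamma\urcorner \leftrightarrow \gamma,
% modus ponens, and transitivity for the conditional.
\begin{align*}
  1.\;& T\ulcorner\gamma\urcorner \to (T\ulcorner\gamma\urcorner \to \bot)
      && \text{chain T-schema with the fixed point} \\
  2.\;& T\ulcorner\gamma\urcorner \to \bot
      && \text{contraction on 1} \\
  3.\;& \gamma
      && \text{from 2 and the fixed point (right-to-left)} \\
  4.\;& T\ulcorner\gamma\urcorner
      && \text{T-schema on 3} \\
  5.\;& \bot
      && \text{modus ponens on 2, 4}
\end{align*}
```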
Now, as far as I can tell, we can validate the T-schema of the conditional for RM3; since LP⊃’s conditional is expressively equivalent, it seems like it shouldn’t be too hard to show that we can also validate its T-schema. For further discussion on the topic (with proofs, etc.), see chapter 8 of Priest’s part of volume XI of the second edition of Gabbay and Guenthner’s Handbook of Philosophical Logic (what a mouthful of of’s).
A thorough presentation of LP can be found in Priest’s An Introduction to Non-Classical Logic, section 7.4. Sections 7.1–7.3 introduce the general structures of such logics and delineate their differences. This book does not say much about truth schemas.
Priest’s paper “The Logic of Paradox” from ’79 also describes LP, although I find the presentation a bit clunky. For a better grasp of how inference works in LP, look at section III.9; there are examples of both inferences that are valid and inferences that do not always hold even though we may expect them to.
I will probably be writing up a minimal formal description soon, as I code the Prolog meta-interpreter.
We can prove non-triviality of LP and that the T-schema holds, so as a consequence Löb must fail. This seems somewhat unsatisfactory as an answer to which step fails. I can think of a few reasons why Löb might fail; I haven’t seen a discussion of this in the literature, so I will try to cobble together an answer, or the shape thereof, from what I know.
One possible reason is that pseudo modus ponens fails in LP. That is, though A, A→B might entail B for a well-chosen conditional, A∧(A→B)→B must fail, or else lead to triviality due to Curry’s paradox.
More likely is that this is a consequence of the way paraconsistency works with modal logic. From the Wikipedia article you linked, we have that “in normal modal logic, Löb’s axiom is equivalent to the conjunction of the axiom schema 4, (□A→□□A), and the existence of modal fixed points.” We have both of those, so the difference must stem from the phrase “normal modal logic.” The proof uses the existence of a modal fixed point of a certain form; this may not hold in a modal extension of LP (which would be analogous to KM3, the modal extension of RM3, just as K is the modal extension of classical logic). More details on modal extensions of paraconsistent logics can be found in McGinnis’ paper “Tableau Systems for Some Paraconsistent Modal Logics,” though this paper does not mention modal fixed points. In fact, none of the papers I’ve read so far on paraconsistent modal logics have mentioned fixed points. I would construct a proof on my own, but I am currently too unfamiliar with modal logic; if it is important, I can go learn enough and do it.
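For reference, here is the standard derivation of Löb in a normal modal logic (textbook material; it is a claim about normal modal logic, not about KM3 or PA⋆), so one can see exactly which ingredients a paraconsistent modal extension would have to reject. Note how many steps are plain modus ponens or propositional chaining.

```latex
% Standard proof of Loeb's rule in a normal modal logic with axiom 4 and a
% modal fixed point \psi with \psi \leftrightarrow (\Box\psi \to \varphi).
% Assume \Box\varphi \to \varphi is a theorem; derive \varphi.
\begin{align*}
  1.\;& \psi \to (\Box\psi \to \varphi)              && \text{fixed point (left-to-right)} \\
  2.\;& \Box\psi \to \Box(\Box\psi \to \varphi)      && \text{necessitation + K on 1} \\
  3.\;& \Box\psi \to (\Box\Box\psi \to \Box\varphi)  && \text{K on 2} \\
  4.\;& \Box\psi \to \Box\Box\psi                    && \text{axiom 4} \\
  5.\;& \Box\psi \to \Box\varphi                     && \text{from 3, 4} \\
  6.\;& \Box\psi \to \varphi                         && \text{from 5 and } \Box\varphi \to \varphi \\
  7.\;& \psi                                         && \text{from 6 and the fixed point (right-to-left)} \\
  8.\;& \Box\psi                                     && \text{necessitation on 7} \\
  9.\;& \varphi                                      && \text{modus ponens on 6, 8}
\end{align*}
```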
Thanks for your comments! I hope to have at least somewhat answered your questions. Paraconsistency is still a very strange subject to me, so my studies and understanding of it are very much incomplete.