Propositional Logic

Preface: these notes are primarily based on Chapter 2 of Ertel's text.


We introduce both the syntax and semantics of propositional logic.

Syntax

Well-formed formula

DEFINITION:
Given a set Σ of base literals, well-formed formulas are defined inductively: t, f, and each literal in Σ are (atomic) formulas, and if F and G are formulas, then so are ¬F, (F ∧ G), (F ∨ G), (F ⇒ G), and (F ⇔ G).

Semantics

DEFINITION:
An assignment (also called an interpretation, world, or model) is a function, Σ → {t, f}, that maps each base literal to either true or false.

For a given assignment, any well-formed formula will evaluate to either true or false, using standard Boolean algebra. Specifically, our operators are as follows:

syntax    operator      pronunciation
¬A        negation      "not A"
A ∧ B     conjunction   "A and B"
A ∨ B     disjunction   "A or B"
A ⇒ B     implication   "if A then B"
A ⇔ B     equivalence   "A if and only if B"

When forming compound formulas, we assume the precedence of operators is ordered as given in the above table (with ¬ binding most tightly), and operators of equal precedence are evaluated left-to-right. Therefore the formula:

A ∧ B ⇒ ¬ C ∨ D ∧ E ⇔ F ⇒ G
would be interpreted as if parenthesized in the following way
((A ∧ B) ⇒ ((¬ C) ∨ (D ∧ E))) ⇔ (F ⇒ G)
We will try to avoid relying on left-to-right ordering of implications and equivalence, although technically
A ⇒ B ⇒ C
is interpreted as
(A ⇒ B) ⇒ C
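These precedence and associativity rules can be made concrete with a small parser (a sketch, not from the text, using ASCII stand-ins !, &, |, =>, <=> for the five connectives):

```python
import re

# precedence: higher binds tighter; ! (negation) binds tightest of all,
# and operators of equal precedence associate left-to-right
PREC = {'<=>': 1, '=>': 2, '|': 3, '&': 4}

def parse(text):
    tokens = re.findall(r'<=>|=>|[A-Za-z]\w*|[!&|()]', text)

    def atom():
        tok = tokens.pop(0)
        if tok == '(':
            inner = expr(1)
            tokens.pop(0)          # discard the matching ')'
            return inner
        if tok == '!':
            return ('!', atom())   # negation applies to the tightest operand
        return tok                 # a proposition symbol

    def expr(min_prec):
        left = atom()
        while tokens and tokens[0] in PREC and PREC[tokens[0]] >= min_prec:
            op = tokens.pop(0)
            left = (op, left, expr(PREC[op] + 1))  # +1 gives left associativity
        return left

    return expr(1)

# the long example formula parses exactly as the manual parenthesization shows
expected = ('<=>',
            ('=>', ('&', 'A', 'B'), ('|', ('!', 'C'), ('&', 'D', 'E'))),
            ('=>', 'F', 'G'))
assert parse('A & B => !C | D & E <=> F => G') == expected
```

The nested-tuple output makes the implied parenthesization explicit, so the two worked examples in the text can be checked mechanically.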

It is easiest to describe the semantics by providing a truth table, giving the value of a formula for each assignment of the base literals.

A  B     ¬A    A ∧ B    A ∨ B    A ⇒ B    A ⇔ B
t  t      f      t        t        t        t
t  f      f      f        t        f        f
f  t      t      f        t        t        f
f  f      t      f        f        t        t
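The table can be reproduced mechanically; a short Python sketch, using a helper for ⇒ and the == operator for ⇔:

```python
from itertools import product

def implies(a, b):
    """A ⇒ B is false only when A is true and B is false."""
    return (not a) or b

def tt(v):
    return 't' if v else 'f'  # render a Boolean in the table's notation

print('A B   ¬A  A∧B  A∨B  A⇒B  A⇔B')
for a, b in product([True, False], repeat=2):
    cols = [not a, a and b, a or b, implies(a, b), a == b]
    print(tt(a), tt(b), ' ', '    '.join(tt(c) for c in cols))
```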



DEFINITION:
We say that a given formula is:
  satisfiable if it is true for at least one assignment,
  valid (a tautology) if it is true for every assignment,
  unsatisfiable if it is true for no assignment, and
  falsifiable if it is false for at least one assignment.


DEFINITION:
Two formulas F and G are said to be semantically equivalent if they take on the same truth value for any interpretation.
We denote this as F ≡ G. In propositional logic, this is the same as saying that the formula F ⇔ G is a tautology.

THEOREM:
The following equivalences hold (each, read as a ⇔ formula, is a tautology):
¬A ∨ B ≡ A ⇒ B (implication)
A ⇒ B ≡ ¬B ⇒ ¬A (contraposition)
(A ⇒ B) ∧ (B ⇒ A) ≡ (A ⇔ B) (equivalence)
¬(A ∧ B) ≡ ¬A ∨ ¬B (De Morgan's laws)
¬(A ∨ B) ≡ ¬A ∧ ¬B
A ∨ (B ∧ C) ≡ (A ∨ B) ∧ (A ∨ C) (distributive laws)
A ∧ (B ∨ C) ≡ (A ∧ B) ∨ (A ∧ C)
A ∨ ¬A ≡ t (tautology)
A ∧ ¬A ≡ f (contradiction)
A ∨ f ≡ A
A ∨ t ≡ t
A ∧ f ≡ f
A ∧ t ≡ A
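Each of these laws can be verified by brute force over all assignments; a sketch in which Python lambdas stand in for the formulas:

```python
from itertools import product

def equivalent(f, g, n):
    """F ≡ G: the two formulas agree under every assignment of n symbols."""
    return all(f(*vs) == g(*vs) for vs in product([True, False], repeat=n))

# implication: ¬A ∨ B ≡ A ⇒ B
assert equivalent(lambda a, b: (not a) or b,
                  lambda a, b: b if a else True, 2)
# De Morgan: ¬(A ∧ B) ≡ ¬A ∨ ¬B
assert equivalent(lambda a, b: not (a and b),
                  lambda a, b: (not a) or (not b), 2)
# distributivity: A ∨ (B ∧ C) ≡ (A ∨ B) ∧ (A ∨ C)
assert equivalent(lambda a, b, c: a or (b and c),
                  lambda a, b, c: (a or b) and (a or c), 3)
```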


Proof Systems

We next consider the notion of a formula logically following from a set of existing knowledge, as well as ways in which we might formally prove such entailment.

DEFINITION:
We say that a knowledge base KB entails a formula Q (or equivalently that Q follows from KB) if every model of KB is also a model of Q.
We write KB ⊨ Q.

THEOREM:
KB ⊨ Q if and only if KB ⇒ Q is a tautology.

A Proof System is a constructive way to prove that a knowledge base entails a formula. In propositional logic, there are finitely many (2^n, for n base literals) possible assignments, and so examination of a truth table can be used as a proof system. Unfortunately, this is not a very efficient system.
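The truth-table proof system can be sketched directly from the definition of entailment: check that every assignment satisfying the knowledge base also satisfies the query (the lambda-based encoding and the two-symbol example are illustrative):

```python
from itertools import product

def entails(kb, query, n):
    """KB ⊨ Q: every assignment (over n symbols) satisfying KB satisfies Q.
    Both kb and query are Boolean functions of n arguments; cost is O(2^n)."""
    return all(query(*vals)
               for vals in product([True, False], repeat=n)
               if kb(*vals))

# KB = (A ⇒ B) ∧ A entails B, but does not entail ¬B
kb = lambda a, b: ((not a) or b) and a
assert entails(kb, lambda a, b: b, 2)
assert not entails(kb, lambda a, b: not b, 2)
```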

As a step in another direction, we note that entailment can be proven by contradiction based on the following variant of the earlier theorem.

THEOREM:
KB ⊨ Q if and only if KB ∧ ¬Q is unsatisfiable.
Proof: Note that (KB ∧ ¬Q) ≡ ¬(KB ⇒ Q), so it being unsatisfiable is the same as saying that (KB ⇒ Q) is a tautology.

This approach is used by PROLOG (and other systems), by temporarily adding the negated query ¬Q to the knowledge base and then deriving a contradiction (typically of the form A ∧ ¬A for some A).

A calculus is a purely syntactic proof system that uses discrete steps resulting in a derivation. We use the notation KB ⊢ Q if Q can be derived from KB in accordance with a given calculus.

DEFINITION:
A calculus is sound if every derived formula is semantically entailed. That is, for any KB and Q, if KB ⊢ Q, then KB ⊨ Q.

DEFINITION:
A calculus is complete if every entailed formula can be derived. That is, for any KB and Q, if KB ⊨ Q, then KB ⊢ Q.


Normal Forms

To keep automatic proof systems simple, they typically require that Boolean formulas adhere to a particular conventional syntax known as a normal form.

A commonly used form is conjunctive normal form (CNF), in which the formula is expressed as a conjunction of clauses, with each clause being a disjunction of positive or negative literals.

THEOREM:
Any well-formed formula can be transformed into an equivalent conjunctive normal form.

Proof: not given in the textbook, but a constructive algorithm can be defined:

  1. Remove use of the ⇒ and ⇔ operators by using equivalent formulations with ¬, ∧ and ∨.
  2. Use De Morgan's laws to "push" all negations down to the literals (canceling any double negations).
  3. Recursively convert based on the outermost remaining operator:
    1. If the outermost operator is ∧, then independently compute a CNF for each of its operands and trivially combine them.
    2. Otherwise, the outermost operator is ∨; compute a CNF of each of its operands and then repeatedly use the distributive law to combine those results.

Simplifications can be made to remove duplicate literals within a clause; moreover, any clause containing a pair of complementary literals (such as B ∨ ¬B) is trivially true and can be dropped from the conjunction entirely.
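The conversion algorithm can be sketched as one function per step (a sketch, not from the text; formulas are nested tuples such as ('imp', F, G), and clauses come back as frozensets of literals):

```python
def eliminate(f):
    """Step 1: rewrite ⇒ and ⇔ in terms of ¬, ∧, ∨."""
    op = f[0]
    if op == 'var':
        return f
    if op == 'not':
        return ('not', eliminate(f[1]))
    a, b = eliminate(f[1]), eliminate(f[2])
    if op == 'imp':
        return ('or', ('not', a), b)
    if op == 'iff':
        return ('and', ('or', ('not', a), b), ('or', ('not', b), a))
    return (op, a, b)

def push_not(f, negate=False):
    """Step 2: push negations down to the literals (De Morgan),
    canceling double negations along the way."""
    op = f[0]
    if op == 'var':
        return ('not', f) if negate else f
    if op == 'not':
        return push_not(f[1], not negate)
    if negate:
        op = 'or' if op == 'and' else 'and'
    return (op, push_not(f[1], negate), push_not(f[2], negate))

def cnf(f):
    """Step 3: recurse on the outermost operator, distributing ∨ over ∧.
    Returns a list of clauses, each a frozenset of literals."""
    op = f[0]
    if op in ('var', 'not'):
        return [frozenset([f])]
    left, right = cnf(f[1]), cnf(f[2])
    if op == 'and':
        return left + right
    return [c | d for c in left for d in right]   # op == 'or'

def to_cnf(f):
    return cnf(push_not(eliminate(f)))

A, B, C = ('var', 'A'), ('var', 'B'), ('var', 'C')
# A ∨ (B ∧ C) becomes (A ∨ B) ∧ (A ∨ C)
assert to_cnf(('or', A, ('and', B, C))) == [frozenset({A, B}),
                                            frozenset({A, C})]
```

The distribution step in cnf is where the worst-case exponential growth arises: each ∨ over two clause lists multiplies their sizes.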

Unfortunately, there are formulas for which conversion to an equivalent CNF requires an exponential increase in size.

Conjunctive normal form is particularly appealing when representing a knowledge base KB, as the knowledge is simply a collection of clauses, and new knowledge can be incorporated by adding further clauses.


Resolution

We will restate the resolution rule as follows. If we have a precondition (A ∨ B) ∧ (¬B ∨ C), then we can derive the resolvent (A ∨ C). Notice that the complementary literals B and ¬B appear in the respective two clauses.

We could allow A to be the literal f, in which case the resolution rule represents modus ponens, in that B ∧ (B ⇒ C) ⊢ C.

Since A and C could themselves be compound disjunctions, a more general view of resolution is as follows:

(A1 ∨ A2 ∨ ... ∨ Am ∨ B) ∧ (¬B ∨ C1 ∨ C2 ∨ ... ∨ Cn)
derives the resolvent
(A1 ∨ A2 ∨ ... ∨ Am ∨ C1 ∨ C2 ∨ ... ∨ Cn)

It is also worth noting that if a pair of clauses has two or more complementary pairs of literals, there is no benefit to resolving them, as the result is trivially true (for example, if we resolve on a complementary pair B1, ¬B1 and there exists another complementary pair B2, ¬B2 split across the two clauses, the resolvent will contain B2 ∨ ¬B2 and thus be a tautology).

Finally, if the two initial clauses have a shared literal with the same sign, we will only list that literal once in the resolvent, so as to avoid duplicates. For example, we allow (A ∨ B ∨ C) ∧ (A ∨ ¬B ∨ ¬D) to resolve to (A ∨ C ∨ ¬D), rather than the redundant syntax (A ∨ C ∨ A ∨ ¬D).

Resolution Calculus
Given an initial knowledge base, the resolution calculus states that we can repeatedly resolve any pair of clauses that have exactly one complementary pair of literals, adding the resolvent to the knowledge base.

If we ever find two complementary clauses, such as (A) and (¬A), we conventionally say that these resolve to an empty clause, reflecting the inherent inconsistency of the original knowledge base. Although resolution is not complete for derivation of general propositional logic formulas, we have a fundamental theorem, stated after the following example.

Example: (adapted from Neapolitan/Jiang)
Assume we have the following premises:
Premise 1) ¬B
Premise 2) A ⇒ B
Premise 3) ¬A ⇒ (C ∨ D)
Premise 4) C ⇒ D
Premise 5) F
Premise 6) F ⇒ ¬E

Goal is to prove D.

---Spoiler Alert---
We convert to CNF, add the negated goal, and then begin resolution (the choice of resolution steps is nondeterministic):
Fact 1) ¬B (premise 1)
Fact 2) ¬A ∨ B (premise 2)
Fact 3) A ∨ C ∨ D (premise 3)
Fact 4) ¬C ∨ D (premise 4)
Fact 5) F (premise 5)
Fact 6) ¬F ∨ ¬E (premise 6)
Fact 7) ¬D (the negated goal)
Fact 8) A ∨ C (Resolvent of 3 and 7)
Fact 9) B ∨ C (Resolvent of 2 and 8)
Fact 10) C (Resolvent of 1 and 9)
Fact 11) D (Resolvent of 4 and 10)
Fact 12) the empty clause, a contradiction (Resolvent of 7 and 11)
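The entire refutation can be mechanized; a sketch (not from the text) in which clauses are frozensets of literal strings, with '~' marking negation, saturating under resolution until the empty clause appears:

```python
def neg(lit):
    return lit[1:] if lit.startswith('~') else '~' + lit

def resolvents(c1, c2):
    """All resolvents of clauses c1 and c2, one per complementary pair
    (a resolvent over multiple pairs is a harmless tautology here)."""
    return [(c1 - {lit}) | (c2 - {neg(lit)})
            for lit in c1 if neg(lit) in c2]

def refutes(clauses):
    """Saturate under resolution; True iff the empty clause is derivable."""
    known = set(clauses)
    while True:
        new = {r for c1 in known for c2 in known for r in resolvents(c1, c2)}
        if frozenset() in new:
            return True       # empty clause: the input is unsatisfiable
        if new <= known:
            return False      # saturated with no empty clause: satisfiable
        known |= new

# CNF of the six premises plus the negated goal ¬D
kb = [frozenset(c) for c in ({'~B'}, {'~A', 'B'}, {'A', 'C', 'D'},
                             {'~C', 'D'}, {'F'}, {'~F', '~E'}, {'~D'})]
assert refutes(kb)
```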

THEOREM:
The resolution calculus for the proof of unsatisfiability of formulas in conjunctive normal form is sound and complete.

Proof: (omitted from book) Soundness is clear, since any assignment satisfying both parent clauses must also satisfy their resolvent; thus if we derive the empty clause, the original formula was unsatisfiable. What remains is to show that if we perform all possible resolutions and cannot generate the empty clause, then the formula is satisfiable. Fortunately, we can constructively build a model by assigning truth values to the symbols so as to satisfy all clauses.

The construction considers symbols in an arbitrary order and assigns each a truth value, giving a symbol whichever value some clause requires of it given all previously assigned values (if no clause forces a value, we assign an arbitrary truth value to the symbol). We claim this is always feasible; that is, there will never be a symbol that is required to have opposite truth values by two different clauses. Proof by contradiction: assume that sk is the first symbol considered for which there was no legal assignment, and consider two clauses demanding opposite values for sk. The resolvent of those clauses uses only symbols already considered, and thus it should already have been satisfied; but then one of the two original clauses would already be satisfied and would not force a value on sk, a contradiction.

As an extension to this theorem, we can use resolution to determine whether KB ⊨ Q for arbitrary query Q by adding the complement of Q to the knowledge base; if (KB ∧ ¬Q) is unsatisfiable then it must be that KB ⊨ Q.


Horn Clauses

In general, disjunctive clauses have any number of positive and negative literals.

DEFINITION:
A disjunctive clause with at most one positive literal is known as a Horn clause.

Examples of valid Horn clauses: (A), (¬A ∨ ¬B ∨ C), and (¬A ∨ ¬B).

DEFINITION:
A clause with a single positive literal is a fact. (e.g., B)
If a clause has a positive literal and one or more negative literals, the positive literal is called the head of the clause.
If a clause has only negative literals, this is often known as a goal clause (for reasons we will explain below).

It is easier to interpret a Horn clause, such as
(B ∨ ¬A1 ∨ ¬A2 ∨ ... ∨ ¬An)
using the following equivalent formula
(A1 ∧ A2 ∧ ... ∧ An) ⇒ B
and thus we see that a series of facts, A1, A2, ..., An can be used to derive a new fact B.
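This reading is easy to mechanize; a small helper (a sketch; literals are strings, with '~' marking negation):

```python
def horn_to_rule(clause):
    """Split a Horn clause (a set of literal strings) into (premises, head):
    the negated literals become the premises of the implication, and the
    lone positive literal (if any) becomes the head; head is None for a
    goal clause."""
    positives = [lit for lit in clause if not lit.startswith('~')]
    assert len(positives) <= 1, "not a Horn clause"
    premises = {lit[1:] for lit in clause if lit.startswith('~')}
    head = positives[0] if positives else None
    return premises, head

# (B ∨ ¬A1 ∨ ¬A2) reads as A1 ∧ A2 ⇒ B
assert horn_to_rule({'B', '~A1', '~A2'}) == ({'A1', 'A2'}, 'B')
```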

Notice that the resolution of two Horn clauses is guaranteed to itself be a Horn clause. (try it!)

Horn clauses are significant because if we restrict a knowledge base to CNF formulas using only Horn clauses, then we can consider a more efficient calculus for theorem proving. (We will later see that PROLOG is based on the use of Horn clauses for expressing knowledge and queries.)

Forward and Backward Chaining

The first resolution calculus we considered was based on what is known as forward chaining; we started with clauses that we knew to be true (perhaps even including a negated query), and then proceeded to derive additional clauses until reaching our goal or exhausting all possibilities. Unfortunately, this can be time consuming, since we must in some way consider all possible resolutions of pairwise clauses (including newly derived clauses) to determine whether they have a complementary literal, and this may result in an exponential time computation.

In fact, for general CNF formulas, there does not exist any algorithm for testing satisfiability in polynomial time unless P=NP, as the satisfiability problem for CNF formulas was among the first decision problems shown to be NP-complete.

However, when working only with Horn clauses, entailment of a goal can be tested in worst-case linear time. We consider a form of backward chaining in which we start with the goal we wish to prove and then determine what facts we would need in order to prove it. Specifically, we consider a calculus known as Selection rule-driven Linear resolution for Definite clauses (SLD).
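For contrast, the standard counting-based forward-chaining procedure for definite clauses also runs in time linear in the size of the knowledge base; a minimal sketch (the two-rule knowledge base is illustrative):

```python
from collections import deque

def fc_entails(facts, rules, query):
    """Forward chaining for definite clauses.  Each rule is a pair
    (premises, head); count[i] tracks how many premises of rule i remain
    unproven.  Each symbol is processed at most once, so the total work
    is proportional to the size of the knowledge base."""
    count = [len(premises) for premises, _ in rules]
    inferred = set()
    agenda = deque(facts)
    while agenda:
        p = agenda.popleft()
        if p == query:
            return True
        if p in inferred:
            continue
        inferred.add(p)
        for i, (premises, head) in enumerate(rules):
            if p in premises:
                count[i] -= 1
                if count[i] == 0:        # all premises proven: head follows
                    agenda.append(head)
    return False

# illustrative knowledge base: A, B, A ∧ B ⇒ L, L ⇒ Q
rules = [({'A', 'B'}, 'L'), ({'L'}, 'Q')]
assert fc_entails(['A', 'B'], rules, 'Q')
assert not fc_entails(['A'], rules, 'Q')
```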

Consider the following example (using implication syntax to express the clauses equivalently).

Horn clause        Equivalent implication
Q ∨ ¬P             P ⇒ Q
P ∨ ¬L ∨ ¬M        L ∧ M ⇒ P
M ∨ ¬B ∨ ¬L        B ∧ L ⇒ M
L ∨ ¬A ∨ ¬P        A ∧ P ⇒ L
L ∨ ¬A ∨ ¬B        A ∧ B ⇒ L
A                  A
B                  B

In the standard resolution approach, to attempt to prove the query Q, we would add the goal clause ¬Q to the system.

For backward chaining, in order to prove the goal Q, there must be some other clause that has Q as its head; otherwise Q is not entailed. If there are one or more such clauses, we consider them one at a time (in a fixed order); for each, we declare all literals in the conjunctive condition as new subgoals for our proving system.

The relation between the various Horn clauses can be expressed as a directed graph known as an AND-OR graph. It is easier to describe the algorithm when considering such a graph.

Consider again the above example.

Intuitively, the algorithm is performing dynamic programming on the directed AND-OR graph, and therefore its overall running time is proportional to the number of nodes and edges in that graph (which is linear in the size of the original knowledge base).
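A minimal backward chainer over the example's rules can illustrate the idea (a sketch; the visited set guards against the cyclic rules for L and P, and unlike the dynamic-programming view this version does not memoize proved goals):

```python
def backward_chain(goal, rules, facts, visited=frozenset()):
    """Try to prove goal: either it is a known fact, or some rule with
    goal as its head has all of its premises provable as subgoals."""
    if goal in facts:
        return True
    if goal in visited:
        return False          # abandon this path rather than loop forever
    return any(
        head == goal and all(
            backward_chain(p, rules, facts, visited | {goal})
            for p in premises)
        for premises, head in rules)

# the example knowledge base, written as (premises, head) pairs
rules = [({'P'}, 'Q'), ({'L', 'M'}, 'P'), ({'B', 'L'}, 'M'),
         ({'A', 'P'}, 'L'), ({'A', 'B'}, 'L')]
assert backward_chain('Q', rules, {'A', 'B'})
```

With memoization of proved goals added, this search visits each node and edge of the AND-OR graph a bounded number of times, matching the linear-time claim.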


Michael Goldwasser
Last modified: Tuesday, 03 September 2013