Implementing transitive closure in arithmetic

Let T(x,y) be an arithmetical formula with two free variables x and y. Its reflexive and transitive closure T*(x,y) is the smallest predicate satisfying

T*(x,x),
if T(x,y) then T*(x,y),
and if T*(x,y) and T*(y,z) then T*(x,z).

Equivalently, T*(x,y) holds if and only if their exists a nonempty finite sequence of natural numbers a_n ... a₂ a₁ a₀ such that x = a_n, y = a₀ and for i = 0,...,n-1 one has T(a_i,a_i+1).

In this note I will show how to implement T*(x,y) in the language of arithmetic.

Any nonempty finite sequence of natural numbers a_n ... a₂ a₁ a₀ can be encoded as a single number t, namely by selecting a number p that is larger than all of them and regarding a_n...a₂a₁a₀ to be the p-ary representation of t. The sequence 3 5 7 0 1 for instance could be encoded by the number 35701, if one selects p to be 10. The number t is calculated by the formula t = Sum _i=0ⁿ a_i pⁱ. The sequence 3 5 13 0 1 can not be encoded in base 10 however. For this sequence one needs to choose p larger than 13. In general the choice of p depends on the sequence, so the sequence is encoded not just by t, but really by the pair (t,p).

This encoding is not unique however. The pair (35701,10) for instance is the encoding of the sequence 3 5 7 0 1, but it is also the encoding of the sequence 0 3 5 7 0 1. Because uniqueness is important here, I choose (35701,10) to be the encoding of the sequence 3 5 7 0 1 only, i.e. I only encode sequences a_n ... a₂ a₁ a₀ with a_n > 0. Now a pair (0,p) does not encode a sequence, but any pair (t,p) with t > 0 does.

Write Last(t,p,y) if y is the last number in the sequence encoded by t and p, i.e. if y = a₀. The formula Last(t,p,y) can be expressed arithmetically as

y < p there
exists an r (t = rp+y).

Write First(t,p,x) if x is the first number in the sequence encoded by t and p, i.e. if x = a_n. The formula First(t,p,x) can be expressed arithmetically as

0 < x < p there
exists an n there exists an s (t = xpⁿ+s s < pⁿ).

Write Neighbours(t,p,u,v) if u and v are neighbours in the sequence encoded by t and p, i.e. if there is an i < n such that u = a_i+1 and v = a_i. The formula Neighbours(t,p,u,v) can be expressed arithmetically as

u < p v < p there exists an i r s (t = rpⁱ⁺²+upⁱ⁺¹+vpⁱ+s s < pⁱ r+u > 0).

Here the condition u+r > 0 says that either u or r is positive, which is the case exactly when i < n. I couldn't write i < n directly, as the value of n varies with the sequence.

Now the reflexive and transitive closure T*(x,y) of a formula T(x,y) with free variables x and y can almost be defined by

there exists a t p (First(t,p,x) Last(t,p,y) (Neighbours(t,p,u,v) T(u,v)).

This formula says that there is a sequence a_n ... a₂ a₁ a₀ of numbers, such that the first one (a_n) is x, the last one is y, and whenever u and v are neighbours in the sequence, then T(u,v). There is however one mistake in this formula, namely it insists that x > 0. This problem can be solved by encoding the sequence a_n+1 ... a₂+1 a₁+1 a₀+1 as a pair (t,p). Doing so doesn't require a_n to be positive, as a_n+1 is always positive. A correct definition of T*(x,y) therefore is

there exists a t p (First(t,p,x+1) Last(t,p,y+1) (Neighbours(t,p,u+1,v+1) T(u,v)).

The construction above generalizes easily from relations between numbers to relations between vectors of numbers. I will illustrate this for vectors of length 3, but the number 3 plays no special role here. The case of length 3 is what I need in my proof of Goedel's incompleteness theorem.

Let T(x₁,x₂,x₃,y₁,y₂,y₃) be an arithmetical formula with six free variables x₁, x₂, x₃, y₁, y₂ and y₃. Its reflexive and transitive closure T*(x₁,x₂,x₃,y₁,y₂,y₃) is the smallest predicate satisfying

T*(x₁,x₂,x₃,x₁,x₂,x₃),
if T(x₁,x₂,x₃,y₁,y₂,y₃) then T*(x₁,x₂,x₃,y₁,y₂,y₃),
and if T*(x₁,x₂,x₃,y₁,y₂,y₃) and T*(y₁,y₂,y₃,z₁,z₂,z₃) then T*(x₁,x₂,x₃,z₁,z₂,z₃).

Equivalently, T*(x₁,x₂,x₃,y₁,y₂,y₃) holds if and only if their exists three equally long nonempty finite sequences of natural numbers a_n ... a₂ a₁ a₀ and b_n ... b₂ b₁ b₀ and c_n ... c₂ c₁ c₀ such that x₁ = a_n, x₂ = b_n, x₃ = c_n, y₁ = a₀, y₂ = b₀, y₃ = c₀ and for i = 0,...,n-1 one has T(a_i,b_i,c_i,a_i+1,b_i+1,c_i+1).

Next I will show how to implement T*(x₁,x₂,x₃,y₁,y₂,y₃) in the language of arithmetic.

Three equally long nonempty finite sequences of natural numbers a_n ... a₂ a₁ a₀ and b_n ... b₂ b₁ b₀ and c_n ... c₂ c₁ c₀ can be encoded by three numbers t₁, t₂ and t₃, namely by selecting a number p that is larger than all numbers in the three sequences and regarding a_n...a₂a₁a₀ to be the p-ary representation of t₁, and b_n...b₂b₁b₀ to be the p-ary representation of t₂, and c_n...c₂c₁c₀ to be the p-ary representation of t₃. The triple of sequences is thus encoded by the quadruple (t₁,t₂,t₃,p).

In order to make the encoding unique, I only encode sequences in which a_n > 0 and b_n > 0 and c_n > 0.

Write Last(t₁,t₂,t₃,p,y₁,y₂,y₃) if y₁, y₂ and y₃ are the last numbers in the sequences, i.e. if (y₁,y₂,y₃) = (a₀,b₀,c₀). The formula Last(t₁,t₂,t₃,p,y₁,y₂,y₃) can be expressed arithmetically as

y₁ < p there
exists an r₁ (t₁=r₁p+y₁) y₂ < p r₂ (t₂ = r₂p+y₂) y₃ < p r₃ (t₃ = r₃p+y₃).

Write First(t₁,t₂,t₃,p,x₁,x₂,x₃) if x₁, x₂ and x₃ are the first numbers in the sequences, i.e. if (x₁,x₂,x₃) = (a_n,b_n,c_n). The formula First(t₁,t₂,t₃,p,x₁,x₂,x₃) can be expressed arithmetically as

0 < x₁ < p 0 < x₂ < p 0 < x₃ < p there exists an n ( s₁ (t₁ = x₁pⁿ+s₁ s₁ < pⁿ) s₂ (t₂ = x₂pⁿ+s₂ s₂ < pⁿ) s₃ (t₃ = x₃pⁿ+s₃ s₃ < pⁿ) ).

Write Neighbours(t₁,t₂,t₃,p,u₁,u₂,u₃,v₁,v₂,v₃) if (u₁,u₂,u₃) and (v₁,v₂,v₃) are neighbours, i.e. if there is an i < n such that u₁ = a_i+1 and u₂ = b_i+1 and u₃ = c_i+1 and v₁ = a_i and v₂ = b_i and v₃ = a_i. The formula Neighbours(t₁,t₂,t₃,p,u₁,u₂,u₃,v₁,v₂,v₃) can be expressed arithmetically as

u₁ < p u₂ < p u₃ < p v₁ < p v₂ < p v₃ < p
there exists an i ( r₁ s₁ (t₁=r₁pⁱ⁺²+u₁pⁱ⁺¹+v₁pⁱ+s₁ s₁ < pⁱ r₁+u₁ > 0)
r₂ s₂ (t₂ = r₂pⁱ⁺²+u₂pⁱ⁺¹+v₂pⁱ+s₂ s₂ < pⁱ r₂+u₂ > 0)
r₃ s₃ (t₃ = r₃pⁱ⁺²+u₃pⁱ⁺¹+v₃pⁱ+s₃ s₃ < pⁱ r₃+u₃ > 0) ).

Now the reflexive and transitive closure T*(x₁,x₂,x₃,y₁,y₂,y₃) of a formula T(x₁,x₂,x₃,y₁,y₂,y₃) with free variables x₁, x₂, x₃, y₁, y₂ and y₃ can be defined by

there exists a t₁ t₂ t₃ p (First(t₁,t₂,t₃,p,x₁+1,x₂+1,x₃+1) Last(t₁,t₂,t₃,p,y₁+1,y₂+1,y₃+1) (Neighbours(t₁,t₂,t₃,p,u₁+1,u₂+1,u₃+1,v₁+1,v₂+1,v₃+1) T(u₁,u₂,u₃,v₁,v₂,v₃)).

If arithmetic would be defined so as to contain exponentiation, the above implementation would be complete (considering that a < b can be rewritten as there is

k: a+1+k = b).

However, arithmetic only features 0,1,+,x and =. The reason that exponentiation is not included in the definition of arithmetic, is that it doesn't add to the expressiveness of the language: the formula x = yⁿ is expressible in exponentiation-free arithmetic: Let E(i,u,y,j,v,y') be the formula

i = j+1 u = vy y' = y

then x = yⁿ holds iff E*(n,x,y,0,1,y), where E* is the reflexive and transitive closure of E. As this implementation of exponentiation involves the reflexive and transitive closure, is it necessary to express reflexive and transitive closure in exponentiation-free arithmetic. This is done as follows. The statement

"p is prime" there exists an n (q = pⁿ)

can be implemented in arithmetic as

forall d ( d 1 there exists an k (q = kd) k (d = kp) )

which says that for any divisor d of q with d

1, p must be a divisor of d. This implies that p is a prime divisor of q and q has no other prime divisors than p.

The only requirement on the number p in the implementation of reflexive and transitive closure above, is that it is sufficiently large. No generality is lost if one requires p to be prime. Now the formula First(t,p,x) can be reformulated as

0 < x < p there exists an q ("p is prime" n (q = pⁿ) s (t = xq+s s < q)).

Likewise Neighbours(t,p,u,v) can be reformulated as

u < p v < p there exists an q ("p is prime" i (q = pⁱ) r s (t = rqpp+uqp+vq+s s < q r+u > 0)).

In these formulas the use of exponentiation can be eliminated as indicated above. The same holds for the generalization to relations between vectors of numbers (of length 3 or otherwise).

Rob van Glabbeek

rvg@cs.stanford.edu