-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathsecond-syntactic.tex
2751 lines (2477 loc) · 148 KB
/
second-syntactic.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\chapter{Syntactic Metalogic Redux} \label{chap-second}
%% first many sorted
\section{Many-sorted logic}
We now turn to a generalization of first-order logic --- a
generalization which has proven to be surprisingly controversial.
This generalization proceeds by noting that in ordinary first-order
logic, it is implicitly assumed that all syntactic objects are
compatible. For example, for any two terms $s,t$, it makes sense to
write $s=t$; and for any relation symbol $r$, and terms
$t_1,\dots ,t_n$, it makes sense to write $r(t_1,\dots ,t_n)$.
However, that assumption is not obviously warranted. Instead, one
might insist that syntactic entities, such as terms, come with a
\emph{type} or \emph{sort}, and that there are sort-based rules about
how these objects can be combined.
This generalization can provoke two responses that pull in completely
opposite directions. On the one hand, one might think that
many-sorted logic is stronger than single-sorted logic, and hence that
its theoretical commitments outrun those of single-sorted logic. (The
obvious analogy here is with second-order logic.) On the other hand,
some philosophers, such as \cite[267--268]{quine1963}, argue that
many-sorted logic is reducible to single-sorted logic, and hence is
dispensable. If we give pride of place to classical (single-sorted)
first-order logic, then both of these responses would undermine our
motivation to study many-sorted logic. However, the presuppositions
of these two responses cannot both be correct, i.e.\ many-sorted logic
cannot both exceed the resources of single-sorted logic, and also be
reducible to it. So which view is the right one?
The view we will advance here is that many-sorted logic is, in one
clear sense, reducible to single-sorted logic; but that this reduction
does not mean that many-sorted logic is dispensible. Before we take
up this argument, we need to explain how many-sorted logic works.
\begin{defn} A many-sorted \textbf{signature} $\Sigma$ is a set of
sort symbols, predicate symbols, function symbols, and constant
symbols. $\Sigma$ must have at least one sort symbol. Each
predicate symbol $p\in\Sigma$ has an \textbf{arity}
$\sigma_1\times\ldots\times\sigma_n$, where
$\sigma_1,\ldots, \sigma_n\in\Sigma$ are (not necessarily distinct)
sort symbols. Likewise, each function symbol $f\in\Sigma$ has an
\textbf{arity}
$\sigma_1\times\ldots\times\sigma_n\rightarrow\sigma$, where
$\sigma_1,\ldots, \sigma_n,\sigma\in\Sigma$ are again (not
necessarily distinct) sort symbols. Lastly, each constant symbol
$c\in\Sigma$ is assigned a sort $\sigma\in\Sigma$. In addition to
the elements of $\Sigma$ we also have a stock of variables. We use
the letters $x$, $y$, and $z$ to denote these variables, adding
subscripts when necessary. Each variable has a sort
$\sigma\in\Sigma$. \end{defn}
\begin{note} The symbol $\sigma _1\times \cdots \times \sigma _n$ has
no intrinsic meaning. To say that, ``$p$ has arity
$\sigma _1\times \cdots \times \sigma _n$'' is simply an abbreviated
way of saying that $p$ can be combined with $n$ terms, whose sorts
must respectively be $\sigma _1,\dots ,\sigma _n$. \end{note}
A $\Sigma$-term can be thought of as a ``naming expression'' in the
signature $\Sigma$. Each $\Sigma$-term has a sort $\sigma\in\Sigma$.
\begin{defn} The \textbf{$\mathbf{\Sigma}$-terms} of sort $\sigma$ are
recursively defined as follows. Every variable of sort $\sigma$ is a
$\Sigma$-term of sort $\sigma$, and every constant symbol
$c\in\Sigma$ of sort $\sigma$ is also a $\Sigma$-term of sort
$\sigma$. Furthermore, if $f\in\Sigma$ is a function symbol with
arity $\sigma_1\times\ldots\times\sigma_n\rightarrow\sigma$ and
$t_1,\ldots, t_n$ are $\Sigma$-terms of sorts
$\sigma_1, \ldots, \sigma_n$, then $f(t_1,\ldots, t_n)$ is a
$\Sigma$-term of sort $\sigma$. We will use the notation
$t(\vec{x})$ to denote a $\Sigma$-term in which all of the variables
that appear in $t$ are in the sequence
$\vec{x}\equiv x_1,\ldots, x_n$, but we leave open the possibility
that some of the $x_i$ do not appear in the term $t$. \end{defn}
A \textbf{$\mathbf{\Sigma}$-atom} is an expression either of the form
$s(x_1,\ldots, x_n)=t(x_1,\ldots, x_n)$, where $s$ and $t$ are
$\Sigma$-terms of the same sort $\sigma\in\Sigma$, or of the form
$p(t_1,\ldots, t_n)$, where $t_1,\ldots, t_n$ are $\Sigma$-terms of
sorts $\sigma_1, \ldots, \sigma_n$ and $p\in\Sigma$ is a predicate of
arity $\sigma_1\times\ldots\times\sigma_n$.
\begin{defn} The \textbf{$\mathbf{\Sigma}$-formulas} are defined
recursively as follows.
\begin{itemize}
\item Every $\Sigma$-atom is a $\Sigma$-formula.
\item If $\phi$ is a $\Sigma$-formula, then $\lnot\phi$ is a $\Sigma$-formula.
\item If $\phi$ and $\psi$ are $\Sigma$-formulas, then
$\phi\rightarrow\psi$, $\phi\land\psi$, $\phi\lor\psi$ and
$\phi\leftrightarrow\psi$ are $\Sigma$-formulas.
\item If $\phi$ is a $\Sigma$-formula and $x$ is a variable of sort
$\sigma\in\Sigma$, then $\forall_\sigma x\phi$ and
$\exists_\sigma x\phi$ are $\Sigma$-formulas.
\end{itemize} \end{defn}
In addition to the above formulas, we will use the notation
$\exists_{\sigma=1}y \phi(x_1,\ldots, x_n, y)$ to abbreviate the
formula
\[ \exists_\sigma y(\phi(x_1,\ldots, x_n, y)\land\forall_\sigma
z(\phi(x_1,\ldots, x_n,z)\rightarrow y=z)) .\] As above, the
notation $\phi(\vec{x})$ will denote a $\Sigma$-formula $\phi$ in
which all of the free variables appearing in $\phi$ are in the
sequence $\vec{x}\equiv x_1,\ldots, x_n$, but we again leave open the
possibility that some of the $x_i$ do not appear as free variables in
$\phi$.
\begin{defn} A $\mathbf{\Sigma}$\textbf{-sentence} is a
$\Sigma$-formula that has no free variables. \end{defn}
We will not give an explicit listing of the derivation rules for
many-sorted logic. Suffice it to say that they are direct
generalizations of the derivation rules for single-sorted logic,
provided that one observe all restrictions on syntactic compatibility.
For example, in many sorted logic, we can infer $\forall x\phi (x)$
from $\phi (y)$ only if the variables $x$ and $y$ are of the same
type. If they were not of the same type, then one of these two
formulas would fail to be well-formed.
As a result of these restrictions, we need to exercise some caution
about carrying over intuitions that we might have developed in using
single-sorted logic. For example, in single-sorted logic, for any two
terms $s$ and $t$, we have a tautology
\[ \vdash (s=t)\vee (s\neq t) .\] However, in many sorted logic, the
expressions $s=t$ and $s\neq t$ are well-formed only when $s$ and $t$
are terms of the same sort. Thus, to the question: ``do $s$ and $t$
denote the same object'', many-sorted logic sometimes offers no
answer.
One might be tempted nonetheless to think that if $s$ and $t$ are
terms of of different types, then they should, by default, be
considered to denote different things. However, that answer is
neither mandatory, nor is it safe. For example, suppose that we begin
with a theory $T$ which has as an axiom:
\[ \forall x^{\sigma} (x\neq s\to p(x)) .\] The suggestion above is to
extend the relation $\neq$ to include the terms $s$ and $t$. But can
we then apply the axiom, and derive $p(t)$? Before answering this
question, imagine that $\sigma$ is the sort of natural numbers, that
$s$ denotes $0$, and that $p$ is the predicate $x>0$. Suppose also
that $\sigma '$ is the sort of physical objects, and that $t$ denotes
my right pinky finger. Is it safe to affirm that my right pinky
finger satisfies the predicate $x>0$?
%% TO DO: examples of many sorted theories
%% category theory, vector spaces, geometry, and some really
%% simple ones
\begin{example} \label{two-sort} Let
$\Sigma = \{ \sigma _1,\sigma _2 \}$, and let $T$ be the empty
theory in $\Sigma$. Note that both $\exists _{\sigma _1}x(x=x)$ and
$\exists _{\sigma _2}y(y=y)$. This might seem like a strange
consequence: $T$ is the empty theory, and you might think that the
empty theory should have no non-trivial consequences. But the
combination of $\exists _{\sigma _1}x(x=x)$ and
$\exists _{\sigma _2}y(y=y)$ seems like a non-trivial consequence,
viz.\ that there are at least two things!
However, there is a mistake in our reasoning. Those two sentences
together do not imply that there are at least two things. For,
there is no third quantifier $\exists$ such that
$\exists v\exists w(v\neq w)$ is guaranteed to hold.
These considerations show that distinct sort symbols do not
necessarily represent different kinds of things. Indeef, it is not
generally valid to infer that there are $n+m$ objects from the fact
that there are $n$ objects of sort $\sigma _1$ and $m$ objects of
sort $\sigma _2$. \end{example}
\begin{example} Let $\Sigma = \{ \sigma _1,\sigma _2,i\}$, where
$i:\sigma _1\to\sigma _2$. Let $T$ be the theory that says that $i$
is bijective; that is, $i$ is injective:
\[ (i(x)=i(y))\to x=y ,\] and $i$ is surjective:
\[ \exists x(i(x)=z) .\] Then $T$ defines a functional relation
$\phi$ of sort $\sigma _2\times \sigma _1$ by means of
\[ \phi (z,x) \: \lra \: (i(x)=z) .\] The function
$j:\sigma _2\to\sigma _1$ corresponding to $\phi$ is the inverse of
$i$. \end{example}
%% TO DO: come back to this example when we do equivalence. Show that
%% $T$ is equivalent to a single sorted theory. TO DO: it's also cool
%% to look at the symmetries of a model of this theory. Since sigma1
%% is locked down to sigma2, the symmetries have to work in tandem!
\begin{example} \label{ex-cats} The theory of categories can
conveniently be formulated as a many-sorted theory. Let
$\Sigma = \{O,A,d_0,d_1,i,\circ \}$, where $O$ and $A$ are sorts,
$d_0:A\to O$, $d_1:A\to O$, $i:O\to A$, and $\circ$ is a relation of
sort $A\times A\times A$. (The relation $\circ$ is used as the
composition function on arrows, i.e.\ a partial function defined for
compatible arrows.) We will leave it as an exercise for the reader
to write down the axioms corresponding to the following ideas:
\begin{enumerate}
\item For each arrow $f$, $d_0f$ is the domain object, and $d_1f$ is
the codomain object. Thus, we may write $f:d_0f\to d_1f$. More
generally, we write $f:x\to y$ to indicate that $x=d_0f$ and
$y=d_1f$. The function $\circ$ is defined on pairs of arrows
where the first arrow's domain matches the second arrow's
codomain.
\item The function $\circ$ is associative.
\item For each object $x$, $i(x):x\to x$. Moreover, for any arrow
$f$ such that $d_1f=x$, we have $i(x)\circ f=f$. And for any
arrow $g$ such that $d_0g=x$, we have $g\circ
i(x)=g$. \end{enumerate} \end{example}
%% TO DO: remember, we are also going to talk about this under the
%% heading symmetry -- the duality in flipping direction of arrows.
%% This is related to that James Hawthorne thing -- using many-sorted
%% logic to separate un/observable
\begin{disc} What can many-sorted logic do for us? In pure
mathematics, it can certainly have pragmatic advantages to introduce
sorts. For example, in axiomatizing category theory, it seems more
intuitive to think about objects and arrows as different sorts of
things, rather than introducing some predicate that is satisfied by
objects but not by arrows. Similarly, in axiomatizing the theory of
vector spaces, it is convenient to think of vectors and scalars as
different sorts of things. Indeed, in this latter case, it's hard
to imagine a mathematician investigating the question: ``is $c$ a
scalar or a vector''? Instead, it seems that general words like
``vector'' and ``scalar'' function more like labels than they do as
names of properties that mathematicians are interested in
investigating.
But what about empirical theories? Could a many-sorted formulation
of an empirical theory provide a more perspicious representation of
the structure of reality? Let's focus on a more specific question,
that was central to 20th century philosophy of science: can the
un/observable distinction be encoded in the syntax of a theory?
Suppose then that in formulating a theory $T$, we begin by
introducing a sort symbol $O$ for observable objects, and a sort
symbol $P$ for theoretical objects. Then, any relation symbol $R$
must be explicitly sorted, i.e.\ each slot after $R$ can be occupied
only by terms of one particular sort. Similarly, in this language,
formulas such as $t=t'$ and $t\neq t'$ are well-formed only if $t$
and $t'$ are terms of the same type. It should be clear now that
this language does not have a predicate ``is unobservable'', nor
does it have any well-formed expression corresponding to the
sentence:
\begin{quote} ($\ast$) No theoretical entity (i.e.\ entity of type
$P$) is an observable entity (i.e.\ entity of type
$O$). \end{quote} The grammatical malformity of ($\ast$) is
sometimes brushed right over in criticisms of the syntactic view
\cite[e.g.][]{bas1980}, and in criticisms of the observation-theory
distinction \cite[e.g.][]{dick}.
\end{disc}
%% TO DO: modifying inference rules
%% TO DO: observable objects and sorts -- thwarting the van Fraassen /
%% Putnam reductio
\section{Morita extension and equivalence}
\citet{glymour1970} claims that definitional equivalence (see
\ref{df:cde}) is a necessary condition on the equivalence of
scientific theories. However, there are several reasons to believe
that this criterion is too strict.
First, it is frequently argued that many-sorted logic is reducible to
single-sorted logic \citep[see][]{schmidt1951,manzano-book}. What is
actually shown in these arguments is that for any many-sorted theory
$T$, a corresponding single-sorted theory $T'$ can be constructed.
But what is the relation between $T$ and $T'$? Obviously, the two
theories $T$ and $T'$ cannot be definitionally equivalent, since that
criterion applies only to single-sorted theories. Therefore, to make
sense of the claim that many-sorted logic can be reduced to
single-sorted logic, we need a generalization of definitional
equivalence. %% TO DO: make sure this is clearly defined
Second, there are well known examples of theories that could naturally
be formulated either within a single-sorted framework, or within a
many-sorted framework --- and we need a generalization of definitional
equivalence to explain in what sense these two formulations are
equivalent. For example, category theory can be formulated as a
many-sorted theory, using both a sort of ``objects'' and a sort of
``arrows'' \citep{eilenbergmaclane1942, eilenbergmaclane1945}; and
category theory can also be formulated as a single-sorted theory using
only ``arrows'' \citep{maclane1948}. [\citet[p.~5]{freyd1964} and
\citet[p.~9]{cwm} also describe this alternate formulation.] These
two formulations of category theory are in some sense equivalent, and
we would like an account of this more general notion of equivalence.
Third, definitional equivalence is too restrictive even for
single-sorted theories. For example, affine geometry can be
formalized in a way that quantifies over points; or it can be
formalized in a way that quantifies over lines
\citep[see][]{schwabhauser1983}. But saying that the point theory
($T_p$) and the line theory ($T_\ell$) both are formulations of the
same theory indicates again that $T_p$ and $T_\ell$ are in some sense
equivalent --- although $T_p$ and $T_\ell$ are {\it not}
definitionally equivalent. Indeed, the smallest model of $T_p$ has
five elements, which we can think of as the four corners of a square,
and its center point. On the other hand, the smallest model of
$T_\ell$ has six elements. But if $T_p$ and $T_\ell$ were
definitionally equivalent, then every model $M$ of $T_\ell$ would be
the reduct of an expansion of a model $M'$ of $T_p$
\citep{debouvere1965}. In particular, we would have $|M|=|M'|$, which
entails that $T_\ell$ has a model of cardinality five --- a
contradiction. Therefore, $T_p$ and $T_\ell$ are not definitionally
equivalent.
Finally, even if we ignore the complications mentioned above, and even
if we assume that each many-sorted theory $T$ can be replaced by a
single-sorted variant $T'$ (by the standard procedure of unifying
sorts), definitional equivalence is still inadequate. Consider the
following example.
\begin{example} Let $T_1$ be the objects-and-arrows formulation of
category theory, and let $T_2$ be the arrows-only formulation of
category theory. Intuitively, $T_1$ and $T_2$ are equivalent
theories; but their single-sorted versions $T_1'$ and $T_2'$ are not
definitionally equivalent. Indeed, $T_2'=T_2$, since $T_2$ is
single-sorted. However, $T_1'$ has a single sort that includes both
objects and arrows. Thus, while $T_2'$ has a model with one element
(i.e.\ the category with a single arrow), $T_1'$ has no models with
one element (since every model of $T_1'$ has at least one object and
at least one arrow). Therefore, $T_1'$ and $T_2'$ are not
definitionally equivalent. \end{example}
These examples all show that definitional equivalence does not capture
the sense in which some theories are equivalent. If one wants to
capture this sense, one needs a more general criterion for theoretical
equivalence than definitional equivalence. Our aim here is to
introduce one such criterion. We will call it \emph{Morita
equivalence}.\footnote{The concept of Morita equivalence --- if not
the name --- is already familiar in certain circles of
logicians. See \cite{andreka2008} and \cite{mere1992}. The name
``Morita equivalence'' descends from Kiiti Morita's work on rings
with equivalent categories of modules. Two rings $R$ and $S$ are
said to be Morita equivalent just in case there is an equivalence
$\text{Mod}(R)\cong \text{Mod}(S)$ between their categories of
modules. The notion was generalized from rings to algebraic
theories by \cite{dukarm}. See also \cite{adamek}. There is also a
notion of Morita equivalence for $C^*$-algebras, see
\cite{rieffel}. More recently, topos theorists have defined theories
to be Morita equivalent just in case their classifying toposes are
equivalent \citep{johnstone}. See \cite{tsementzis} for a comparison
of the topos-theoretic notion of Morita equivalence with ours.} This
criterion is a natural generalization of definitional equivalence. In
fact, Morita equivalence is essentially the same as definitional
equivalence, except that it allows one to define new sort symbols in
addition to new predicate symbols, function symbols, and constant
symbols. In order to state the criterion precisely, we again need to
do some work. We begin by defining the concept of a Morita extension.
In Chapter \ref{ch:sem2} we will show the sense in which Morita
equivalence is a natural generalization of definitional equivalence.
As we did for predicates, functions, and constants, we need to say how
to define new sorts. Let $\Sigma\subseteq\Sigma^+$ be signatures and
consider a sort symbol $\sigma\in\Sigma^+\backslash\Sigma$. One can
define the sort $\sigma$ as a product sort, a coproduct sort, a
subsort, or a quotient sort. In each case, one defines $\sigma$ using
old sorts in $\Sigma$ and new function symbols in
$\Sigma^+\backslash\Sigma$. These new function symbols specify how the
new sort $\sigma$ is related to the old sorts in $\Sigma$. We describe
these four cases in detail.
\begin{tomt} In order to define $\sigma$ as a product sort, one needs
two function symbols $\pi_1,\pi_2\in\Sigma^+\backslash\Sigma$ with
$\pi_1$ of arity $\sigma\rightarrow\sigma_1$, $\pi_2$ of arity
$\sigma\rightarrow\sigma_2$, and $\sigma_1,\sigma_2\in\Sigma$. The
function symbols $\pi_1$ and $\pi_2$ serve as the ``canonical
projections'' associated with the product sort $\sigma$. A sort
definition of the symbols $\sigma, \pi_1$, and $\pi_2$ as a
\textbf{product sort} in terms of $\Sigma$ is a $\Sigma^+$-sentence
of the form
\[ \forall_{\sigma_1}x\forall_{\sigma_2}
y\exists_{\sigma=1}z(\pi_1(z)=x\land\pi_2(z)=y) . \]
One should think of a product sort $\sigma$ as the sort whose elements
are ordered pairs, where the first element of each pair is of sort
$\sigma_1$ and the second is of sort $\sigma_2$.
% This sentence defines $\sigma$ as the product sort of $\sigma_1$ and
% $\sigma_2$ with corresponding projections $\pi_1$ and $\pi_2$.
\end{tomt}
\begin{tomt} One can also define $\sigma$ as a coproduct sort. One
again needs two function symbols
$\rho_1,\rho_2\in\Sigma^+\backslash\Sigma$ with $\rho_1$ of arity
$\sigma_1\rightarrow\sigma$, $\rho_2$ of arity
$\sigma_2\rightarrow\sigma$, and $\sigma_1,\sigma_2\in\Sigma$. The
function symbols $\rho_1$ and $\rho_2$ are the ``canonical
injections'' associated with the coproduct sort $\sigma$. A sort
definition of the symbols $\sigma, \rho_1$, and $\rho_2$ as a
\textbf{coproduct sort} in terms of $\Sigma$ is a
$\Sigma^+$-sentence of the form
\begin{gather*}
\forall_\sigma z\big(\exists_{\sigma_1=1}x(\rho_1(x)=z)\lor\exists_{\sigma_2=1} y(\rho_2(y)=z)\big)
\land \forall_{\sigma_1} x\forall_{\sigma_2} y\lnot\big(\rho_1(x)=\rho_2(y)\big)
\end{gather*}
%This sentence defines $\sigma$ as a coproduct sort of $\sigma_1$ and $\sigma_2$ with corresponding injections $\rho_1$ and $\rho_2$.
One should think of a coproduct sort $\sigma$ as the disjoint union of
the elements of sorts $\sigma_1$ and $\sigma_2$. \end{tomt}
When defining a new sort $\sigma$ as a product sort or a coproduct
sort, one uses two sort symbols in $\Sigma$ and two function symbols
in $\Sigma^+\backslash\Sigma$. The next two ways of defining a new
sort $\sigma$ only require one sort symbol in $\Sigma$ and one
function symbol in $\Sigma^+\backslash\Sigma$.
\begin{tomt} In order to define $\sigma$ as a subsort, one needs a
function symbol $i\in\Sigma^+\backslash\Sigma$ of arity
$\sigma\rightarrow\sigma_1$ with $\sigma_1\in\Sigma$. The function
symbol $i$ is the ``canonical inclusion'' associated with the
subsort $\sigma$. A sort definition of the symbols $\sigma$ and $i$
as a \textbf{subsort} in terms of $\Sigma$ is a $\Sigma^+$-sentence
of the form
\begin{gather}
\label{subsortdefinition}
\forall_{\sigma_1}x\big(\phi(x)\leftrightarrow\exists_\sigma
z(i(z)=x)\big) \land\forall_\sigma z_1\forall_\sigma
z_2\big(i(z_1)=i(z_2)\rightarrow z_1=z_2\big)
\end{gather}
where $\phi(x)$ is a $\Sigma$-formula. One can think of the subsort
$\sigma$ as consisting of ``the elements of sort $\sigma_1$ that are
$\phi$.'' The sentence \eqref{subsortdefinition} entails the
$\Sigma$-sentence $\exists_{\sigma_1} x \phi(x)$. As before, we will
call this $\Sigma$-sentence the \textbf{admissibility condition} for
the definition \eqref{subsortdefinition}. \end{tomt}
\begin{tomt} Lastly, in order to define $\sigma$ as a quotient sort
one needs a function symbol $\epsilon\in\Sigma^+\backslash\Sigma$ of
arity $\sigma_1\rightarrow\sigma$ with $\sigma_1\in\Sigma$. A sort
definition of the symbols $\sigma$ and $\epsilon$ as a
\textbf{quotient sort} in terms of $\Sigma$ is a $\Sigma^+$-sentence
of the form
\begin{equation}
\label{quotientdefinition}
\forall_{\sigma_1} x_1\forall_{\sigma_1}x_2\big(\epsilon(x_1)=\epsilon(x_2)\leftrightarrow\phi(x_1,x_2)\big) \land\forall_\sigma z\exists_{\sigma_1}x(\epsilon(x)=z)
\end{equation}
where $\phi(x_1,x_2)$ is a $\Sigma$-formula. This sentence defines
$\sigma$ as a quotient sort that is obtained by ``quotienting out''
the sort $\sigma_1$ with respect to the formula $\phi(x_1, x_2)$. The
sort $\sigma$ should be thought of as the set of ``equivalence classes
of elements of $\sigma_1$ with respect to the relation
$\phi(x_1,x_2)$.'' The function symbol $\epsilon$ is the ``canonical
projection'' that maps an element to its equivalence class. One can
verify that the sentence \eqref{quotientdefinition} implies that
$\phi(x_1,x_2)$ is an equivalence relation. In particular, it entails
the following $\Sigma$-sentences:
$$
\begin{aligned}
&\forall_{\sigma_1}x(\phi(x,x))\\
&\forall_{\sigma_1}x_1\forall_{\sigma_1}x_2(\phi(x_1,x_2)\rightarrow\phi(x_2, x_1))\\
&\forall_{\sigma_1}x_1\forall_{\sigma_1}x_2\forall_{\sigma_1}x_3\big((\phi(x_1,x_2)\land\phi(x_2,x_3))\rightarrow\phi(x_1, x_3)\big)
\end{aligned}
$$
These $\Sigma$-sentences are the \textbf{admissibility conditions} for
the definition \eqref{quotientdefinition}. \end{tomt}
Now that we have presented the four ways of defining new sort symbols,
we can define the concept of a Morita extension. A Morita extension is
a natural generalization of a definitional extension. The only
difference is that now one is allowed to define new sort symbols.
\begin{defn} Let $\Sigma\subset\Sigma^+$ be signatures and $T$ a
$\Sigma$-theory. A \textbf{Morita extension} of $T$ to the signature
$\Sigma^+$ is a $\Sigma^+$-theory
\[ T^+=T\cup\{\delta_s:s\in\Sigma^+\backslash\Sigma\} \]
that satisfies the following conditions. First, for each symbol
$s\in\Sigma^+-\Sigma$ the sentence $\delta_s$ is an explicit
definition of $s$ in terms of $\Sigma$. Second, if
$\sigma\in\Sigma^+\backslash\Sigma$ is a sort symbol and
$f\in \Sigma^+\backslash\Sigma$ is a function symbol that is used in
the sort definition of $\sigma$, then $\delta_f=\delta_\sigma$. (For
example, if $\sigma$ is defined as a product sort with projections
$\pi_1$ and $\pi_2$, then
$\delta_\sigma=\delta_{\pi_1}=\delta_{\pi_2}$.)
% Second, if $\sigma\in\Sigma^+-\Sigma$ is a sort symbol and
% $f\in\Sigma^+-\Sigma$ is a function symbol that is used in the
% explicit definition of $\sigma$ (that is, if $f$ is one of the
% function symbols $\pi_1, \pi_2, \rho_1,\rho_2, i,$ or $\epsilon$
% described above), then $\delta_\sigma=\delta_f$.\footnote{This
% requirements is to prevent one from defining a function symbol as,
% for example, a projection function for a product sort, and also
% defining the same function symbol in some other way.}
And third, if $\alpha_s$ is an admissibility condition for a
definition $\delta_s$, then $T\vdash\alpha_s$. \end{defn}
Note that unlike a definitional extension of a theory, a Morita
extension can have more sort symbols than the original
theory.\footnote{Also note that if $T^+$ is a Morita extension of $T$
to $\Sigma^+$, then there are restrictions on the arities of
predicates, functions, and constants in
$\Sigma^+\backslash\Sigma$. If $p\in\Sigma^+\backslash\Sigma$ is a
predicate symbol of arity $\sigma_1\times\ldots\times\sigma_n$, we
immediately see that $\sigma_1,\ldots,\sigma_n\in\Sigma$. Taking a
single Morita extension does not allow one to define predicate
symbols that apply to sorts that are not in $\Sigma$. One must take
multiple Morita extensions to do this. Likewise, any constant
symbol $c\in\Sigma^+\backslash\Sigma$ must be of sort
$\sigma\in\Sigma$. And a function symbol
$f\in\Sigma^+\backslash\Sigma$ must either have arity
$\sigma_1\times\ldots\times\sigma_n\rightarrow\sigma$ with
$\sigma_1,\ldots,\sigma_n,\sigma\in\Sigma$, or $f$ must be one of
the function symbols that appears in the definition of a new sort
symbol $\sigma\in\Sigma^+\backslash\Sigma$.} The following is a
particularly simple example of a Morita extension.
\begin{example}
\label{extensionexample}
Let $\Sigma=\{\sigma, p\}$ and $\Sigma^+=\{\sigma, \sigma^+, p, i\}$
be a signatures with $\sigma$ and $\sigma^+$ sort symbols, $p$ a
predicate symbol of arity $\sigma$, and $i$ a function symbol of arity
$\sigma^+\rightarrow\sigma$. Consider the $\Sigma$-theory
$T=\{\exists_\sigma x p(x)\}$. The following $\Sigma^+$-sentence
defines the sort symbol $\sigma^+$ as the subsort consisting of ``the
elements that are $p$.''
\begin{align*}
&\forall_{\sigma}x\big(p(x)\leftrightarrow\exists_{\sigma^+} z(i(z)=x)\big)\land\forall_{\sigma^+}z_1\forall_{\sigma^+}z_2\big(i(z_1)=i(z_2)\rightarrow z_1=z_2\big)\tag{$\delta_{\sigma^+}$}
\end{align*}
The $\Sigma^+$-theory $T^+=T\cup\{\delta_{\sigma^+}\}$ is a Morita
extension of $T$ to the signature $\Sigma^+$. The theory $T^+$ adds to
the theory $T$ the ability to quantify over the set of ``things that
are $p$.''
\end{example}
\begin{defn}
Let $T_1$ be a $\Sigma_1$-theory and $T_2$ a $\Sigma_2$-theory. $T_1$ and $T_2$ are \textbf{Morita equivalent} if there are theories $T_1^1, \ldots, T_1^n$ and $T_2^1,\ldots, T_2^m$ that satisfy the following three conditions:
\begin{itemize}
\item Each theory $T_1^{i+1}$ is a Morita extension of $T_1^{i}$,
\item Each theory $T_2^{i+1}$ is a Morita extension of $T_2^i$,
\item $T_1^n$ and $T_2^m$ are logically equivalent $\Sigma$-theories
with $\Sigma_1\cup\Sigma_2\subseteq\Sigma$.
\end{itemize}
\end{defn}
Two theories are Morita equivalent if they have a ``common Morita
extension.'' The situation can be pictured as follows, where each
arrow in the figure indicates a Morita extension.
\begin{center}
\includegraphics{moritaequivalence.pdf}
\end{center}
At first glance, Morita equivalence might strike one as different from
definitional equivalence in an important way. To show that theories
are Morita equivalent, one is allowed to take any finite number of
Morita extensions of the theories. On the other hand, to show that two
theories are definitionally equivalent, it appears that one is only
allowed to take \textit{one} definitional extension of each
theory. One might worry that Morita equivalence is therefore not
perfectly analogous to definitional equivalence.
Fortunately, this is not the case. By Theorem \ref{cde-it}, if $T'$
is a definitional extension of $T$, then $T$ and $T'$ are
intertranslatable. Clearly intertranslatability is a transitive
relation, and in Theorem \ref{it-cde}, we will see that if two
theories are intertranslatable, then they are definitionally
equivalent. Therefore, if theories $T_1, \ldots, T_n$ are such that
each $T_{i+1}$ is a definitional extension of $T_i$, then $T_n$ is in
fact a definitional extension of $T_1$. (One can easily verify that
this is not true of Morita extensions.) To show that two theories are
definitionally equivalent, therefore, one actually \textit{is} allowed
to take any finite number of definitional extensions of each theory.
If two theories are definitionally equivalent, then they are trivially
Morita equivalent. Unlike definitional equivalence, however, Morita
equivalence is capable of capturing a sense in which theories with
different sort symbols are equivalent. The following example
demonstrates that Morita equivalence is a more liberal criterion for
theoretical equivalence.
\begin{example}
Let $\Sigma _1 =\{ \sigma _1,p,q\}$ and $\Sigma _2=\{ \sigma
_2,\sigma _3 \}$ be signatures with $\sigma _i$ sort symbols, and
$p$ and $q$ predicate symbols of arity $\sigma _1$. Let $T_1$ be
the $\Sigma_1$-theory that says: $p$ and $q$ are non-empty, mutually
exclusive, and exhaustive. Let $T_2$ be the empty theory in $\Sigma
_2$. Since the signatures $\Sigma _1$ and $\Sigma _2$ have
different sort symbols, $T_1$ and $T_2$ can't possibly be
definitionally equivalent. Nonetheless, it's easy to see that $T_1$
and $T_2$ are Morita equivalent. Let
$\Sigma=\Sigma_1\cup\Sigma_2\cup\{i_2, i_3\}$ be a signature with
$i_2$ and $i_3$ function symbols of arity
$\sigma_2\rightarrow\sigma_1$ and
$\sigma_3\rightarrow\sigma_1$. Consider the following
$\Sigma$-sentences.
\begin{align*}
%
&\begin{aligned}
&\forall_{\sigma_1}x\big(p(x)\leftrightarrow\exists_{\sigma_2} y(i_2(y)=x)\big)\\
&\qquad\land\forall_{\sigma_2} y_1\forall_{\sigma_2} y_2\big(i_2(y_1)=i_2(y_2)\rightarrow y_1=y_2\big)\\
\end{aligned}
\tag{$\delta_{\sigma_2}$}\\
%
&\begin{aligned}
&\forall_{\sigma_1}x\big(q(x)\leftrightarrow\exists_{\sigma_3} z(i_3(z)=x)\big)\\
&\qquad\land\forall_{\sigma_3} z_1\forall_{\sigma_3} z_2\big(i_3(z_1)=i_3(z_2)\rightarrow z_1=z_2\big)\\
\end{aligned}
\tag{$\delta_{\sigma_3}$}\\
%
%&\forall_{\sigma_1} x\forall_{\sigma_2} y(\rho_1(y)=x\leftrightarrow i_2(y)=x)
%\tag{$\delta_{\rho_1}$}\\
%
%&\forall_{\sigma_1} x\forall_{\sigma_3} z(\rho_2(z)=x\leftrightarrow i_3(z)=x)
%\tag{$\delta_{\rho_2}$}\\
%
&\begin{aligned}
&\forall_{\sigma_1} x\big(\exists_{\sigma_2=1}y(i_2(y)=x)\lor\exists_{\sigma_3=1} z(i_3(z)=x)\big)\\
&\qquad\land \forall_{\sigma_2} y\forall_{\sigma_3} z \lnot\big(i_2(y)=i_3(z)\big)\\
\end{aligned}
\tag{$\delta_{\sigma_1}$}\\
%
&\begin{aligned}
&\forall_{\sigma_1} x\big(p(x)\leftrightarrow\exists_{\sigma_2} y(i_2(y)=x)\big)
\end{aligned}
\tag{$\delta_p$}\\
%
&\begin{aligned}
&\forall_{\sigma_1} x\big(q(x)\leftrightarrow\exists_{\sigma_3} z(i_3(z)=x)\big)
\end{aligned}
\tag{$\delta_q$}
\end{align*}
%We define a Morita extension $T_1^1$ of $T_1$ to the signature $\Sigma_1^1=\Sigma_1\cup\{\sigma_2, \sigma_3, i_2, i_3\}$, where $\sigma_2$ and $\sigma_3$ are sort symbols and $i_2$ and $i_3$ are function symbols of arity $\sigma_2\rightarrow\sigma_1$ and $\sigma_3\rightarrow\sigma_1$.
%The following $\Sigma_1^1$-sentences are explicit definitions of these two new sort symbols as subsorts.
%\begin{align*}
%\delta_{\sigma_2}&:=\forall_{\sigma_1}x\big(p(x)\leftrightarrow\exists_{\sigma_2} y(i_2(y)=x)\big)
%\land\forall_{\sigma_2} y_1\forall_{\sigma_2} y_2\big(i_2(y_1)=i_2(y_2)\rightarrow y_1=y_2\big)\\
%\delta_{\sigma_3}&:=\forall_{\sigma_1}x\big(q(x)\leftrightarrow\exists_{\sigma_3} z(i_2(z)=x)\big)
%\land\forall_{\sigma_3} z_1\forall_{\sigma_3} z_2\big(i_2(z_1)=i_2(z_2)\rightarrow z_1=z_2\big)
%\end{align*}
%$T_1$ uses the sentence $\delta_{\sigma_2}$ to define the symbols $\sigma_2$ and $i_2$ as the subsort of ``things that are $p$'' and the sentence $\delta_{\sigma_3}$ to define $\sigma_3$ and $i_3$ as the subsort of ``things that are $q$.''
%The axioms of $T_1$ guarantee that the admissibility conditions for these definitions are satisfied.
The $\Sigma$-theory $T_1^1=T_1\cup\{\delta_{\sigma_2},
\delta_{\sigma_3}\}$ is a Morita extension of $T_1$ to the signature
$\Sigma$. It defines $\sigma_2$ to be the subsort of ``elements that
are $p$'' and $\sigma_3$ to be the subsort of ``elements that are
$q$.''
%We now define another Morita extension of $T_1^1$ to the signature $\Sigma_1^2=\Sigma_1^1\cup\{\rho_1, \rho_2\}$, where $\rho_1$ and $\rho_2$ are function symbols of arity $\sigma_2\rightarrow\sigma_1$ and $\sigma_3\rightarrow\sigma_1$. The theory $T_1^1$ uses the sentences $\delta_{\rho_1}$ and $\delta_{\rho_2}$ to define these new symbols.
%defined by the following sentences.
%\begin{align*}
%\delta_{\rho_1}&:=\forall_{\sigma_1} x\forall_{\sigma_2} y(\rho_1(y)=x\leftrightarrow i_2(y)=x) \\
%\delta_{\rho_2}&:=\forall_{\sigma_1} x\forall_{\sigma_3} z(\rho_2(z)=x\leftrightarrow i_3(z)=x)
%\end{align*}
%(The admissibility conditions for these definitions are trivially satisfied.)
%And furthermore, the theory $T_1^2=T_1^1\cup\{\delta_{\rho_1}, \delta_{\rho_2}\}$ is then a Morita extension of $T_1^1$ to the signature $\Sigma=\{\sigma_1,\sigma_2, \sigma_3, p,q, i_2, i_3, \rho_1, \rho_2\}$, where $\rho_1$ and $\rho_2$ are function symbols of arity $\sigma_2\rightarrow\sigma_1$ and $\sigma_3\rightarrow\sigma_1$.
The theory $T_2^1=T_2\cup\{\delta_{\sigma_1}\}$ is a Morita extension of $T_2$ to the signature $\Sigma_2\cup\{\sigma_1, i_2, i_3\}$. It defines $\sigma_1$ to be the coproduct sort of $\sigma_2$ and $\sigma_3$. Lastly, the $\Sigma$-theory $T_2^2=T_2^1\cup\{\delta_p, \delta_q\}$ is a Morita extension of $T_2^1$ to the signature $\Sigma$. It defines the predicates $p$ and $q$ to apply to elements in the ``images'' of $i_2$ and $i_3$, respectively.
%with the sentence
%$$
%\delta_{\sigma_1}:=\forall_{\sigma_1} x\big(\exists_{\sigma_2=1}y(\rho_1(y)=x)\lor\exists_{\sigma_3=1} z(\rho_2(z)=x)\big)
%\land \forall_{\sigma_2} y\forall_{\sigma_3} z \lnot\big(\rho_1(y)=\rho_2(z)\big)
%$$
%which defines $\sigma_1$ as a coproduct sort of $\sigma_2$ and $\sigma_3$. The theory $T_2^1=T_1\cup\{\delta_{\sigma_1}\}$ is a Morita extension of $T_2$. We define one final Morita extension $T_2^2$ of $T_2^1$ to the signature $\Sigma$. The following two sentences define the predicates $p$ and $q$.
%\begin{align*}
%\delta_p:= \forall_{\sigma_1} x\big(p(x)\leftrightarrow\exists_{\sigma_2} y(i_2(y)=x)\big) \\
%\delta_q:= \forall_{\sigma_1} x\big(q(x)\leftrightarrow\exists_{\sigma_3} z(i_3(z)=x)\big)
%\end{align*}
%The theory $T_2^2=T_2^1\cup\{\delta_p, \delta_q, \delta_{\rho_1}, \delta_{\rho_2}\}$ is a Morita extension of $T_2^1$ to $\Sigma$. (The sentenecs $\delta_{\rho_1}$ and $\delta_{\rho_2}$ are here serving as explicit definitions of the function symbols $i_2$ and $i_3$.)
%
One can verify that $T_1^1$ and $T_2^2$ are logically equivalent, so
$T_1$ and $T_2$ are Morita equivalent.
\end{example}
\section{Quine on the dispensability of many-sorted
logic} \label{quine-sort}
The notion of Morita equivalence bears directly on several central
disputes in 20th century analytic philosophy. For example, in his
debate with Carnap and the logical positivists, Quine claims that
many-sorted logic is dispensable. Morita equivalence shows a precise
sense in which Quine is right about that. Similarly, to motivate the
rejection of scientific realism, Putnam claims that a geometric theory
with points as primitives is equivalent to a theory with lines as
primitives. (See, for example, \cite[489-91]{putnam1977}, \cite[109,
115-20]{putnam1992}, and \cite{putnam2001}.) Morita equivalence also
shows a precise sense in which Putnam is right about that. We take up
Quine's argument in the remainder of this section. We take up
Putnam's argument in Chapter \ref{chap-realism}, after we have
developed some semantic tools.
One proves Quine's claim by explicitly constructing a
``corresponding'' single-sorted theory $\widehat{T}$ for every
many-sorted theory $T$. The basic idea behind the construction is
intuitive. The theory $\widehat{T}$ simply replaces the sort symbols
that the theory $T$ uses with predicate symbols.\footnote{This
construction recalls the proof that every theory is definitionally
equivalent to a theory that uses only predicate symbols
\cite[Prop.~2]{barrett-glymour}. \cite{quine1937, quine1938,
quine1956, quine1963} suggests the basic idea behind our proof, as
do \citet[12]{burgess2005} and \citet[221-2]{manzano-book}. The
theorem that we prove here is much more general than Quine's results
because we make no assumption about what the theory $T$ is, whereas
Quine only considered Russell's theory of types and NBG set theory.}
It takes some work, however, to make this idea precise.
Let $\Sigma$ be a signature with finitely many sort symbols
$\sigma_1,\ldots, \sigma_n$. We begin by constructing a corresponding
signature $\widehat{\Sigma}$ that contains one sort symbol
$\sigma$. The symbols in $\widehat{\Sigma}$ are defined as follows.
%\begin{itemize}
%\item
For every sort symbol $\sigma_j\in\Sigma$ we let $q_{\sigma_j}$ be a
predicate symbol of sort $\sigma$.
% \item
For every predicate symbol $p\in\Sigma$ of arity
$\sigma_{j_1}\times\ldots\times\sigma_{j_m}$ we let $q_p$ be a
predicate symbol of arity $\sigma^m$ (the $m$-fold product of
$\sigma$).
%\item
Likewise, for every function symbol $f\in\Sigma$ of arity
$\sigma_{j_1}\times\ldots\times\sigma_{j_m}\rightarrow\sigma_j$ we let
$q_f$ be a predicate symbol of arity $\sigma^{m+1}$.
%\item
And lastly, for every constant symbol $c\in\Sigma$ we let $d_c$ be a constant symbol of sort $\sigma$.
%\end{itemize}
The single-sorted signature $\widehat{\Sigma}$ corresponding to $\Sigma$ is then defined to be
$$
\widehat{\Sigma}=\{\sigma\}\cup\{q_{\sigma_1},\ldots,
q_{\sigma_n}\}\cup\{q_p: p\in\Sigma\}\cup\{q_f: f\in\Sigma\}\cup\{d_c:
c\in\Sigma\}
$$
We can now describe a method of ``translating'' $\Sigma$-theories into
$\widehat{\Sigma}$-theories. Let $T$ be an arbitrary $\Sigma$-theory.
We define a corresponding $\widehat{\Sigma}$-theory $\widehat{T}$, and
then show that $\widehat{T}$ is Morita equivalent to $T$.
We begin by translating the axioms of $T$ into the signature
$\widehat{\Sigma}$. This will take two steps. First, we describe a
way to translate the $\Sigma$-terms into $\widehat{\Sigma}$-formulas.
Given a $\Sigma$-term $t(x_1,\ldots, x_n)$, we define the
$\widehat{\Sigma}$-formula $\widehat{\psi}_t(y_1,\ldots, y_n, y)$
recursively as follows.
\begin{itemize}
\item If $t(x_1,\ldots, x_n)$ is the variable $x_i$, then
$\widehat{\psi}_t$ is the $\widehat{\Sigma}$-formula $y_i=y$.
\item If $t(x_1,\ldots, x_n)$ is the constant $c$, then
$\widehat{\psi}_t$ is the $\widehat{\Sigma}$-formula $d_{c}=y$.
\item Suppose that $t(x_1,\ldots, x_n)$ is the term $f(t_1(x_1,\ldots,
x_n),\ldots, t_k(x_1,\ldots, x_n))$ and that each of the
$\widehat{\Sigma}$-formulas $\widehat{\psi}_{t_i}(y_1,\ldots, y_n,
y)$ have been defined. Then $\widehat{\psi}_t(y_1,\ldots, y_n, y)$
is the $\widehat{\Sigma}$-formula
$$
\exists_{\sigma}z_1\ldots \exists_\sigma z_k\big(\widehat{\psi}_{t_1}(y_1,\ldots, y_n, z_1)\land\ldots\land\widehat{\psi}_{t_k}(y_1,\ldots, y_n, z_k)\land q_{f}(z_1,\ldots, z_k, y)\big)
$$
\end{itemize}
One can think of the formula $\psi_t(y_1,\ldots, y_n, y)$ as the
translation of the expression ``$t(x_1,\ldots, x_n)=x$'' into the
signature $\widehat{\Sigma}$.
Second, we use this map from $\Sigma$-terms to
$\widehat{\Sigma}$-formulas to describe a map from $\Sigma$-formulas
to $\widehat{\Sigma}$-formulas. Given a $\Sigma$-formula
$\psi(x_1,\ldots, x_n)$, we define the $\widehat{\Sigma}$-formula
$\widehat{\psi}(y_1,\ldots, y_n)$ recursively as follows.
\begin{itemize}
\item If $\psi(x_1,\ldots, x_n)$ is $t(x_1,\ldots, x_n)=s(x_1,\ldots, x_n)$, where $s$ and $t$ are $\Sigma$-terms of sort $\sigma_i$, then $\widehat{\psi}(y_1,\ldots, y_n)$ is the $\widehat{\Sigma}$-formula
$$
\exists_\sigma z\big(\widehat{\psi}_{t}(y_1,\ldots, y_n, z)\land
\widehat{\psi}_{s}(y_1,\ldots, y_n, z)\land q_{\sigma_i}(z)\big)
$$
\item If $\psi(x_1,\ldots, x_n)$ is $p(t_1(x_1,\ldots, x_n),\ldots,
t_k(x_1,\ldots, x_n))$, where $p\in\Sigma$ is a predicate symbol,
then $\widehat{\psi}(y_1,\ldots, y_n)$ is the
$\widehat{\Sigma}$-formula
$$
\exists_\sigma z_1\ldots \exists_\sigma
z_k\big(\widehat{\psi}_{t_1}(y_1,\ldots, y_n,
z_1)\land\ldots\land\widehat{\psi}_{t_k}(y_1,\ldots, y_n, z_k)\land
q_{p}(z_1,\ldots, z_k)\big)
$$
\item This definition extends to all $\Sigma$-formulas in the standard
way. We define the $\widehat{\Sigma}$-formulas
$\widehat{\lnot\psi}:=\lnot\widehat{\psi}$,
$\widehat{\psi_1\land\psi_2}:=\widehat{\psi_1}\land\widehat{\psi_2}$,
$\widehat{\psi_1\lor\psi_2}:=\widehat{\psi_1}\lor\widehat{\psi_2}$,
and
$\widehat{\psi_1\rightarrow\psi_2}:=\widehat{\psi_1}\rightarrow\widehat{\psi_2}$. Furthermore,
if $\psi(x_1,\ldots, x_n, x)$ is a $\Sigma$-formula, then we define
both of the following:
\begin{align*}
\widehat{\forall_{\sigma_i} x \psi}&:=\forall_\sigma y(q_{\sigma_i}(y)\rightarrow \widehat{\psi}(y_1,\ldots, y_n, y))\\
\widehat{\exists_{\sigma_i} x\psi}&:=\exists_\sigma y(q_{\sigma_i}(y)\land\widehat{\psi}(y_1,\ldots, y_n, y))
\end{align*}
\end{itemize}
% We have defined the $\widehat{\Sigma}$-formula $\widehat{\psi}$ for
% every $\Sigma$-formula $\psi$.
One should think of the formula $\widehat{\psi}$ as the translation of
the $\Sigma$-formula $\psi$ into the signature $\widehat{\Sigma}$.
This allows us to consider the translations $\widehat{\alpha}$ of the
axioms $\alpha\in T$. The single-sorted theory $\widehat{T}$ will have
the $\widehat{\Sigma}$-sentences $\widehat{\alpha}$ as some of its
axioms. But $\widehat{T}$ will have more axioms than just the
sentences $\widehat{\alpha}$. It will also have some \textbf{auxiliary
axioms}. These auxiliary axioms will guarantee that the symbols in
$\widehat{\Sigma}$ ``behave like'' their counterparts in $\Sigma$. We
define auxiliary axioms for the predicate symbols
$q_{\sigma_1},\ldots, q_{\sigma_n}\in\widehat{\Sigma}$,
$q_{p}\in\widehat{\Sigma}$, and $q_f\in\widehat{\Sigma}$, and for the
constant symbols $d_c\in\widehat{\Sigma}$. We discuss each of these
four cases in detail.
We first define auxiliary axioms to guarantee that the symbols
$q_{\sigma_1},\ldots, q_{\sigma_n}$ behave like sort symbols. The
$\widehat{\Sigma}$-sentence $\phi$ is defined to be $\forall_\sigma
y(q_{\sigma_1}(y)\lor\ldots\lor q_{\sigma_n}(y))$.\footnote{Note that
if there were infinitely many sort symbols in $\Sigma$, then we
could not define the $\widehat{\Sigma}$-sentence $\phi$ in this
way.} Furthermore, for each sort symbol $\sigma_j\in\Sigma$ we
define the $\widehat{\Sigma}$-sentence $\phi_{\sigma_j}$ to be
\begin{align*}
\exists_\sigma y (q_{\sigma_j}(y))\land\forall_\sigma y \big(q_{\sigma_j}(y)\rightarrow (&\lnot q_{\sigma_1}(y)\land\ldots\land \lnot q_{\sigma_{j-1}}(y)\\&\land \lnot q_{\sigma_{j+1}}(y)\land\ldots\land\lnot q_{\sigma_n}(y))\big)
\end{align*}
One can think of the sentences $\phi_{\sigma_1},\ldots, \phi_{\sigma_n}$, and $\phi$ as saying that ``everything is of some sort, nothing is of more than one sort, and every sort is nonempty.''
Next we define auxiliary axioms to guarantee that the symbols $q_p$,
$q_f$, and $d_c$ behave like their counterparts $p$, $f$, and $c$ in
$\Sigma$. For each predicate symbol $p\in\Sigma$ of arity
$\sigma_{j_1}\times\ldots\times\sigma_{j_m}$, we define the
$\widehat{\Sigma}$-sentence $\phi_{p}$ to be
$$
\forall_\sigma y_1\ldots \forall_\sigma y_m\left(q_{p}(y_1,\ldots,
y_m)\rightarrow \left(q_{\sigma_{j_1}}(y_1)\land \ldots\land
q_{\sigma_{j_m}}(y_m)\right)\right)
$$
This sentence restricts the extension of $q_p$ to the subdomain of
$n$-tuples satisfying $q_{\sigma _{j_1}},\dots ,q_{\sigma _{j_m}}$,
guaranteeing that the predicate $q_p$ has ``the appropriate arity.''
Consider, for example, the case of a unary predicate $p$ of sort
$\sigma _i$. In that case, $\phi _p$ says that
$$ \forall _\sigma y (q_p(y)\to q_{\sigma _i}(y)) ,$$
which means that nothing outside the subdomain $q_{\sigma _i}$
satisfies $q_p$. Note, however, that here we have made a
conventional choice. We could just as well have stipulated
that $q_p$ applies to \textit{everything} outside of the subdomain
$q_{\sigma _i}$. All that matters here is that $q_p$ is trivial (either trivially true, or trivially false) except on the subdomain of objects satisfying $q_{\sigma _i}$.
For each function symbol $f\in\Sigma$ of arity
$\sigma_{j_1}\times\ldots\times\sigma_{j_m}\rightarrow\sigma_j$ we
define the $\widehat{\Sigma}$-sentence $\phi_{f}$ to be the
conjunction
\begin{align*}
&\forall_\sigma y_1\ldots\forall_\sigma y_m \forall_\sigma y\big(q_{f}(y_1,\ldots, y_m, y)\rightarrow (q_{\sigma_{j_1}}(y_1)\land\ldots\land q_{\sigma_{j_m}}(y_m)\land q_{\sigma_j}(y))\big)\\
%
&\land \forall_\sigma y_1\ldots\forall_\sigma y_m\big((q_{\sigma_{j_1}}(y_1)\land\ldots\land q_{\sigma_{j_m}}(y_m))\rightarrow\exists_{\sigma=1} y(q_{f}(y_1,\ldots, y_m, y))\big)
\end{align*}
The first conjunct guarantees that the symbol $q_{f}$ has ``the
appropriate arity,'' and the second conjunct guarantees that $q_f$
behaves like a function. Lastly, if $c\in\Sigma$ is a constant symbol
of arity $\sigma_j$, then we define the $\widehat{\Sigma}$-sentence
$\phi_{c}$ to be $q_{\sigma_j}(d_{c})$. This sentence guarantees that
the constant symbol $d_{c}$ also has ``the appropriate arity.''
We now have the resources to define a $\widehat{\Sigma}$-theory
$\widehat{T}$ that is Morita equivalent to $T$.
\begin{align*}
\widehat{T}=\{\widehat{\alpha}: \alpha\in T\} & \cup \{\phi,
\phi_{\sigma_1},\ldots,\phi_{\sigma_n}\}\\ &\cup \{\phi_{p}:
p\in\Sigma\}\\ &\cup \{\phi_{f}: f\in \Sigma\}\\ &\cup \{\phi_{c}:
c\in\Sigma\}
\end{align*}
The theory $\widehat{T}$ has two kinds of axioms, the translated
axioms of $T$ and the auxiliary axioms. These axioms allow
$\widehat{T}$ to imitate the theory $T$ in the signature
$\widehat{\Sigma}$. Indeed, one can prove the following result.
\begin{thm}[Barrett] The theories $T$ and $\widehat{T}$ are Morita
equivalent. \label{mange}
\end{thm}
The proof of Theorem \ref{mange} requires some work, but the idea
behind it is simple. The theory $T$ needs to define symbols in
$\widehat{\Sigma}$. It defines the sort symbol $\sigma$ as a
``universal sort,'' by taking the coproduct of the sorts
$\sigma_1,\ldots, \sigma_n\in\Sigma$.
%One can think of $T$ as defining $\sigma$ to be the sort ``$\sigma_1\sqcup\ldots\sqcup\sigma_n$.''
The theory $T$ then defines the symbols $q_p$, $q_f$, and $d_c$ in
$\widehat{\Sigma}$ simply by using the corresponding symbols $p$, $f$,
and $c$ in $\Sigma$. Likewise, the theory $\widehat{T}$ needs to
define the symbols in $\Sigma$. It defines the sort symbol $\sigma_j$
as the subsort of ``things that are $q_{\sigma_j}$'' for each
$j=1,\ldots, n$. And $\widehat{T}$ defines the symbols $p$, $f$, and
$c$ again by using the corresponding symbols $q_p$, $q_f$, and $d_c$.
We now proceed to the gory details. We prove a
special case of the result for convenience. We will assume that
$\Sigma$ has only three sort symbols $\sigma_1, \sigma_2,\sigma_3$ and
that $\Sigma$ does not contain function or constant symbols. A
perfectly analogous (though more tedious) proof goes through in the
general case.
We prove the result by explicitly constructing a ``common Morita extension'' $T_{4}\cong \widehat{T}_{4}$ of $T$ and $\widehat{T}$ to the following signature.
\begin{align*}
\Sigma^+\:=\:\Sigma\cup\widehat{\Sigma}&\cup\{\sigma_{12}\} \cup \{\rho_1, \rho_2, \rho_{12}, \rho_3\}
\cup\{ i_1,i_2, i_3\} .
\end{align*}
The symbol $\sigma_{12}\in\Sigma^+$ is a sort symbol. The symbols
denoted by subscripted $\rho$ are function symbols. Their arities are
expressed in the following figure.
\[
%% TO DO: fix this picture
\begin{tikzpicture}
%\node(1) {$\sigma$};
%\node(2) [below right of=1] {$\sigma_{n}$};
%\node(3) [below left of=1] {$\sigma_{1\ldots n-1}$};
%\node(4) [below right of=3] {$\sigma_{n-1}$};
\node(5) {$\sigma$};
\node(6) [below right of=5] {$\sigma_{3}$};
\node(7) [below left of=5] {$\sigma_{12}$};
\node(8) [below right of=7] {$\sigma_2$};
\node(9) [below left of =7] {$\sigma_1$};
%\draw[loosely dotted] (6) to node {} (4);
%\draw[loosely dotted] (5) to node {} (3);
\draw[->] (9) to node {$\rho_1$} (7);
\draw[->] (7) to node {$\rho_{12}$} (5);
\draw[->] (6) to node {$\rho_{3}$} (5);
\draw[->] (8) to node {$\rho_2$} (7);
%\draw[->] (4) to node {$\rho_{n-1}$} (3);
%\draw[->] (3) to node {$\rho_{1\ldots n-1}$} (1);
%\draw[->] (2) to node {$\rho_{n}$} (1);
%\draw[->] (8) to [out=0, in=0, looseness=3] node [swap] {$i_2$} (1);
%\draw[->] (2) to [out=0, in=0, looseness=2] node {$i_n$} (1);
%\draw[->] (4) to [out=0, in=0, looseness=3] node [swap] {$i_{n-1}$} (1);
%\draw[->] (6) to [out=0, in=0, looseness=3] node [swap] {$i_{n-1}$} (1);
\end{tikzpicture}
\]
The symbols $i_1$, $i_2$, and $i_3$ are function symbols with arity $\sigma_1\rightarrow \sigma$, $\sigma_2\rightarrow\sigma$, and $\sigma_3\rightarrow\sigma$, respectively.
We now turn to the proof.
\begin{proof}[Proof of Theorem \ref{mange}]
The following figure illustrates how our proof will be organized.
\begin{center}
\includegraphics{quineconjecture3revised.pdf}
\end{center}
Steps 1--3 define the theories $\widehat{T}_1,\ldots, \widehat{T}_{4}$, steps 4--6 define $T_1,\ldots, T_{4}$, and step 7 shows that $T_{4}$ and $\widehat{T}_{4}$ are logically equivalent.
\textbf{Step 1.} We begin by defining the theory $\widehat{T}_1$. For each sort $\sigma_j\in\Sigma$ we consider the following sentence.
\begin{align*}
\begin{aligned}
&\forall_\sigma y \left(q_{\sigma_j}(y)\leftrightarrow \exists_{\sigma_j} x(i_j(x)=y)\right) \\ &\qquad\land \forall_{\sigma_j} x_1\forall_{\sigma_j} x_2 (i_j(x_1)=i_j(x_2)\rightarrow x_1=x_2)
\end{aligned}
\tag{$\theta_{\sigma_j}$}
\end{align*}
The sentence $\theta_{\sigma_j}$ defines the symbols $\sigma_j$ and
$i_j$ as the subsort of ``things that are $q_{\sigma_j}$.'' The
auxiliary axioms $\phi_{\sigma_j}$ of $\widehat{T}$ guarantee that the
admissibility conditions for these definitions are satisfied. The
theory
$\widehat{T}_1=\widehat{T}\cup\{\theta_{\sigma_1},
\theta_{\sigma_2},\theta_{\sigma_3}\}$ is therefore a Morita extension
of $\widehat{T}$ to the signature
$\widehat{\Sigma}\cup\{\sigma_1,\sigma_2,\sigma_3, i_1,i_2,i_3\}$.
%We now show how the theory $\widehat{T}_1$ defines the new sorts $\sigma_{12}, \ldots, \sigma_{12,\ldots, n-1}$ and the function symbols $\rho_1, \rho_2, \ldots, \rho_{12\ldots n-1}, \rho_n$.
\textbf{Step 2.} We now define the theories $\widehat{T}_2$ and $\widehat{T}_3$. Let $\theta_{\sigma_{12}}$ be a sentence that defines the symbols $\sigma_{12}, \rho_1, \rho_2$ as a coproduct sort. The theory $\widehat{T}_2=\widehat{T}_1\cup\{\theta_{\sigma_{12}}\}$ is clearly a Morita extension of $\widehat{T}_1$.
%Likewise, the theory $\widehat{T}_1^1$ uses a sentence $\theta_{\sigma_{123}}$ to define the symbols $\sigma_{123}, \rho_{12}, \rho_3$ as a coproduct sort, resulting in another Morita extension $\widehat{T}_1^2=\widehat{T}\cup\{\theta_{\sigma_{12}}, \theta_{\sigma_{123}}\}$.
%We repeat this process, at each step using the sentence $\theta_{\sigma_{1\ldots k}}$ to define the symbols $\sigma_{1\ldots k}$, $\rho_{1\ldots k-1}$, $\rho_k$ as coproduct sort, until we arrive at the theory
%$$
%\widehat{T}_{n-1}=\widehat{T}_1\cup\{\theta_{\sigma_{12}}, \ldots, \theta_{\sigma_{1\ldots n-1}}\}
%$$
%in the signature $\widehat{\Sigma}\cup\{\sigma_1,\ldots, \sigma_n, i_1,\ldots, i_n\}\cup\{\sigma_{12},\ldots, \sigma_{12\ldots n-1}, \rho_1,\ldots, \rho_{n-1}\}$. One trivially sees that each $\widehat{T}_k$ that we have defined is a Morita extension of the theory $\widehat{T}_{k-1}$.
%
We have yet to define the function symbols $\rho_{12}$ and $\rho_3$. The following two sentences define these symbols.
\begin{align*}
\tag{$\theta_{\rho_3}$}
&\forall_{\sigma_3} x \forall_\sigma y(\rho_3(x)=y\leftrightarrow i_3(x)=y)\\
%
&\forall_{\sigma_{12}} x \forall_\sigma y( \rho_{12}(x)=y\leftrightarrow \psi(x, y))
\tag{$\theta_{\rho_{12}}$}
\end{align*}
The sentence $\theta_{\rho_3}$ simply defines $\rho_3$ to be equal to the function $i_3$. For the sentence $\theta_{\rho_{12}}$, we define the formula $\psi(x,y)$ to be
%\begin{quote}
%Either (1) there is a $z$ such that $\rho_{n-1}(z)=z$ and $i_{n-1}(z)=y$ \textit{or} (2) there are $z_1$ and $z_2$ such that $\rho_{1\ldots n-2}(z_1)=x$, $\rho_{n-2}(z_2)=z_1$, and $i_{n-2}(z_2)=y$ \textit{or} \ldots \textit{or} (
%\end{quote}
\begin{align*}
&\exists_{\sigma_{1}} z_1 \big( \rho_{1}(z_1)=x\land i_{1}(z_1)=y\big)
\lor\exists_{\sigma_{2}} z_2\big( \rho_{2}(z_2)=x\land i_{2}(z_2)=y\big)
\end{align*}
We should take a moment here to understand the definition $\theta_{\rho_{12}}$. We want to define what the function $\rho_{12}$ does to an element $a$ of sort $\sigma_{12}$. Since the sort $\sigma_{12}$ is the coproduct of the sorts $\sigma_1$ and $\sigma_2$, the element $a$ must ``actually be'' of one of the sorts $\sigma_1$ or $\sigma_2$. (The disjuncts in the formula $\psi(x,y)$ correspond to these possibilities.) The definition $\theta_{\rho_{12}}$ stipulates that if $a$ is ``actually'' of sort $\sigma_j$, then the value of $\rho_{12}$ at $a$ is the same as the value of $i_j$ at $a$. One can verify that $\widehat{T}_{2}$ satisfies the admissibility conditions for $\theta_{\rho_3}$ and $\theta_{\rho_{12}}$, so the theory $\widehat{T}_3=\widehat{T}_{2}\cup\{\theta_{\rho_3}, \theta_{\rho_{12}}\}$ is a Morita extension of $\widehat{T}_{2}$ to the signature
$$
\widehat{\Sigma}\cup\{\sigma_1,\sigma_2, \sigma_3, \sigma_{12}, i_1,i_2, i_3, \rho_1, \rho_2,\rho_3, \rho_{12}\}
$$
\textbf{Step 3.} We now describe the $\Sigma^+$-theory $\widehat{T}_{4}$. This theory defines the predicates in the signature $\Sigma$. Let $p\in\Sigma$ be a predicate symbol of arity $\sigma_{j_1}\times\ldots\times\sigma_{j_m}$. We consider the following sentence.
\begin{align*}
\tag{$\theta_{p}$}
\forall_{\sigma_{j_1}} x_1\ldots \forall_{\sigma_{j_m}} x_m\left(p(x_1,\ldots, x_m)\leftrightarrow q_{p}(i_{j_1}(x_1),\ldots, i_{j_m}(x_m))\right)
\end{align*}
The theory $\widehat{T}_{4}=\widehat{T}_3\cup\{\theta_{p}:p\in\Sigma\}$ is therefore a Morita extension of $\widehat{T}_3$ to the signature $\Sigma^+$.
\textbf{Step 4.} We turn to the left-hand side of our organizational figure and define the theories $T_1$ and $T_{2}$. We proceed in an analogous manner to the first part of Step 2. The theory $T_1=T\cup\{\theta_{\sigma_{12}}\}$ is a Morita extension of $T$ to the signature $\Sigma\cup\{\sigma_{12}, \rho_1,\rho_2\}$. Now let $\theta_\sigma$ be the sentence that defines the symbols $\sigma, \rho_{12}, \rho_3$ as a coproduct sort. The theory $T_{2}=T_{1}\cup\{\theta_{\sigma}\}$ is a Morita extension of $T_{1}$ to the signature $\Sigma\cup\{\sigma_{12}, \sigma, \rho_1, \rho_2, \rho_3, \rho_{12}\}$.
\textbf{Step 5.} This step defines the function symbols $i_1$, $i_2$, and $i_3$. We consider the following sentences.
\begin{align*}
\tag{$\theta_{i_3}$}