How the continuum hypothesis might have been a fundamental axiom
Abstract. I shall describe a historical thought experiment showing how our attitude toward the continuum hypothesis could easily have been very different than it is. If our mathematical history had been just a little different, I claim, if certain mathematical discoveries had been made in a slightly different order, then we would naturally view the continuum hypothesis as a fundamental axiom of set theory, necessary for mathematics and indeed indispensable for calculus.
Abstract. I shall describe a simple historical thought experiment showing how our attitude toward the continuum hypothesis could easily have been very different than it is. If our mathematical history had been just a little different, I claim, if certain mathematical discoveries had been made in a slightly different order, then we would naturally view the continuum hypothesis as a fundamental axiom of set theory, necessary for mathematics and indeed indispensable for calculus.
Abstract. I describe a simple historical thought experiment showing how we might have come to view the continuum hypothesis as a fundamental axiom, one necessary for mathematics, indispensable even for calculus.
See also this talk I gave on the topic at the University of Oslo:
Abstract. Zermelo famously proved that second-order ZFC is quasi-categoricalβthe models of this theory are precisely the rank-initial segments of the set-theoretic universe cut off at an inaccessible cardinal. Which are the fully categorical extensions of this theory? This question gives rise to the notion of categorical large cardinals, and opens the door to several puzzling philosophical issues, such as the conflict between categoricity as a fundamental value in mathematics and reflection principles in set theory. (This is joint work with Robin Solberg, Oxford.)
Categorical accounts of various mathematical structures lie at the very core of structuralist mathematical practice, enabling mathematicians to refer to specific mathematical structures, not by having carefully to prepare and point at specially constructed instancesβpreserved like the one-meter iron bar locked in a case in Parisβbut instead merely by mentioning features that uniquely characterize the structure up to isomorphism.
It follows that for any two models of ZFC2, one of them is isomorphic to an initial segment of the other. These set-theoretic models ππ have now come to be known as Zermelo-Grothendieck universes, in light of Grothendieckβs use of them in category theory (a rediscovery several decades after Zermelo); they feature in the universe axiom, which asserts that every set is an element of some such ππ , or equivalently, that there are unboundedly many inaccessible cardinals.
In this article, we seek to investigate the extent to which Zermeloβs quasi-categoricity analysis can rise fully to the level of categoricity, in light of the observation that many of the ππ universes are categorically characterized by their sentences or theories.
Question. Which models of ZFC2 satisfy fully categorical theories?
If π is the smallest inaccessible cardinal, for example, then up to isomorphism ππ is the unique model of ZFC2 satisfying the first-order sentence βthere are no inaccessible cardinals.β The least inaccessible cardinal is therefore an instance of what we call a first-order sententially categorical cardinal. Similar ideas apply to the next inaccessible cardinal, and the next, and so on for quite a long way. Many of the inaccessible universes thus satisfy categorical theories extending ZFC2 by a sentence or theory, either in first or second order, and we should like to investigate these categorical extensions of ZFC2.
In addition, we shall discuss the philosophical relevance of categoricity and point particularly to the philosophical problem posed by the tension between the widespread support for categoricity in our fundamental mathematical structures with set-theoretic ideas on reflection principles, which are at heart anti-categorical.
Our main theme concerns these notions of categoricity:
Main Definition.
A cardinal π is first-order sententially categorical, if there is a first-order sentence π in the language of set theory, such that ππ is categorically characterized by ZFC2+π.
A cardinal π is first-order theory categorical, if there is a first-order theory π in the language of set theory, such that ππ is categorically characterized by ZFC2+π.
A cardinal π is second-order sententially categorical, if there is a second-order sentence π in the language of set theory, such that ππ is categorically characterized by ZFC2+π.
A cardinal π is second-order theory categorical, if there is a second-order theory π in the language of set theory, such that ππ is categorically characterized by ZFC2+π.
Follow through to the arxiv for the pdf to read more:
Abstract: Zermelo famously characterized the models of second-order Zermelo-Fraenkel set theory ZFC2 in his 1930 quasi-categoricity result asserting that the models of ZFC2 are precisely those isomorphic to a rank-initial segment ππ of the cumulative set-theoretic universe π cut off at an inaccessible cardinal π . I shall discuss the extent to which Zermeloβs quasi-categoricity analysis can rise fully to the level of categoricity, in light of the observation that many of the ππ universes are categorically characterized by their sentences or theories. For example, if π is the smallest inaccessible cardinal, then up to isomorphism ππ is the unique model of ZFC2 plus the sentence βthere are no inaccessible cardinals.β This cardinal π is therefore an instance of what we call a first-order sententially categorical cardinal. Similarly, many of the other inaccessible universes satisfy categorical extensions of ZFC2 by a sentence or theory, either in first or second order. I shall thus introduce and investigate the categorical cardinals, a new kind of large cardinal. This is joint work with Robin Solberg (Oxford).
Consider the real numbers β and the complex numbers β and the question of whether these structures are interpretable in one another as fields.
What does it mean to interpret one mathematical structure in another? It means to provide a definable copy of the first structure in the second, by providing a definable domain of π-tuples (not necessarily just a domain of points) and definable interpretations of the atomic operations and relations, as well as a definable equivalence relation, a congruence with respect to the operations and relations, such that the first structure is isomorphic to the quotient of this definable structure by that equivalent relation. All these definitions should be expressible in the language of the host structure.
One may proceed recursively to translate any assertion in the language of the interpreted structure into the language of the host structure, thereby enabling a complete discussion of the first structure purely in the language of the second.
Similarly, the rational field β can be interpreted in the integers as the quotient field, whose elements can be thought of as integer pairs (π,π) written more conveniently as fractions ππ, where πβ 0, considered under the same-ratio relation ππβ‘ππ βΊπβ’π =πβ’π. The field structure is now easy to define on these pairs by the familiar fractional arithmetic, which is well-defined with respect to that equivalence. Thus, we have provided a definable copy of the rational numbers inside the integers, an interpretation of β in β€.
The complex field β is of course interpretable in the real field β by considering the complex number π+πβ’π as represented by the real number pair (π,π), and defining the operations on these pairs in a way that obeys the expected complex arithmetic.
(π,π)+(π,π)=(π+π,π+π)(π,π)β (π,π)=(πβ’πβπβ’π,πβ’π+πβ’π)
Thus, we interpret the complex number field β inside the real field β.
Question. What about an interpretation in the converse direction? Can we interpret β in β?
Although of course the real numbers can be viewed as a subfield of the complex numbers βββ,this by itself doesnβt constitute an interpretation, unless the submodel is definable. And in fact, β is not a definable subset of β. There is no purely field-theoretic property πβ‘(π₯), expressible in the language of fields, that holds in β of all and only the real numbers π₯. But more: not only is β not definable in β as a subfield, we cannot even define a copy of β in β in the language of fields. We cannot interpret β in β in the language of fields.
Theorem. As fields, the real numbers β are not interpretable in the complex numbers β.
The theorem is well-known to model theorists, a standard observation, and model theorists often like to prove it using some sophisticated methods, such as stability theory. The main issue from that point of view is that the order in the real numbers is definable from the real field structure, but the theory of algebraically closed fields is too stable to allow it to define an order like that.
But I would like to give a comparatively elementary proof of the theorem, which doesnβt require knowledge of stability theory. After a conversation this past weekend with Jonathan Pila, Boris Zilber and Alex Wilkie over lunch and coffee breaks at the Robin Gandy conference, here is such an elementary proof, based only on knowledge concerning the enormous number of automorphisms of β, a consequence of the categoricity of the complex field, which itself follows from the fact that algebraically closed fields of a given characteristic are determined by their transcedence degree over their prime subfield. It follows that any two transcendental elements of β are automorphic images of one another, and indeed, for any element π§ββ any two complex numbers transcendental over ββ‘(π§) are automorphic in β by an automorphism fixing π§.
Proof of the theorem. Suppose that we could interpret the real field β inside the complex field β. So we would define a domain of π-tuples π ββπ with an equivalence relation β on it, and operations of addition and multiplication on the equivalence classes, such that the real field was isomorphic to the resulting quotient structure π /β. There is absolutely no requirement that this structure is a submodel of β in any sense, although that would of course be allowed if possible. The + and Γ of the definable copy of β in β might be totally strange new operations defined on those equivalence classes. The definitions altogether may involve finitely many parameters βπ=(π1,β¦,ππ), which we now fix.
As we mentioned, the complex number field β has an enormous number of automorphisms, and indeed, any two π-tuples βπ₯ and βπ¦ that exhibit the same algebraic equations over ββ‘(βπ) will be automorphic by an automorphism fixing βπ. In particular, this means that there are only countably many isomorphism orbits of the π-tuples of β. Since there are uncountably many real numbers, this means that there must be two β-inequivalent π-tuples in the domain π that are automorphic images in β, by an automorphism π:βββ fixing the parameters βπ. Since π fixes the parameters of the definition, it will take π to π and it will respect the equivalence relation and the definition of the addition and multiplication on π /β. Therefore, π will induce an automorphism of the real field β, which will be nontrivial precisely because π took an element of one β-equivalence class to another.
I taught a course in Fall 2011 at NYU entitled Topics in Logic: set theory and the philosophy of set theory, aimed at graduate students in philosophy and others who want to gain greater understanding of some of the set-theoretic topics central to work in the philosophy of set theory. The course began with a review of the mathematical ideas, including a presentation of large cardinals, strong axioms of infinity and their associated elementary embeddings of the universe, and forcing, emphasizing the connection with the Boolean ultrapower and Boolean-valued models, but discussing the alternative formalizations. The second part of the course covers some of the philosophical literature, including what it means to accept or believe mathematical axioms, whether mathematics needs new axioms, the criteria one might use when adopting new axioms, and the question of pluralism and categoricity in set theory.
Here is a partial list of our readings:
1. Mathematical background.
Thomas Jech, Set Theory.
Akihiro Kanamori, The Higher Infinite.
Timothy Chow, βA beginnerβs guide to forcing,β from Contemporary Mathematics. http://arxiv.org/abs/0712.1320. (for mathematical review)
4. W. N. Reinhardt, βRemarks on reflection principles, large cardinals, and elementary embeddings,β Proceedings of Symposia in Pure Mathematics, Vol 13, Part II, 1974, pp. 189-205.
5. Donald Martin, βMultiple universes of sets and indeterminate truth values,β Topoi 20, 5β16, 2001.
6. Hartry Field, βWhich undecidable mathematical sentences have determinate truth values,β as reprinted in his book Truth and the Absence of Fact, Oxford University Press, 2001.
7. A brief selection from Marc Balaguer, Platonism and Anti-Platonism in Mathematics, Oxford University Press, 1998, describing the plenitudinous Platonism position.
8. Daniel Isaacson, βThe reality of mathematics and the case of set theory,β 2007.
13. Interpretability of theories, the interpretability degrees and Orey sentences in set theory and arithmetic. Some of the basic material is found in Per LindstrΓΆmβs book Aspects of Incompleteness, available at http://projecteuclid.org/euclid.lnl/1235416274, particularly chapter 6, and some later chapters.
14. Haim Gaifman, βOn ontology and realism in mathematics,β to appear in the Review of Symbolic Logic (special issue connected with the NYU philosophy of mathematics conference 2009).