Question. Which topological spaces support a topological model of arithmetic?
In the paper, we prove that the reals β, the reals in any finite dimension βπ, the long line and Cantor space do not support a topological model of arithmetic, and neither does any Suslin line. Meanwhile, there are many other spaces that do support topological models, including many uncountable subspaces of the plane β2. It remains an open question whether any uncountable Polish space, including the Baire space, can support a topological model of arithmetic.
Let me state the main theorem and briefly sketch the proof.
Main Theorem. Every countable model of PA has a continuous presentation on the rationals β.
This phenomenon amounts exactly to the continuity of addition and multiplication with respect to what we call the final-digits topology on β, which is the topology having basic open sets ππ , the set of numbers whose binary representations ends with the digits π , for any finite binary string π . (One can do a similar thing with any base.) In the ππ notation, we include the number that would arise by deleting initial 0s from π ; for example, 6βπ00110. Addition and multiplication are continuous in this topology, because if π₯+π¦ or π₯β π¦ has final digits π , then by the school-childβs observation, this is ensured by corresponding final digits in π₯ and π¦, and so (π₯,π¦) has an open neighborhood in the final-digits product space, whose image under the sum or product, respectively, is contained in ππ .
But let us belabor the argument somewhat, since we find it interesting to notice that the final-digits topology (or equivalently, the 2-adic metric topology on β) is precisely the order topology of a certain definable order on β, what we call the final-digits order, an endless dense linear order, which is therefore order-isomorphic and thus also homeomorphic to the rational line β, as desired.
The final-digits order β on the natural numbers is determined by the left-to-right order of nodes and branches in a certain presentation of the binary tree, pictured in figure 1. Each node in the tree represents a natural number via the binary sequence of digits proceeding from that node down to the root, and the final-digits order is determined from the induced left-to-right order of these nodes. As one climbs the tree, successive bits are added as leading digits of the binary representations. Since leading digits of 0 do not affect the number represented, these nodes move neither left nor right, but proceed straight up in the tree. Leading digits of 1, however, branch to the right at even stages in this tree and to the left at odd stages. One easily determines any instance of the final-digits order relation for natural numbers πβπ by inspecting the right-most digit of disagreement in the binary representations of π and π (appending leading 0s if necessary), and considering whether this digitβs place is even or odd.
We now complete the proof by considering an arbitrary countable model π of PA. Let βπ be the final-digits order as defined inside π. Since the reasoning of the above paragraphs can be undertaken in PA, it follows that π can see that its addition and multiplication are continuous with respect to the order topology of its final-digits order. Since π is countable, the final-digits order of π makes it a countable endless dense linear order, which by Cantorβs theorem is therefore order-isomorphic and hence homeomorphic to β. Thus, π has a continuous presentation on the rational line β, as desired. β»
The executive summary of the proof is: the arithmetic of the standard model β is continuous with respect to the final-digits topology, which is the same as the 2-adic metric topology on β, and this is homeomorphic to the rational line, because it is the order topology of the final-digits order, a definable endless dense linear order; applied in a nonstandard model π, this observation means the arithmetic of π is continuous with respect to its rational line βπ, which for countable models is isomorphic to the actual rational line β, and so such an π is continuously presentable upon β.
Let me mention the following order, which it seems many people expect to use instead of the final-digits order as we defined it above. With this order, one in effect takes missing initial digits of a number as 0, which is of course quite reasonable.
The problem with this order, however, is that the order topology is not actually the final-digits topology. For example, the set of all numbers having final digits 110 in this order has a least element, the number 6, and so it is not open in the order topology. Worse, I claim that arithmetic is not continuous with respect to this order. For example, 1+1=2, and 2 has an open neighborhood consisting entirely of even numbers, but every open neighborhood of 1 has both odd and even numbers, whose sums therefore will not all be in the selected neighborhood of 2. Even the successor function π₯β¦π₯+1 is not continuous with respect to this order.
Finally, let me mention that a version of the main theorem also applies to the integers β€, using the following order.
I was recently asked this interesting question on set-theoretic geology by Iian Smythe, a set-theory post-doc at Rutgers University; the problem arose in the context of one of this current research projects.
Question. Assume that two product forcing extensions are the same πβ‘[πΊ]β’[πΎ]=πβ‘[π»]β’[πΎ], where πβ‘[πΊ] and πβ‘[π»] are forcing extensions of π by the same forcing notion β, and πΎβββπ is both πβ‘[πΊ] and πβ‘[π»]-generic with respect to this further forcing β. Can we conclude that πβ‘[πΊ]=πβ‘[π»]? Can we make this conclusion at least in the special case that β is adding a Cohen real and β is collapsing the continuum?
It seems natural to hope for a positive answer, because we are aware of many such situations that arise with forcing, where indeed πβ‘[πΊ]=πβ‘[π»]. Nevertheless, the answer is negative. Indeed, we cannot legitimately make this conclusion even when both steps of forcing are adding merely a Cohen real. And such a counterexample implies that there is a counterexample of the type mentioned in the question, simply by performing further collapse forcing.
Theorem. For any countable model π of set theory, there are π-generic Cohen reals π, π and π, such that
The Cohen reals π and π are mutually generic over π.
The Cohen reals π and π are mutually generic over π.
These two pairs produce the same forcing extension πβ‘[π]β’[π]=πβ‘[π]β’[π].
But the intermediate models are different πβ‘[π]β πβ‘[π].
Proof. Fix π, and let π and π be any two mutually generic Cohen reals over π. Let us view them as infinite binary sequences, that is, as elements of Cantor space. In the extension πβ‘[π]β’[π], let π=π+πmodβ‘2, in each coordinate. That is, we get π from π by flipping bits, but only on coordinates that are 1 in π. This is the same as applying a bit-flipping automorphism of the forcing, which is available in πβ‘[π], but not in π. Since π is πβ‘[π]-generic by reversing the order of forcing, it follows that π also is πβ‘[π]-generic, since the automorphism is in πβ‘[π]. Thus, π and π are mutually generic over π. Further, πβ‘[π]β’[π]=πβ‘[π]β’[π], because πβ‘[π]β’[π]=πβ‘[π]β’[π], as π and π were isomorphic generic filters by an isomorphism in πβ‘[π]. But finally, πβ‘[π] and πβ‘[π] are not the same, because from π and π together we can construct π, because we can tell exactly which bits were flipped. β»
If one now follows the π forcing with collapse forcing, one achieves a counterexample model of the type mentioned in the question, namely, with πβ‘[π]β’[πβπΎ]=πβ‘[π]β’[πβπΎ], but πβ‘[π]β πβ‘[π].
I have a feeling that my co-authors on a current paper in progress, Set-theoretic blockchains, on the topic of non-amalgamation in the generic multiverse, will tell me that the argument above is an instance of some of the theorems we prove in the latter part of that paper. (Miha, please tell me in the comments, if you see this, or tell me where I have seen this argument before; I think I made this argument or perhaps seen it before.) The paper is [bibtex key=βHabicHamkinsKlausnerVernerWilliams2018:Set-theoretic-blockchainsβ].
I have found a new proof of the Barwise extension theorem, that wonderful yet quirky result of classical admissible set theory, which says that every countable model of set theory can be extended to a model of ZFC+π=πΏ.
Barwise Extension Theorem. (Barwise 1971) Every countable model of set theory πβ§ZF has an end-extension to a model of ZFC+π=πΏ.
The Barwise extension theorem is both (i) a technical culmination of the pioneering methods of Barwise in admissible set theory and infinitary logic, including the Barwise compactness and completeness theorems and the admissible cover, but also (ii) one of those rare mathematical theorems that is saturated with significance for the philosophy of mathematics and particularly the philosophy of set theory. I discussed the theorem and its philosophical significance at length in my paper, The multiverse perspective on the axiom of constructibility, where I argued that it can change how we look upon the axiom of constructibility and whether this axiom should be considered βrestrictive,β as it often is in set theory. Ultimately, the Barwise extension theorem shows how wrong a model of set theory can be, if we should entertain the idea that the set-theoretic universe continues growing beyond it.
Regarding my new proof, below, however, what I find especially interesting about it, if not surprising in light of (i) above, is that it makes no use of Barwise compactness or completeness and indeed, no use of infinitary logic at all! Instead, the new proof uses only classical methods of descriptive set theory concerning the representation of Ξ 11 sets with well-founded trees, the Levy and Shoenfield absoluteness theorems, the reflection theorem and the Keisler-Morley theorem on elementary extensions via definable ultrapowers. Like the Barwise proof, my proof splits into cases depending on whether the model π is standard or nonstandard, but another interesting thing about it is that with my proof, it is the π-nonstandard case that is easier, whereas with the Barwise proof, the transitive case was easiest, since one only needed to resort to the admissible cover when π was ill-founded. Barwise splits into cases on well-founded/ill-founded, whereas in my argument, the cases are π-standard/π-nonstandard.
Set theory, of course, overflows with instances of end-extensions. For example, the rank-initial segments ππΌ end-extend to their higher instances ππ½, when πΌ<π½; similarly, the hierarchy of the constructible universe πΏπΌβπΏπ½ are end-extensions; indeed any transitive set end-extends to all its supersets. The set-theoretic universe π is an end-extension of the constructible universe πΏ and every forcing extension πβ‘[πΊ] is an end-extension of its ground model π, even when nonstandard. (In particular, one should not confuse end-extensions with rank-extensions, also known as top-extensions, where one insists that all the new sets have higher rank than any ordinal in the smaller model.)
If π happens to be countable in πΏπ+, then consider the assertion, βthere is a countable transitive set, such that the assertion that it has no end-extension to a model of ZFC+π=πΏ has rank π.β This is a Ξ£1 assertion, since it is witnessed by the countable transitive set and the ranking function of the tree associated with the non-extension assertion. Since the parameters are countable, it follows by Levy reflection that the statement is true in πΏπ+. So πΏπ+ has a countable transitive set, such that the assertion that it has no end-extension to a model of ZFC+π=πΏ has rank π. But since π is actually well-founded, the statement would have to be actually true; but it isnβt, since πΏπ+ itself is such an extension, a contradiction.
So we may assume π is uncountable in π+. In this case, since π was actually well-ordered, it follows that πΏπ is well-founded beyond its π1. Consider the statement βthere is a countable transitive set having no end-extension to a model of ZFC+π=πΏ.β This is a Ξ£12 sentence, which is true in π+β‘[πΊ] by our assumption about π, and so by Shoenfield absoluteness, it is true in πΏπ+ and hence also πΏπ. So πΏπ thinks there is a countable transitive set π having no end-extension to a model of ZFC+π=πΏ. This is a Ξ 11 assertion about π, whose truth is witnessed in πΏπ by a ranking of the associated tree. Since this rank would be countable in πΏπ and this model is well-founded up to its π1, the tree must be actually well-founded. But this is impossible, since it is not actually true that π has no such end-extension, since πΏπ itself is such an end-extension of π. Contradiction. β»
One can prove a somewhat stronger version of the theorem, as follows.
Theorem. For any countable model π of ZF, with an inner model πβ§ZFC, and any statement π true in π, there is an end-extension of π to a model of ZFC+π. Furthermore, one can arrange that every set of π is countable in the extension model.
In particular, one can find end-extensions of ZFC+π=πΏ+π, for any statement π true in πΏπ.
Proof. Carry out the same proof as above, except in all the statements, ask for end-extensions of ZFC+π, instead of end-extensions of ZFC+π=πΏ, and also ask that the set in question become countable in that extension. The final contradictions are obtained by the fact that the countable transitive sets in πΏπ do have end-extensions like that, in which they are countable, since π is such an end-extension. β»
For example, we can make the following further examples.
Corollaries.
Every countable model π of ZFC with a measurable cardinal has an end-extension to a model π of ZFC+π=πΏβ‘[π].
Every countable model π of ZFC with extender-based large cardinals has an end-extension to a model π satisfying ZFC+π=πΏβ‘[βπΈ].
Every countable model π of ZFC with infinitely many Woodin cardinals has an end-extension to a model π of ZF+AD+π=πΏβ‘(β).
And in each case, we can furthermore arrange that every set of π is countable in the extension model π.
This proof grew out of a project on the Ξ£1-definable universal finite set, which I am currently undertaking with Kameryn Williams and Philip Welch.
Can set-theoretic mereology serve as a foundation of mathematics?
Abstract: Mereology, the study of the relation of part to whole, is often contrasted with set theory and its membership relation, the relation of element to set. Whereas set theory has found comparative success in the foundation of mathematics, since the time of Cantor, Zermelo and Hilbert, mereology is strangely absent. Can a set-theoretic mereology, based upon the set-theoretic inclusion relation β rather than the element-of relation β, serve as a foundation of mathematics? Can we faithfully interpret arbitrary mathematical structure in terms of the subset relation to the same extent that set theorists have done so in terms of the membership relation? At bottom, the question is: can we get by with merely β in place of β? Ultimately, I shall identify grounds supporting generally negative answers to these questions, concluding that set-theoretic mereology by itself cannot serve adequately as a foundational theory.
Please join me in Prague! See the Call for Papers, requesting contributed papers and contributed symposia on twenty different thematic sections, from mathematical and philosophical logic to the philosophy of science, philosophy of computing and many other areas. I am given to understand that this will be a large meeting, with about 800 participants expected.
J. D. Hamkins and M. Kikuchi, βSet-theoretic mereology,β Logic and Logical Philosophy, special issue βMereology and beyond, part IIβ, vol. 25, iss. 3, pp. 285-308, 2016.
Abstract. The principle of open class determinacy is preserved by pre-tame class forcing, and after such forcing, every new class well-order is isomorphic to a ground-model class well-order. Similarly, the principle of elementary transfinite recursion ETRΞ for a fixed class well-order Ξ is preserved by pre-tame class forcing. The full principle ETR itself is preserved by countably strategically closed pre-tame class forcing, and after such forcing, every new class well-order is isomorphic to a ground-model class well-order. Meanwhile, it remains open whether ETR is preserved by all forcing, including the forcing merely to add a Cohen real.
The principle of elementary transfinite recursion ETR β according to which every first-order class recursion along any well-founded class relation has a solution β has emerged as a central organizing concept in the hierarchy of second-order set theories from GΓΆdel-Bernays set theory GBC up to Kelley-Morse set theory KM and beyond. Many of the principles in the hierarchy can be seen as asserting that certain class recursions have solutions.
In addition, many of these principles, including ETR and its variants, are equivalently characterized as determinacy principles for certain kinds of class games. Thus, the hierarchy is also fruitfully unified and organized by determinacy ideas.
This hierarchy of theories is the main focus of study in the reverse mathematics of second-order set theory, an emerging subject aiming to discover the precise axiomatic principles required for various arguments and results in second-order set theory. The principle ETR itself, for example, is equivalent over GBC to the principle of clopen determinacy for class games and also to the existence of iterated elementary truth predicates (see Open determinacy for class games); since every clopen game is also an open game, the principle ETR is naturally strengthened by the principle of open determinacy for class games, and this is a strictly stronger principle (see Hachtman and Sato); the weaker principle ETROrd, meanwhile, asserting solutions to class recursions of length Ord, is equivalent to the class forcing theorem, which asserts that every class forcing notion admits a forcing relation, to the existence of set-complete Boolean completions of any class partial order, to the existence of Ord-iterated elementary truth predicates, to the determinacy of clopen games of rank at most Ord+1, and to other natural set-theoretic principles (see The exact strength of the class forcing theorem).
Since one naturally seeks in any subject to understand how oneβs fundamental principles and tools interact, we should like in this article to consider how these second-order set-theoretic principles are affected by forcing. These questions originated in previous work of Victoria Gitman and myself, and question 1 also appears in the dissertation of Kameryn Williams, which was focused on the structure of models of second-order set theories.
It is well-known, of course, that ZFC, GBC, and KM are preserved by set forcing and by tame class forcing, and this is true for other theories in the hierarchy, such as GBC+Ξ 1π-comprehension and higher levels of the second-order comprehension axiom. The corresponding forcing preservation theorem for ETR and for open class determinacy, however, has not been known.
Question 1. Is ETR preserved by forcing?
Question 2. Is open class determinacy preserved by forcing?
We intend to ask in each case about class forcing as well as set forcing. Question 1 is closely connected with the question of whether forcing can create new class well-order order types, longer than any class well-order in the ground model. Specifically, Victoria Gitman and I had observed earlier that ETRΞ for a specific class well-order Ξ is preserved by pre-tame class forcing, and this would imply that the full principle ETR would also be preserved, if no fundamentally new class well-orders are created by the forcing. In light of the fact that forcing over models of ZFC adds no new ordinals, that would seem reasonable, but proof is elusive, and the question remains open. Can forcing add new class well-orders, perhaps very tall ones that are not isomorphic to any ground model class well-order? Perhaps there are some very strange models of GBC that gain new class well-order order types in a forcing extension.
Question 3. Assume GBC. After forcing, must every new class well-order be isomorphic to a ground-model class well-order? Does ETR imply this?
The main theorem of this article provides a full affirmative answer to question 2, and partial affirmative answers to questions 2 and 3.
Main Theorem.
Open class determinacy is preserved by pre-tame class forcing. After such forcing, every new class well-order is isomorphic to a ground-model class well-order.
The principle ETRΞ, for any fixed class well order Ξ, is preserved by pre-tame class forcing.
The full principle ETR is preserved by countably strategically closed pre-tame class forcing. After such forcing, every new class well-order is isomorphic to a ground-model class well-order.
We should like specifically to highlight the fact that questions 1 and 3 remain open even in the case of the forcing to add a Cohen real. Is ETR preserved by the forcing to add a Cohen real? After adding a Cohen real, is every new class well-order isomorphic to a ground-model class well-order? One naturally expects affirmative answers, especially in a model of ETR.
For more, click through the arxiv for a pdf of the full article.
Paul K. Gorbow successfully defended his dissertation, βSelf-similarity in the foundationsβ on June 14, 2018 at the University of Gothenburg in the Department of Philosophy, Linguistics and Theory of Science, under the supervision of Ali Enayat, with Peter LeFanu Lumsdaine and Zachiri McKenzie serving as secondary supervisors. The defense opponent was Roman Kossak, with a dissertation committee consisting of Jon Henrik Forssell, Joel David Hamkins (myself) and Vera Koponen, chaired by Fredrik EngstrΓΆm. Congratulations!
Paul K. Gorbow, βSelf-similarity in the foundations,β PhD dissertation for the University of Gothenburg, Acta Philosophica Gothoburgensia 32, June 2018. (arxiv:1806.11310)
Abstract. This thesis concerns embeddings and self-embeddings of foundational structures in both set theory and category theory.
The first part of the work on models of set theory consists in establishing a refined version of Friedmanβs theorem on the existence of embeddings between countable non-standard models of a fragment of ZF, and an analogue of a theorem of Gaifman to the effect that certain countable models of set theory can be elementarily end-extended to a model with many automorphisms whose sets of fixed points equal the original model. The second part of the work on set theory consists in combining these two results into a technical machinery, yielding several results about non-standard models of set theory relating such notions as self-embeddings, their sets of fixed points, strong rank-cuts, and set theories of different strengths.
The work in foundational category theory consists in the formulation of a novel algebraic set theory which is proved to be equiconsistent to New Foundations (NF), and which can be modulated to correspond to intuitionistic or classical NF, with or without atoms. A key axiom of this theory expresses that its structures have an endofunctor with natural properties.
In the Swedish style of dissertation defense, the opponent (in this case Roman Kossak) summarizes the dissertation, placing it in a broader context, and then challenges various parts of it, probing the candidateβs expertise in an extended discussion. What a pleasure it was to see this. After this, there is a broader discussion, in which the committee is also involved.
I had the pleasure a few weeks ago to visit my daughterβs math class at her school and undertake some fun math activities with the sixth graders (11-12 years old, all girls). What a fun time we had!
The topic was orthoprojection and the problem of gaining insight about a three-dimensional object or arrangement by considering its various orthogonal projections.
I had prepared a collection of puzzles, which I hoped would challenge the students in spatial reasoning and abstract visualization. (The puzzle booklet is available at bottom.)
The puzzles involved orthogonal projections of arrangements of unit blocks.
Unit blocks are a certain kind of wooden block toy, which exhibit precise dimensions in the ratio 1Γ2Γ4. Thus, two blocks oriented the shortest way combine to the same thickness as one block oriented another way, or half the long length. Thus, unit blocks can be flexibly combined in a huge variety of combinations, often leading to aesthetically pleasing and structurally sound creations. Furthermore, they are thought to help develop a childβs intuition for fractions in a completely natural and meaningful way, for often the child needs to build a support exactly 1/4 unit or 1/2 unit taller, or what have you, and in this way they are lead to see how the fractional units combine into wholes.
Unit blocks are commonly found in American elementary schools and kindergartens, and woodworkers will recognize that one can make a set of unit blocks from standard 2Γ4 lumber, available at most hardware stores. You can also buy unit block sets online, and my opinion is that one hardly needs any other toy than a good unit block set for any child aged 2 to 102. Note: you do NOT need any of the fancy extra blocks, not following the unit block standard, that are sometimes available with these sets; the quality of play is best with just the unit blocks and half-units, thin units, double units and so on.
For the school visit, I brought sufficient blocks for all the girls, and explained that we were going to play with blocks the way that a mathematician or engineer might play with blocks. And I had prepared a puzzle booklets, one per child (available below).
For each puzzle, one sees the front view, top view and right side view of an arrangement of blocks, and the challenge is to assemble the blocks so as to realize those views. (In these puzzles, I had chosen not to show any hidden lines and edges, to make them slightly more challenging, although it would also make sense to me to show hidden lines; it would be customary to do so with a dashed line.)
Here are a few more easy examples with solutions.
It happens that these front/top/side projection view problems inspire some deep feelings in me, for they remind me of my father, who was a mechanical engineer. In my childhood, he would often bring home and show me blueprints of the machines or machine parts that he was designing or working with, and those blueprints were filled with front/top/side projection views of the various parts. In pre-computer design days, engineers would specify their machine parts by providing the various othogonal views. (Nowadays, of course, computers are used and one can compute orthogonal and perspective views from any angle.)
In my daughterβs classroom, the students took up the puzzles with a seriousness that shocked me. Once we had passed out the puzzle booklets and distributed the blocks, the girls just steamed through the puzzles, one after the other, totally absorbed. They didnβt want any hints or advice or help of any kind; they just went from each puzzle to the next, solving them one after another. There were a sufficient number of blocks for them all to work on the smaller puzzles on their own, but for the larger puzzles, they formed groups and combined blocks. It was a big success!
Without further ado, here are your puzzles. Iβve also included some photos below, out of order, and some puzzles do not match a photo and vice versa, but you can look at the photos if you need a hint.
After these puzzles, we moved on to the inverse problem. The girls made their own arrangement of blocks and then drew all six orthogonal projections: front, top, right, left, back, bottom. You can draw them on the net of a cube, so that you can imagine folding the cube so as to realize the projections.
And after this, we moved beyond unit blocks to more general shapes. Can you solve the following projection puzzles?
The most challenging puzzle was the following. Can you imagine a solid that appears as a square from the front, a circle from the top and a triangle from the right side?
(And here is an alternative version of the puzzles made by David Butler for use with Jenga blocks, which have the dimensional ratios 3Γ5Γ15 rather than 1Γ2Γ4: Jenga Views; see also this Twitter thread.)
Abstract. Providing a set-theoretic analogue of the universal algorithm, I shall define a certain finite set in set theory {π₯β£πβ‘(π₯)}
and prove that it exhibits a universal extension property: it can be any desired particular finite set in the right set-theoretic universe and it can become successively any desired larger finite set in top-extensions of that universe. Specifically, ZFC proves the set is finite; the definition π has complexity Ξ£2 and therefore any instance of it πβ‘(π₯) is locally verifiable inside any sufficiently large ππ; the set is empty in any transitive model; and if π defines the set π¦ in some countable model π of ZFC and π¦βπ§ for some finite set π§ in π, then there is a top-extension of π to a model π of ZFC in which π defines the new set π§. I shall draw out consequences of the universal finite set for set-theoretic potentialism and discuss several issues it raises in the philosophy of set theory.
The talk will include joint work with W. Hugh Woodin, Γystein Linnebo and others.
Consider what I call the Sudoku game, recently introduced in the MathOverflow question Who wins two-player Sudoku? posted by Christopher King (user PyRulez). Two players take turns placing numbers on a Sudoku board, obeying the rule that they must never explicitly violate the Sudoku condition: the numbers on any row, column or sub-board square must never repeat. The first player who cannot continue legal play loses. Who wins the game? What is the winning strategy?
The game is not about building a global Sudoku solution, since a move can be legal in this game even when it is not part of any global Sudoku solution, provided only that it doesnβt yet explicitly violate the Sudoku condition. Rather, the Sudoku game is about trying to trap your opponent in a maximal such position, a position which does not yet explicitly violate the Sudoku condition but which cannot be further extended.
In my answer to the question on MathOverflow, I followed an idea suggested to me by my daughter Hypatia, namely that on even-sized boards π2Γπ2 where π is even, then the second player can win with a mirroring strategy: simply copy the opponentβs moves in reflected mirror image through the center of the board. In this way, the second player ensures that the position on the board is always symmetric after her play, and so if the previous move was safe, then her move also will be safe by symmetry. This is therefore a winning strategy for the second player, since any violation of the Sudoku condition will arise on the opponentβs play.
This argument works on even-sized boards precisely because the reflection of every row, column and sub-board square is a totally different row, column and sub-board square, and so any new violation of the Sudoku conditions would reflect to a violation that was already there. The mirror strategy definitely does not work on the odd-sized boards, including the main 9Γ9 case, since if the opponent plays on the central row, copying directly would immediately introduce a Sudoku violation.
After posting that answer, Orson Peters (user orlp) pointed out that one can modify it to form a winning strategy for the first player on odd-sized boards, including the main 9Γ9 case. In this case, let the first player begin by playing 5 in the center square, and then afterwards copy the opponentβs moves, but with the tenβs complement at the reflected location. So if the opponent plays π₯, then the first player plays 10βπ₯ at the reflected location. In this way, the first player can ensure that the board is tenβs complement symmetric after her moves. The point is that again this is sufficient to know that she will never introduce a violation, since if her 10βπ₯ appears twice in some row, column or sub-board square, then π₯ must have already appeared twice in the reflected row, column or sub-board square before that move.
This idea is fully general for odd-sized Sudoku boards π2Γπ2, where π is odd. If π=2β’πβ1, then the first player starts with π in the very center and afterward plays the 2β’π-complement of her opponentβs move at the reflected location.
Conclusion.
On even-sized Sudoku boards, the second player wins the Sudoku game by the mirror copying strategy.
On odd-sized Sudoku boards, the first players wins the Sudoku game by the complement-mirror copying strategy.
Note that on the even boards, the second player could also play complement mirror copying just as successfully.
What I really want to tell you about, however, is the infinite Sudoku game (following a suggestion of Sam Hopkins). Suppose that we try to play the Sudoku game on a board whose subboard squares are β€Γβ€, so that the full board is a β€Γβ€ array of those squares, making β€2Γβ€2 altogether. (Or perhaps you might prefer the board β2Γβ2?)
One thing to notice is that on an infinite board, it is no longer possible to get trapped at a finite stage of play, since every finite position can be extended simply by playing a totally new label from the set of labels; such a move would never lead to a new violation of the explicit Sudoku condition.
For this reason, I should like to introduce the Sudoku Solver-Spoiler game variation as follows. There are two players: the Sudoku Solver and the Sudoku Spoiler. The Solver is trying to build a global Sudoku solution on the board, while the Spoiler is trying to prevent this. Both players must obey the Sudoku condition that labels are never to be explicitly repeated in any row, column or sub-board square. On an infinite board, the game proceeds transfinitely, until the board is filled or there are no legal moves. The Solver wins a play of the game, if she successfully builds a global Sudoku solution, which means not only that every location has a label and there are no repetitions in any row, column or sub-board square, but also that every label in fact appears in every row, column and sub-board square. That is, to count as a solution, the labels on any row, column and sub-board square must be a bijection with the set of labels. (On infinite boards, this is a stronger requirement than merely insisting on no repetitions.)
The Solver-Spoiler game makes sense in complete generality on any set π, whether finite or infinite. The sub-boards are π2=πΓπ, and one has an πΓπ array of them, so π2Γπ2 for the whole board. Every row and column has the same size as the sub-board square π2, and the set of labels should also have this size.
Upon reflection, one realizes that what matters about π is just its cardinality, and we really have for every cardinal π the π -Sudoku Solver-Spoiler game, whose board is π 2Γπ 2, a π Γπ array of π Γπ sub-boards. In particular, the game β€2Γβ€2 is actually isomorphic to the game β2Γβ2, despite what might feel initially like a very different board geometry.
What I claim is that the Solver has a winning strategy in the all the infinite Sudoku Solver-Spoiler games, in a very general and robust manner.
Theorem. For every infinite cardinal π , the Solver has a winning strategy to win the π -Sudoku Solver-Spoiler game.
The strategy will win in π many moves, producing a full Sudoku solution.
The Solver can win whether she goes first or second, starting from any legal position of size less than π .
The Solver can win even when the Spoiler is allowed to play finitely many labels at once on each turn, or fewer than π many moves (if π is regular), even if the Solver is only allowed one move each turn.
In the countably infinite Sudoku game, the Solver can win even if the Spoiler is allowed to make infinitely many moves at once, provided only that the resulting position can in principle be extended to a full solution.
Proof. Consider first the countably infinite Sudoku game, and assume the initial position is finite and that the Spoiler will make finitely many moves on each turn. Consider what it means for the Solver to win at the limit. It means, first of all, that there are no explicit repetitions in any row, column or sub-board. This requirement will be ensured since it is part of the rules for legal play not to violate it. Next, the Solver wants to ensure that every square has a label on it and that every label appears at least once in every row, every column and every sub-board. If we think of these as individual specific requirements, we have countably many requirements in all, and I claim that we can arrange that the Solver will simply satisfy the ππ‘β’β requirement on her ππ‘β’β play. Given any finite position, she can always find something to place in any given square, using a totally new label if need be. Given any finite position, any row and any particular label π, since can always find a place on that row to place the label, which has no conflict with any column or sub-board, since there are infinitely many to choose from and only finitely many conflicts. Similarly with columns and sub-boards. So each of the requirements can always be fulfilled one-at-a-time, and so in π many moves she can produce a full solution.
The argument works equally well no matter who goes first or if the Spoiler makes arbitrary finite play, or indeed even infinite play, provided that the play is part of some global solution (perhaps a different one each time), since on each move the Solve can simply meet the requirement by using that solution at that stage.
An essentially similar argument works when π is uncountable, although now the play will proceed for π many steps. Assuming π 2=π , a consequence of the axiom of choice, there are π many requirements to meet, and the Solve can meet requirement πΌ on the πΌπ‘β’β move. If π is regular, we can again allow the Spoiler to make arbitrary size-less-than-π size moves, so that at any stage of play before π the position will still be size less than π . (If π is singular, one can allow Spoiler to make finitely many moves at once or indeed even some uniform bounded size πΏ<π many moves at once. β»
I find it interesting to draw out the following aspect of the argument:
Observation. Every finite labeling of an infinite Sudoku board that does not yet explicitly violate the Sudoku condition can be extended to a global solution.
Similarly, any size less than π labeling that does not yet explicitly violate the Sudoku condition can be extended to a global solution of the π -Sudoku board for any infinite cardinal π .
What about asymmetric boards? It has come to my attention that people sometimes look at asymmetric Sudoku boards, whose sub-boards are not square, such as in the six-by-six Sudoku case. In general, one could take Sudoku boards to consist of a πΓπ array of sub-boards of size π Γπ, where π and π are cardinals, not necessarily the same size and not necessarily both infinite or both finite. How does this affect the arguments Iβve given?
In the finite (πΓπ)Γ(πΓπ) case, if one of the numbers is even, then it seems to me that the reflection through the origin strategy works for the second player just as before. And if both are odd, then the first player can again play in the center square and use the mirror-complement strategy to trap the opponent. So that analysis will work fine.
In the case (π Γπ)Γ(πΓπ ) where πβ€π and π =πβ’π is infinite, then the proof of the theorem seems to break, since if π<π , then with only π many moves, say putting a common symbol in each of the π many rectangles across a row, we can rule out that symbol in a fixed row. So this is a configuration of size less than π that cannot be extended to a full solution. For this reason, it seems likely to me that the Spoiler can win the Sudoko Solver-Spoiler game in the infinite asymmetric case.
Finally, letβs consider the Sudoku Solver-Spoiler game in the purely finite case, which actually is a very natural game, perhaps more natural than what I called the Sudoku game above. It seems to me that the Spoiler should be able to win the Solver-Spoiler game on any nontrivial finite board. But I donβt yet have an argument proving this. I asked a question on MathOverflow: The Sudoku game: Solver-Spoiler variation.
Kameryn J. Williams successfully defended his dissertation under my supervision at the CUNY Graduate Center on April 6th, 2018, earning his Ph.D. degree in May 2018. He has accepted a position in mathematics at the University of Hawaii, to begin Fall 2018.
What a pleasure it was to work with Kameryn, an extremely talented mathematician with wide interests and huge promise.
Abstract.This dissertation is a contribution to the project of second-order set theory, which has seen a revival in recent years. The approach is to understand second-order set theory by studying the structure of models of second-order set theories. The main results are the following, organized by chapter. First, I investigate the poset of T-realizations of a fixed countable model of ZFC, where T is a reasonable second-order set theory such as GBC or KM, showing that it has a rich structure. In particular, every countable partial order embeds into this structure. Moreover, we can arrange so that these embedding preserve the existence/nonexistence of upper bounds, at least for finite partial orders. Second I generalize some constructions of Marek and Mostowski from KM to weaker theories. They showed that every model of KM plus the Class Collection schema βunrollsβ to a model of ZFCβ with a largest cardinal. I calculate the theories of the unrolling for a variety of second-order set theories, going as weak as GBC + ETR. I also show that being T-realizable goes down to submodels for a broad selection of second-order set theories T. Third, I show that there is a hierarchy of transfinite recursion principles ranging in strength from GBC to KM. This hierarchy is ordered first by the complexity of the properties allowed in the recursions and second by the allowed heights of the recurions. Fourth, I investigate the question of which second-order set theories have least models. I show that strong theoriesβsuch as KM or Ξ 11-CAβdo not have least transitive models, while weaker theoriesβfrom GBC to GBC + ETROrdβdo have least transitive models.
In addition to his dissertation work and the research currently arising out of it, Kameryn has undertaken a number of collaborations with various international research efforts, including the following:
He is co-author on a current joint project with Miha HabiΔ, myself, Daniel Klausner and Jonathan Verner concerning the nonamalgamation phenomenon in the generic multiverse of a countable model of set theory.
He is co-author on a current joint project with myself and Philip Welch concerning the universal Ξ£1-definable finite sequence, an analogue of the universal finite set, but for the constructible universe.
I was fascinated recently to discover something I hadnβt realized about relative interpretability in set theory, and Iβd like to share it here. Namely,
Different set theories extending ZF are never bi-interpretable!
For example, ZF and ZFC are not bi-interpretable, and neither are ZFC and ZFC+CH, nor ZFC and ZFC+Β¬CH, despite the fact that all these theories are equiconsistent. The basic fact is that there are no nontrivial instances of bi-interpretation amongst the models of ZF set theory. This is surprising, and could even be seen as shocking, in light of the philosophical remarks one sometimes hears asserted in the philosophy of set theory that what is going on with the various set-theoretic translations from large cardinals to determinacy to inner model theory, to mention a central example, is that we can interpret between these theories and consequently it doesnβt much matter which context is taken as fundamental, since we can translate from one context to another without loss.
The bi-interpretation result shows that these interpretations do not and cannot rise to the level of bi-interpretations of theories β the most robust form of mutual relative interpretability β and consequently, the translations inevitably must involve a loss of information.
To be sure, set theorists classify the various set-theoretic principles and theories into a hierarchy, often organized by consistency strength or by other notions of interpretative power, using forcing or definable inner models. From any model of ZF, for example, we can construct a model of ZFC, and from any model of ZFC, we can construct models of ZFC+CH or ZFC+Β¬CH and so on. From models with sufficient large cardinals we can construct models with determinacy or inner-model-theoretic fine structure and vice versa. And while we have relative consistency results and equiconsistencies and even mutual interpretations, we will have no nontrivial bi-interpretations.
(I had proved the theorem a few weeks ago in joint work with Alfredo Roque Freire, who is visiting me in New York this year. We subsequently learned, however, that this was a rediscovery of results that have evidently been proved independently by various authors. Albert Visser proves the case of PA in his paper, βCategories of theories and interpretations,β Logic in Tehran, 284β341, Lect. Notes Log., 26, Assoc. Symbol. Logic, La Jolla, CA, 2006, (pdf, see pp. 52-55). Ali Enayat gave a nice model-theoretic argument for showing specifically that ZF and ZFC are not bi-interpretable, using the fact that ZFC models can have no involutions in their automorphism groups, but ZF models can; and he proved the general version of the theorem, for ZF, second-order arithmetic π2 and second-order set theory KM in his 2016 article, A. Enayat, βVariations on a Visserian theme,β in Liber Amicorum Alberti : a tribute to Albert Visser / Jan van Eijck, Rosalie Iemhoff and Joost J. Joosten (eds.) Pages, 99-110. ISBN, 978-1848902046. College Publications, London. The ZF version was apparently also observed independently by Harvey Friedman, Visser and Fedor Pakhomov.)
Meanwhile, let me explain our argument. Recall from model theory that one theory π is interpreted in another theory π, if in any model of the latter theory πβ§π, we can define (and uniformly so in any such model) a certain domain πβππ and relations and functions on that domain so as to make π a model of π. For example, the theory of algebraically closed fields of characteristic zero is interpreted in the theory of real-closed fields, since in any real-closed field π , we can consider pairs (π,π), thinking of them as π+πβ’π, and define addition and multiplication on those pairs in such a way so as to construct an algebraically closed field of characteristic zero.
Two theories are thus mutually interpretable, if each of them is interpretable in the other. Such theories are necessarily equiconsistent, since from any model of one of them we can produce a model of the other.
Note that mutual interpretability, however, does not insist that the two translations are inverse to each other, even up to isomorphism. One can start with a model of the first theory πβ§π and define the interpreted model πβ§π of the second theory, which has a subsequent model of the first theory again Β―πβ§π inside it. But the definition does not insist on any particular connection between π and Β―π, and these models need not be isomorphic nor even elementarily equivalent in general.
By addressing this, one arrives at a stronger and more robust form of mutual interpretability. Namely, two theories π and π are bi-interpretable, if they are mutually interpretable in such a way that the models can see that the interpretations are inverse. That is, for any model π of the theory π, if one defines the interpreted model πβ§π inside it, and then defines the interpreted model Β―π of π inside π, then π is isomorphic to Β―π by a definable isomorphism in π, and uniformly so (and the same with the theories in the other direction). Thus, every model of one of the theories can see exactly how it itself arises definably in the interpreted model of the other theory.
For example, the theory of linear orders β€ is bi-interpretable with the theory of strict linear order <, since from any linear order β€ we can define the corresponding strict linear order < on the same domain, and from any strict linear order < we can define the corresponding linear order β€, and doing it twice brings us back again to the same order.
We are now ready to prove that this bi-interpretation situation does not occur with different set theories extending ZF.
Theorem. Distinct set theories extending ZF are never bi-interpretable. Indeed, there is not a single model-theoretic instance of bi-interpretation occurring with models of different set theories extending ZF.
I claim that π must think the ordinals of Β―π are well-founded, for otherwise it would have some bounded cut π΄ in the ordinals of Β―π with no least upper bound, and this set π΄ when pulled back pointwise by the isomorphism of π with Β―π would mean that π has a cut in its own ordinals with no least upper bound; but this cannot happen in ZF.
It follows that although ZF and ZFC are equiconsistent, they are not bi-interpretable. Similarly, ZFC and ZFC+CH and ZFC+Β¬CH are equiconsistent, but no pair of them is bi-interpretable. And again with all the various equiconsistency results concerning large cardinals.
A similar argument works with PA to show that different extensions of PA are never bi-interpretable.
This will be a talk for the Rutgers Logic Seminar, April 2, 2018. Hill Center, Busch campus.
Abstract. I shall define a certain finite set in set theory {π₯β£πβ‘(π₯)} and prove that it exhibits a universal extension property: it can be any desired particular finite set in the right set-theoretic universe and it can become successively any desired larger finite set in top-extensions of that universe. Specifically, ZFC proves the set is finite; the definition π has complexity Ξ£2 and therefore any instance of it πβ‘(π₯) is locally verifiable inside any sufficient ππ; the set is empty in any transitive model and others; and if π defines the set π¦ in some countable model π of ZFC and π¦βπ§ for some finite set π§ in π, then there is a top-extension of π to a model π in which π defines the new set π§. The definition can be thought of as an idealized diamond sequence, and there are consequences for the philosophical theory of set-theoretic top-extensional potentialism.
This will be a talk for the CUNY Set Theory Seminar, April 27, 2018, GC Room 6417, 10-11:45am (please note corrected date).
Abstract. Open class determinacy is the principle of second order set theory asserting of every two-player game of perfect information, with plays coming from a (possibly proper) class π and the winning condition determined by an open subclass of ππ, that one of the players has a winning strategy. This principle finds itself about midway up the hierarchy of second-order set theories between GΓΆdel-Bernays set theory and Kelley-Morse, a bit stronger than the principle of elementary transfinite recursion ETR, which is equivalent to clopen determinacy, but weaker than GBC+Ξ 11-comprehension. In this talk, Iβll given an account of my recent joint work with W. Hugh Woodin, proving that open class determinacy is preserved by forcing. A central part of the proof is to show that in any forcing extension of a model of open class determinacy, every well-founded class relation in the extension is ranked by a ground model well-order relation. This work therefore fits into the emerging focus in set theory on the interaction of fundamental principles of second-order set theory with fundamental set theoretic tools, such as forcing. It remains open whether clopen determinacy or equivalently ETR is preserved by set forcing, even in the case of the forcing merely to add a Cohen real.
This will be a talk for the CUNY Logic Workshop on March 23, 2018, GC 6417 2-3:30pm.
Abstract. Consider a countable model of set theory π in the context of all its successive forcing extensions and grounds. This generic multiverse has long been known to exhibit instances of nonamalgamation: one can have two extensions πβ‘[π] and πβ‘[π], both adding a merely a generic Cohen real, which have no further extension in common. In this talk, I shall describe new joint work that illuminates the extent of non-amalgamation: every finite partial order (and more) embeds into the generic multiverse over any given model in a way that preserves amalgamability and non-amalgamability. The proof uses the set-theoretic blockchain argument (pictured above), which has affinities with constructions in computability theory in the Turing degrees. Other arguments, which also resemble counterparts in computability theory, show that the generic multiverse exhibits the exact pair phenonemon for increasing chains. This is joint work with Miha HabiΔ, myself, Lukas Daniel Klausner and Jonathan Verner. The paper will be available this Spring.