Professional Documents
Culture Documents
MONTHLY
VOLUME 120, NO. 9 NOVEMBER 2013
NOTES
REVIEWS
Member Plus
$169/Year
$249/Year
Individual Member Benefits
$35/Year
Member
Student
Electronic Subscriptions to:
• The American Mathematical Monthly (10 per year) X X X
• The College Mathematics Journal (5 per year) X X X
• Mathematics Magazine (5 per year) X X X
• Math Horizons (4 per year) X X X
• MAA FOCUS (6 per year) X X X
MONTHLY
Volume 120, No. 9 November 2013
EDITOR
Scott T. Chapman
Sam Houston State University
ASSOCIATE EDITORS
William Adkins Ulrich Krause
Louisiana State University Universität Bremen
David Aldous Jeffrey Lawson
University of California, Berkeley Western Carolina University
Elizabeth Allman C. Dwight Lahr
University of Alaska, Fairbanks Dartmouth College
Jonathan M. Borwein Susan Loepp
University of Newcastle Williams College
Jason Boynton Irina Mitrea
North Dakota State University Temple University
Edward B. Burger Bruce P. Palka
Williams College National Science Foundation
Minerva Cordero-Epperson Vadim Ponomarenko
University of Texas, Arlington San Diego State University
Beverly Diamond Catherine A. Roberts
College of Charleston College of the Holy Cross
Allan Donsig Rachel Roberts
University of Nebraska, Lincoln Washington University, St. Louis
Michael Dorff Ivelisse M. Rubio
Brigham Young University Universidad de Puerto Rico, Rio Piedras
Daniela Ferrero Adriana Salerno
Texas State University Bates College
Luis David Garcia-Puente Edward Scheinerman
Sam Houston State University Johns Hopkins University
Sidney Graham Susan G. Staples
Central Michigan University Texas Christian University
Tara Holm Dennis Stowe
Cornell University Idaho State University
Roger A. Horn Daniel Ullman
University of Utah George Washington University
Lea Jenkins Daniel Velleman
Clemson University Amherst College
Daniel Krashen
University of Georgia
EDITORIAL ASSISTANT
Bonnie K. Ponce
NOTICE TO AUTHORS Proposed problems or solutions should be sent to:
The MONTHLY publishes articles, as well as notes and DOUG HENSLEY, MONTHLY Problems
other features, about mathematics and the profes- Department of Mathematics
sion. Its readers span a broad spectrum of math- Texas A&M University
ematical interests, and include professional mathe- 3368 TAMU
maticians as well as students of mathematics at all College Station, TX 77843-3368
collegiate levels. Authors are invited to submit arti-
cles and notes that bring interesting mathematical
In lieu of duplicate hardcopy, authors may submit
ideas to a wide audience of MONTHLY readers.
pdfs to monthlyproblems@math.tamu.edu.
The MONTHLY’s readers expect a high standard of ex-
position; they expect articles to inform, stimulate,
challenge, enlighten, and even entertain. MONTHLY Advertising Correspondence:
articles are meant to be read, enjoyed, and dis- MAA Advertising
cussed, rather than just archived. Articles may be 1529 Eighteenth St. NW
expositions of old or new results, historical or bio- Washington DC 20036
graphical essays, speculations or definitive treat-
ments, broad developments, or explorations of a Phone: (877) 622-2373
single application. Novelty and generality are far E-mail: tmarmor@maa.org
less important than clarity of exposition and broad
appeal. Appropriate figures, diagrams, and photo- Further advertising information can be found online
graphs are encouraged. at www.maa.org
Notes are short, sharply focused, and possibly infor- Change of address, missing issue inquiries, and
mal. They are often gems that provide a new proof other subscription correspondence:
of an old theorem, a novel presentation of a familiar MAA Service Center, maahq@maa.org
theme, or a lively discussion of a single issue.
All at the address:
Submission of articles, notes, and filler pieces is re-
quired via the MONTHLY’s Editorial Manager System. The Mathematical Association of America
Initial submissions in pdf or LATEX form can be sent 1529 Eighteenth Street, N.W.
to the Editor Scott Chapman at Washington, DC 20036
Abstract. We explore the geometric and measure-theoretic properties of a set built by stacking
central Cantor sets with continuously varying scaling factors. By using self-similarity, we
are able to describe its main features in a fairly complete way. We show that it is made of
an uncountable number of analytic curves, compute the exact areas of the gaps of all sizes,
and show that its Hausdorff and box-counting dimensions are both equal to 2. It provides a
particularly good example to introduce and showcase these notions because of the beauty and
simplicity of the arguments. Our derivation of explicit formulas for the areas of all of the gaps
is elementary enough to be explained to first-year calculus students.
1. INTRODUCTION. Consider the beautiful and striking set illustrated below (let
us call the set CH).
Clearly, this set has an intricate recursive (fractal) structure. In fact, it is constructed
by “stacking” Cantor sets (in a way that we will describe momentarily). We can also
see that CH is composed of uncountably many “strands.” Our name for the set (and this
paper) was inspired by the resemblance of CH to the braided hair of the first author’s
wife, Christiane.
Here we explore some geometric and measure-theoretic properties of this set. We
hope to convince you that the elegance and simplicity of the geometrical arguments
are just as striking as the set itself.
In this figure, we can see the binary structure of C clearly. In fact, C is homeomor-
phic to the countably infinite product of the two-point discrete space {0, 1}. We can
think of a point x ∈ C as resulting from an infinite sequence of choices of left or right.
Each such sequence of choices selects some nested sequence of closed intervals, one
from each stage of the construction. The intersection of the resulting nested intervals
is always a single point. As an example, choosing the left interval at each stage will
result in the point x = 0.
It is simple to modify the construction of C where we remove some other (fixed)
ratio of the length at each stage. We shall denote by C y the set obtained by removing
the length 1 − 2y in the middle of [0, 1] at the first stage, where y ranges in [0, 1/2]
(the reason for our funny choice of 1 − 2y for the gap length will become evident
below). At the nth stage, we have 2n−1 remaining closed intervals, each of length y n−1 ,
and we remove a length of (1 − 2y)y n−1 from the middle of each interval. We build in
this way a continuum of Cantor sets, where the size of the gaps at each stage decrease.
When y = 1/2, we do not remove anything, so the resulting set is just the interval
[0, 1], while for y = 0, the resulting set is reduced to the two-point set {0, 1}. The
vertical stacking of the C y for y from 0 to 1/2 is our set CH.
More formally,
Another, more useful way of constructing the classical Cantor C set is by using an
Iterated Function System (IFS) [1, 7]. Consider the two functions
C = w0 (C ) ∪ w1 (C ), (2)
with the union being disjoint. This self-tiling or self-similarity property uniquely de-
fines C in that if A ⊂ R is any non-empty compact set with A = w0 (A) ∪ w1 (A),
then it must be the case that A = C . Moreover, the set-valued mapping Ŵ given by
Ŵ (A) = w0 (A) ∪ w1 (A) is contractive in the Hausdorff metric, and thus we have that
Ŵ n (A) converges to C for any non-empty compact A ⊂ R. Note that Figure 1 illus-
trates this convergence with the initial set A = [0, 1]. The first line in the figure shows
and
( )
X
w1 (A) = tn 3−n : t1 = 2, tn ∈ {0, 2} for n ≥ 2 ,
n≥1
so A = w0 (A) ∪ w1 (A). However, this means that A = C and so (3) gives an explicit
description of C . For the more general situation below it will be useful to notice that
(∞ ) ( )
X X
tn 3−n : tn ∈ {0, 2} = 2 bn 3−n : bn ∈ {0, 1} .
n=1 n≥1
The sets C y introduced above are also obviously associated to iterated functions
systems. For y ∈ [0, 1/2], we define the two maps
Thus, y = 1/3 yields the classical Cantor set, while y = 1/4 yields a middle-1/2
version of it.
Each C y is the unique invariant set under the two maps given in (4). The entire set
CH is invariant under these two maps as well, where we now think of these maps as
w0 , w1 : [0, 1] × [0, 1/2] → [0, 1] × [0, 1/2], abusing notation slightly. The reason
we label the sets C y is that y is the contraction factor for the IFS (4). Labeling each
Cantor set in the “stack” by its central gap length would result in messier formulas for
the IFS (4).
Comments. The set CH is a nice illustration of how the interval [0, 1] has been
“ripped apart” to form the classical Cantor set. We see at all the points with two binary
representations, that is, all the numbers of the form i/2 j , that the interval [0, 1] has
been cut and a gap has been inserted. More specifically, a gap of length 3−n has been
inserted at each point of the form i/2n for 1 ≤ i ≤ 2n − 1. It is best to think of this
construction in stages, as with the usual construction. First, we cut at x = 1/2, scale
each half by a factor of 2/3, and insert a gap of length 1/3. The scaling preserves the
total length. Then, we cut at the points that originally had coordinates 1/4 and 3/4
(now their coordinates have changed), scale each part by 2/3, and insert gaps of length
1/9. This is illustrated in CH where a gap originates at each dyadic point on the top
line, when they are inserted.
(x, y) : x ∈ C y , y ∈ S .
The set CH is the maximal compact invariant subset of [0, 1] × [0, 1/2] in the sense
that it contains any other invariant compact subset.
We make no claim to having discovered CH. For instance, Mandelbrot’s book [10]
contains a version of it on page 81.
3. AREAS OF THE “GAPS” OF CH. The starting point for this paper was a sur-
prising (to us) observation about the areas of the “gaps” of CH. In this short section we
explain this pretty little geometric fact, which is simple enough that it can be explained
to calculus students.
Consider the set CH as enclosed in the box [0, 1] × [0, 1/2]. We notice that the
central “gap” (or void) is a triangle with base length equal to one and height equal to
1/2. (It is worthwhile spending a little bit of time understanding why it is actually a
triangle.)
Remarkably, it is possible to exactly compute the areas of all the other gaps, even
though their sides are complicated curves. For instance, each of the two “stage-two”
gaps (the images of the central gap) have area
Z y=1/2
1
(1 − 2y)y dy = .
y=0 24
How does this formula arise? We see that for any given y, the set C y has a central
gap of length 1 − 2y. Because of the self-similarity of C y under the two maps given
in (4), the common length of the next largest gaps (one on either side of the central
gap) is equal to y(1 − 2y). Thus, the area of either one of these “stage-two” gaps is
the integral of this length over y.
In a similar way, the “nth-stage” gaps all have area equal to
y=1/2
2−n
Z
(1 − 2y)y n−1 dy = .
y=0 n(n + 1)
We can check that the sum of these areas is, in fact, equal to 1/2, the entire area of the
rectangle. Computing this sum is simple, as it is a telescoping sum:
X 2−n 1X1 1 1
2n−1 = − = .
n≥1
n(n + 1) 2 n≥1 n n + 1 2
The order of composition in (5) is very important. Notice the difference between
and
exists and is uniform in y (since the contraction factor is uniformly bounded in y). For
the “hairs,” our starting functions are f 0 (y) = 0 for all y (the left edge) and f 1 (y) = 1
for all y. The two edges of the central triangle are φ0 ( f 1 )(y) = y and φ1 ( f 0 )(y) =
1 − y. Then the boundaries of the next gap (at “stage” 2) are the four curves:
From these considerations, it is clear that all the boundary curves of any of the nth
stage “gaps” are polynomials of degree n. The strand associated with σ ∈ 6 is the
uniform limit in (6) and is thus a continuous function. We show that all of the deriva-
tives converge uniformly as well. To do this, just notice that
(φ0 ( f ))0 (y) = y f 0 (y) + f (y) and (φ1 ( f ))0 (y) = y f 0 (y) + f (y) − 1. (7)
(φ0 ( f ))00 (y) = y f 00 (y) + 2 f 0 (y) and (φ1 ( f ))00 (y) = y f 00 (y) + 2 f 0 (y), (8)
(φ0 ( f ))(n) (y) = y f (n) (y) + n f (n−1) (y) = (φ1 ( f ))(n) (y). (9)
This means that we have two linear mappings on the function value and its first n
derivatives, which are given by
f y 0 0 ··· 0 0 f
f0 1 y 0 ··· 0 0 f 0
f 00 0 2 y ··· 0 0 f 00
80
.. =.
. .. .. . . .. .. ..
(10)
. . . . . . . .
(n−1) 0 (n−1)
f 0 0 ··· y 0 f
f (n) 0 0 0 ··· n y f (n)
and
f y 0 0 ··· 0 0 f 1−y
f0 1 y 0 ··· 0 0 f 0 −1
f 00 0 2 y ··· 0 0 f 00 0
81 .
.. =.
. .. .. . . .. .. .. ..
+ (11)
. . . . . . . . .
f (n−1) 0 0 0 ··· y 0 f (n−1) 0
f (n) 0 0 0 ··· n y f (n) 0
It is clear that both 80 and 81 are contractive (recall y ≤ 1/2), and thus the first n
derivatives converge uniformly as well. Since this is true for any n, Tσ (y) = φσ ( f )(y)
is a C ∞ function of y for any σ ∈ 6.
We note that it is easy to show from (7) that |Tσ0 (y)| ≤ 2 for all y and σ ∈ 6. This
fact will be important in section 5.
Thread functions are analytic. A completely different approach to the thread func-
tions shows that these functions are, in fact, real analytic.
n≥1
and
( )
X
w1 (Sy ) = (1 − y) + (1 − y)y −1
σn y n+1
:σ ∈6
n≥1
( )
X
= (1 − y)yy −1
+ (1 − y)y −1
σn y n+1
:σ ∈6
n≥1
( )
X
= (1 − y)y −1 αn y n : α ∈ 6, α1 = 1 .
n≥1
Thus Sy = w0 (Sy ) ∪ w1 (Sy ), and so Sy = C y . This means that the “thread function”
T : 6 × [0, 1/2] → [0, 1] is given by
X
Tσ (y) = (1 − y)y −1 σn y n . (13)
n≥1
Hausdorff dimension of A, such that for 0 ≤ s < dim H (A) we have h s (A) = +∞, and
for s > dim H (A) we have h s (A) = 0. Any A ⊂ Rd with non-empty interior has Haus-
dorff dimension d, so this notion agrees with our intuitive idea of dimension for nice
sets. However, unlike our intuitive notion of dimension, it is certainly possible for sets
to have fractional Hausdorff dimension. As an example, dim H (C y ) = − ln(2)/ ln(y).
The Hausdorff dimension is monotone (A ⊆ B implies dim H (A) ≤ dim H (B)) and
countably stable (dim H (∪i Ai ) = supi dim H (Ai )). Sets with 0 < h s (A) < ∞ are
called s-sets and have been extensively studied. In fact, any generalized Cantor set is
an s-set for an appropriate dimension function [2].
The measure h s has a nice behaviour under Lipschitz mappings, which will be very
important for us. Let f : Rd → Rd be Lipschitz (that is, k f (x) − f (y)k ≤ K kx − yk
for some constant K ), then | f (U )|s ≤ K s |U |s and thus h s ( f (A)) ≤ K s h s (A). In par-
ticular, dim H ( f (A)) ≤ dim H (A).
dim H (CH) = 2. We now show that the Hausdorff dimension of CH is equal to two.
One interesting feature is that “locally” the Hausdorff dimension of CH is strictly less
than two everywhere except in neighborhoods of the top line. Locally, CH is close to
being a product of a Cantor set (in the horizontal direction) with an interval (in the
vertical direction). The local geometry of CH varies considerably from top to bottom.
Clearly, the Hausdorff dimension of CH is at least one, since it contains many
smooth curves any of which have dimension equal to one. We show that it is actually
equal to two. For [a, b] ⊂ (0, 1/2) let K ab = ([0, 1] × [a, b]) ∩ CH. We show that
dim H (K ab ) ≥ 1 − ln(2)/ ln(a). By 7.2 in [3] we know that
dim H (Ca × [a, b]) ≥ dim H (Ca ) + dim H ([a, b]) = − ln(2)/ ln(a) + 1,
|φαβ (y) − φαβ (y)| = {|g 0 | : g 0 ⊂ [φαβ (x), φαβ (y)] is a stage-n gap in Cα }
XX
X (1 − 2α)α n−1 X
= {|g| : g ⊂ [x, y] is a stage-n gap in Cβ }
n
(1 − 2β)β n−1
1 − 2α
≤ |y − x|.
1 − 2β
In words, φαβ acts by mapping each gap in Cβ to its corresponding gap in Cα ; the
mapping of the gaps completely defines the action of φαβ on Cβ by “squeezing” each
point of Cβ between its corresponding gaps. Because of the decay rates for the gap
lengths in Cβ and Cα , the most stretching is done in mapping the largest gap in Cβ to
the largest gap in Cα .
We define our desired Lipschitz surjection 8 : K ab → Ca × [a, b] by
8(x, y) = (φay (x), y).
Let (x1 , y1 ), (x2 , y2 ) ∈ K ab with y1 < y2 . We note that, by construction, φay2 = φay1 ◦
φ yy12 . This suggests that we decompose the action of 8 by first moving (x2 , y2 ) to
(φ yy12 (x2 ), y2 ), and then mapping to (φay1 (φ yy12 (x2 )), y2 ). For the point (x1 , y1 ), we need
only do the one step to (φay1 (x1 ), x1 ). By the uniform bound |Tσ0 (y)| ≤ 2 (from above),
we see that
|x1 − φ yy12 (x2 )| ≤ |x1 − x2 | + 2|y1 − y2 | ≤ 2(|x1 − x2 | + |y1 − y2 |).
We know that the mapping φay1 : C y1 → Ca is Lipschitz, so altogether 8 is also Lip-
schitz. This proves that dim H (K ab ) ≥ 1 − ln(2)/ ln(a). Taking an = (1/2)(1 − 1/n)
and bn = (1/2 + an )/2, we see that
ln(2)
2 ≥ dim H (CH) ≥ dim H (K abnn ) ≥ 1 + n−1
, for all n > 1,
ln(2) − ln n
Thus the Hausdorff measures of all the K ab are equal to zero, in their dimension.
5.2. The box-counting dimension of CH. The Hausdorff measures have very nice
properties but are somewhat difficult to work with. This makes computing the Haus-
dorff dimension difficult as well. For these, as well as other reasons, many different di-
mensions and corresponding measures of the “size” of a set have been defined. Among
the simplest of these is the box-counting dimension (also called the box dimension).
Given a bounded subset A ⊂ Rd , let Nδ (A) be the smallest number of sets of diam-
eter δ > 0 that will cover A. The box-counting dimension measures the asymptotic
growth rate of Nδ (A) as δ decreases to zero, specifically by fitting a model of the form
Nδ (A) ∼ Cδ −s , and so
ln(Nδ (A))
dim B (A) = lim . (15)
δ→0 − ln(δ)
Of course the limit doesn’t have to exist, so in general we have the upper box dimension
and lower box dimension given by
Clearly, for a bounded A ⊂ Rd we have Nδ (A) = O(δ −d ), and so dim B (A) ≤ d. Fur-
thermore, since Nδ (cl(A)) = Nδ (A), we have that the box dimensions of A and cl(A)
agree, where cl(A) is the closure of the set A.
Unfortunately, the box dimensions are only finitely stable with dim B (∪i=1 k
Ai ) =
maxi dim B (Ai ). The set A = {0} ∪ {1/n : n ≥ 1} ⊂ R provides a nice counter-
example to countable stability, since dim B (A) = 1/2, as is easy to show.
We also have a simple relation between the Hausdorff and box-counting dimen-
sions. It is easy to see that h sδ (A) ≤ Nδ (A)δ s . Thus, if h s (A) > 1, then for sufficiently
small δ > 0 we have 0 < ln(Nδ (A)) + s ln(δ), and so s ≤ dim B (A). In particular, for
any s ≤ dim H (A), we have h s (A) = +∞, and so s ≤ dim B (A). This means that
Since Ld (Aδ ) ≤ cNδ (A)δ d for some constant c > 0 depending only on d, it’s not sur-
prising that there is a relationship between the box dimensions and the decay rate of
Ld (Aδ ), as δ tends to zero. In fact, for A ⊂ Rd we have
ln Ld (Aδ )
dim B (A) = d − lim sup (16)
δ ln(δ)
ln Ld (Aδ )
dim B (A) = d − lim inf . (17)
δ ln(δ)
An especially interesting class of sets is the one composed of the so-called Min-
kowski measurable sets. These are sets A ⊂ Rd such that limδ→0 Ld (Aδ )/δ d−s exists,
where s = dim B (A). The limit is then called the Minkowski content of A. Minkowski
measurable sets are analogues of s-sets for box-counting dimension.
and thus we just need to get an upper bound on the box dimension.
Our idea is to use an explicit expression for the length, L y (), of the -dilation of C y
for each y ∈ [a, b], and then integrate this over y to get an area. Now, this will give the
area of a “horizontal” dilation of K ab , and not a true -dilation of K ab . However, if V ()
is the area of the -dilation and Vh () is the area of our “horizontal” dilation, then
√
Vh () ≤ V () ≤ Vh 2 . (18)
So if Vh () → 0 as → 0, then the same is true for V (). For (18), it is important to
know that the derivative of any of the functions which parameterize the strands is uni-
formly bounded by 2. In particular, none of these curves are close to being horizontal.
This is also why it is reasonable to integrate the length of the dilation of C y and obtain
the area of the “horizontal” dilation of K ab . That is, for a random stacking of Cantor
sets (even with these same interval lengths), there is no reason to expect a geometric
relationship between an -dilation at height y and an -dilation at a height y 0 . Since
the hairs are analytic functions, there is a smooth change from one C y to the nearby
C y 0 , and thus the integration is a reasonable thing to do.
For a given y, C y is a central Cantor set with scaling factor y, so from equation (1.9)
in [9] we see that the length of the -dilation of C y is
X
L y () = 2#{gaps ≥ 2} + {all gaps < 2} . (19)
To understand (19), think about the gap endpoints. If x is an endpoint for a gap g and
|g| < 2, then the g ⊂ (C y ) .
As there are 2n gaps of length (1 − 2y)y n , then for (1 − 2y)y n ≥ 2 we must have
2
$ %
ln 1−2y
n ≤ N (y, ) := . (20)
ln(y)
N (y,)+1 N (y,)+2
= 2(2 − 1) + (2y) . (21)
We will integrate this expression in y over the range [a, b]. The main difficulty is
that y influences the value of an integer in both terms. So, we simply break the in-
tegral up into parts where this integer value is constant. For each k between N (a, )
and N (b, ), let yk ∈ [a, b] satisfy (1 − 2yk )ykk = 2, so that the intervals [yk , yk+1 ]
form a partition of [a, b]. Let γ = 1/(1 − 2b). Since (2)1/k solves x k = 2, we
know that
Thus we get
Z b k=N
X (b,) Z yk+1 Z yk+1
L y () dy ≤ 2 (2 k+1
− 1) dy + (2y) k+2
dy
a k=N (a,) yk yk
(b,)
( )
k=N
X Z (2γ )1/(k+1) Z (2γ )1/(k+1)
≤ 2 (2k+1
− 1) dy + (2y) dy . (22)
k
First, estimate the second integral in (22) and its contribution to the sum. We see
that
(2γ )1/(k+1)
2k 2k
Z
(2y)k dy = 2 γ − (2)1/k ≤ 2γ .
(2)1/k k+1 k+1
Estimating the first term in (22) is similar. Using the Mean Value Theorem for the
function (x, y) 7 → x 1/y , we have
Z (2γ )1/(k+1)
2k+1 − 1 dy = 2k+1 − 1 (2γ )1/(k+1) − (2)1/k
(2)1/k
γ − 1 ln(2)
≤2 k+1
(2γ ) 1/(k+1)
− .
k k2
(b,) (b,) k
" k=N k=N
#
2k X X 2
≤ 4γ (γ − 1) − ln(2) . (24)
k=N (a,)
k k=N (a,)
k2
The first sum has the same estimate as (23). For the second sum, we see that
N (b,)
2x
Z
1
d x ∼ 2x .
N (a,) x2 ln(2)N (b, )2
(25)
Comments. We notice that dim H (CH) = 2, but that L2 (CH) = 0 = h 2 (CH), and so
CH is a rather simple example of a set with maximal dimension and zero measure. It
is a more interesting and simpler example of this than the set
#{1 ≤ i ≤ n : ith binary digit of x is one}
S = x ∈ [0, 1] : lim does not exist ,
n n
which is a 1-dimensional set with zero Lebesgue measure. However, S is not compact
and cl(S) = [0, 1], so the Minkowski content doesn’t agree with the Lebesgue mea-
sure, unlike in the case of CH. The behaviour of CH and K ab is the same with respect
to both the Hausdorff and box-counting dimensions and with respect to the Hausdorff
measure and Minkowski content.
for some appropriate (and interesting) choice for g(y). Figure 2 illustrates p (clock-
wise from upper left) the sets associated
√ with the functions g(y) = 1/2 − 1/4 − y 2 ,
2
g(y) = sin(2π y)/2, and g(y) = y/2, g(y) = 2y . The only real restriction on g is
that 0 ≤ g(y) ≤ 1/2 for y ∈ [0, 1/2]. For polynomial g, it is easy to do explicit com-
putations. We see that, just as in the standard case of g(y) = y, all the visible strands
in the image are polynomial functions of y.
We will use CHg to denote the version of CH associated with the function g(y).
Area of “gaps.” For particular choices of g(y) it is also possible to obtain explicit
values for the pareas of the “gaps” in CHg . One non-polynomial case, where we set
g(y) = 1/2 − 1/4 − y 2 , is particularly interesting. In this case, the central gap is a
semi-circle. The calculations in this case are more complicated than in the standard
case, but it is still relatively simple to obtain explicit formulas for the areas of the
“gaps.” The formula for the area of an nth stage “gap” is
Z 1/2 p p n−1
2−n−1 1 − 4y 2 1 − 1 − 4y 2 dy.
0
This is a perfect integral for a trigonometric substitution of the form y = sin(θ )/2,
giving the integral
Z π/2
An = 2−n−2 cos2 (θ )(1 − cos(θ ))n−1 dθ.
0
Explicitly computing these and checking to see if they sum to 1/2 is rather tedious. A
nice trick, however, is to see that the total area is equal to
1 π/2
X Z X
2n An = cos2 (θ ) (1 − cos(θ ))n dθ
n
2 0 n≥0
Z π/2
1 1
= cos2 (θ ) dθ
2 0 1 − 1 + cos(θ )
Z π/2
1 1
= cos(θ ) dθ = .
2 0 2
Thread functions are real-analytic. If the function g(y) is real-analytic, then so are
all the thread functions. This is rather simple to show, since in this case the thread
function for σ ∈ 6 is given by
X
Tσg (y) = (1 − g(y))g(y)−1 σn g(y)n , (27)
n≥1
for all y such that g(y) 6 = 0, 1. In fact, we see that Tσg = Tσ ◦ g, and so Tσg is real-
analytic, as it is the composition of two real-analytic functions. From this viewpoint,
the standard version of CH has a “universality” property, since the thread functions for
any variation are constructed in a simple way from the thread functions of the standard
version.
Thus dim H (CHg ) = 2 and L2 (CHg ) = h 2 (CHg ) = 0 under these conditions. Notice
that this condition is satisfied for three of the examples in Figure 2.
Furthermore, under this same condition the integral estimates for the “horizontal”
tube formula for CHg are a straightfoward change-of-variable from that for CH, so we
obtain the same decay rate, up to a constant multiplier.
Since we obtained the Hausdorff dimension, Hausdorff measure, box dimension,
and Minkowski content for K ab for any 0 ≤ a < b ≤ 1/2, we can use this information
to analyze these same local properties of any CHg .
ACKNOWLEDGMENTS. The second author was partially supported by a Discovery Grant from the Natural
Sciences and Engineering Research Council of Canada.
JACQUES LÉVY VÉHEL is a research director at Inria in France. He is interested in understanding irregu-
larity in natural phenomena, using tools in harmonic analysis, probability theory, and fractal analysis.
Regularity team, INRIA Saclay and MAS Laboratory, Ecole Centrale Paris, Grande Voie des Vignes,
92295 Chatenay-Malabry Cedex, France
jacques.levy-vehel@inria.fr
FRANKLIN MENDIVIL is a professor of mathematics at Acadia University in Nova Scotia. His research is
a blend of fractal geometry and analysis, image processing, and optimization. He considers himself extremely
lucky to be in a profession that allows him to explore many different topics.
Department of Mathematics and Statistics, Acadia University, 12 University Avenue,
Wolfville, NS Canada B4P 2R6
franklin.mendivil@acadiau.ca
Abstract. The game of memory is played with a deck of n pairs of cards. The cards in each
pair are identical. The deck is shuffled and the cards laid face down. A move consists of
flipping over first one card and then another. The cards are removed from play if they match.
Otherwise, they are flipped back over and the next move commences. A game ends when all
pairs have been matched. We determine that, when the game is played optimally, as n → ∞:
• The expected number of moves is (3 − 2 ln 2)n + 7/8 − 2 ln 2 ≈ 1.61n.
• The expected number of times two matching cards are unwittingly flipped over is ln 2.
√
• The expected number of flips until two matching cards have been seen is 22n / 2nn ∼ πn.
1. What is the expected length of a game (i.e., how many moves are required)?
Each card must be flipped over at least once. If it is not matched the first time it
is flipped over, then it is flipped again, and removed, once the location of its mate is
known. Thus, each card is flipped either once or twice, and therefore the total number
of card flips is between 2n and 4n. Since two cards are flipped over on each move, this
http://dx.doi.org/10.4169/amer.math.monthly.120.09.787
MSC: Primary 60C05, Secondary 05A16
means that the number of moves in any game will be between n and 2n. Analysis of
the possibilities for the end of the game shows that the last card to be flipped over will
only be flipped once, and therefore the length of the game cannot be 2n. Therefore,
the length lies between n and 2n − 1. Example 2 (in Section 2) illustrates that both of
these lengths are possible. (It is not hard to show that if the player has no memory at
all, and plays by simply flipping over cards at random, then the expected length of a
game is n 2 .)
Computer investigations indicate that the expected length of a game is roughly 1.6n.
In Sections 4 and 5 we will determine the exact length, and show that it is approxi-
mately (3 − 2 ln 2)n + 7/8 − 2 ln 2, with the error in this approximation approaching 0
as n → ∞. Figure 1 shows the distribution of game lengths for n = 100. The expected
length in this case is about 160.8589, and our approximation gives about 160.8593.
1 0.20
10 −50 0.15
10 −100 0.10
0.05
10 −150
A lucky move is one in which the player flips two cards that have not been flipped
before and they happen to match. Note that a game of length n is one in which every
move is lucky. The second question, to be addressed in Section 5.2, is as follows.
3. How many flips are required before the player should expect to have seen both
cards of a pair?
Example 1. Suppose that n = 6 and that the cards are laid out in the order
1 2 1 6 2 3 3 5 4 4 5 6.
Example 2. We can now give examples of the shortest and longest possible games. If
the cards are laid out in the order
1 1 2 2 3 3 · · · n n,
then the game will consist of n lucky moves; this is the shortest possible game. It is
only slightly harder to illustrate the longest possible game. We will let the reader verify
that if the cards are dealt in the order
1 2 3 1 4 2 · · · k k − 2 · · · n − 1 n − 3 n n − 2 n − 1 n,
1 2 1 6 2 3 3 5 4 4 5 6
4 5 4 6 5 1 1 2 3 3 2 6
Figure 2. Two deals for n = 6 with the same interconnection network.
The two deals in Figure 2 differ only in the labels we have assigned to each pair
of cards, and therefore the first deal leads to essentially the same game as the second.
We wish to identify those games with the same underlying interconnection network.
In general, there is an n!-to-1 map from size-2n deals to size-2n interconnection net-
works. This map for n = 2 is illustrated in Figure 3. We prefer to remain in the realm
of card games, so we designate a representative for each class of deals. We call a deal
standard if, when the game is played, the pairs are removed in order from 1 to n. Put
another way, a deal is standard if the second occurrence of i occurs to the left of the
second occurrence of i + 1 for each i between 1 and n − 1. The deals in Examples 1
and 2 are all standard.
{1 1 2 2, 2 2 1 1}
{1 2 1 2, 2 1 2 1}
{2 1 1 2, 1 2 2 1}
Figure 3. Memory deals for n = 2 paired according to interconnection network.
a(n, j)
,
(2n)!/(2n n!)
n+1 Pn+1
X a(n, j) j=2 j · a(n, j)
j· = . (1)
j=2
(2n!)/(2n n!) (2n)!/(2n n!)
It turns out that there is a simple formula for the numerator of the last fraction.
n+1
j · a(n, j) = 2n n!.
X
j=2
n+1
X n+1
X
= j · (2n −1− j) · a(n −1, j) + j · ( j −1) · a(n −1, j −1)
j=2 j=2
n+1
X n
X
= j · (2n −1− j) · a(n −1, j) + ( j +1) · j · a(n −1, j)
j=2 j=1
n
X n
X
= 2n j · a(n −1, j) = 2n · j · a(n −1, j)
j=2 j=2
22n
2n
.
n
Proof. Using equation (1) and Lemma 4, we find that the expected position of the first
match is
Pn+1
j=2 j · a(n, j) 2n n! 22n 22n
= = = .
(2n)!/(2n n!) (2n)!/(2n n!) (2n)!/n!2 2n
n
√
Corollary 6. The expected position of the first match grows as π n.
√ n n
Proof. Straightforward applications of Stirling’s approximation n! ∼ 2πn (see,
e
for example, [5]) yield
√ n
22n 2n
( 2π n ne )2 √
2n
∼2 √ = πn.
2n 2n
n 2π(2n) e
Consider any standard deal σ ∈ Mn . In the game based on σ , the player will turn
over unknown cards two at a time until she comes to the first matching card at position
f (σ ). What happens at that point depends on whether f (σ ) is even or odd. If f (σ )
is odd, then the card at position f (σ ), which is the second 1, will be flipped over at
the beginning of a move. Since we are assuming that the player has perfect memory,
she will then flip the first 1 and remove both 1s. The number of moves required to
n n
bi (σ ) 1X 1
X
+ ai (σ ) = bi (σ ) + (n − e(σ )) · + (e(σ ) − `(σ )) · 1
i=1
2 2 i=1 2
3n e(σ )
= + − `(σ ).
2 2
For example, if n = 6 and σ is the deal given in Example 1, then the lengths of the
blocks are 3, 2, 2, 3, 1, 1, and the third and fourth blocks are lucky. Therefore e(σ ) = 2
and `(σ ) = 1, and the length of the game is
3·6 2
+ − 1 = 9,
2 2
b(n, j)
b(n, j) = ,
(2n)!/(2n n!)
`(n, j)
`(n, j) = .
(2n)!/(2n n!)
Using this notation, we can write the expected value of e(σ ) as i=1
P∞
b(n, 2i), and the
expected value of `(σ ) as i=1 `(n, 2i). Notice that, although we have written these
P∞
expected values as infinite sums, in fact the sums are finite, since all but finitely many
terms in each sum are 0. Thus, the expected length of a game is
∞ ∞
3n 1 X X
+ b(n, 2i) − `(n, 2i). (2)
2 2 i=1 i=1
Several steps are required to pass from equation (2) to Theorem 7. The first is to
replace the b(n, j) and `(n, j) in (2) with (what turn out to be) their asymptotic limits.
We will show that, for large n,
2(2n + 1) 1
b(n, j) ≈ and for j ≥ 2, `(n, j) ≈ . (3)
j ( j + 1)( j + 2) j ( j − 1)
We therefore define
2(2n +1) ˚ 1
b̊(n, j) = b(n, j)− and for j ≥ 2, `(n, j) = `(n, j)− .
j ( j +1)( j +2) j ( j −1)
(4)
Equivalently,
2(2n +1) 1 ˚
b(n, j) = + b̊(n, j) and for j ≥ 2, `(n, j) = + `(n, j).
j ( j +1)( j +2) j ( j −1)
We wish to rewrite (2) using the asymptotic limits of the b(n, j) and `(n, j). To this
end, we first compute
∞ ∞
2(2n + 1)
X X
b(n, 2i) = + b̊(n, 2i)
i=1 i=1
2i(2i + 1)(2i + 2)
∞ ∞
X 2(2n + 1) X
= + b̊(n, 2i).
i=1
2i(2i + 1)(2i + 2) i=1
2 1 2 1
= − + . (5)
j ( j + 1)( j + 2) j j +1 j +2
This gives us
∞ N
2(2n +1) 1 2 1
X X
= (2n +1) lim − +
i=1
2i(2i +1)(2i +2) N →∞
i=1
2i 2i +1 2i +2
1 2 1 1 2 1 1 2 1
= (2n +1) lim − + + − + +· · ·+ − +
N →∞ 2 3 4 4 5 6 2N 2N +1 2N +2
1 2 2 2 2 2 1
= (2n +1) lim − + − +· · ·+ − +
N →∞ 2 3 4 5 2N 2N +1 2N +2
3 1 1 1 1 1
= (2n +1) lim + −2 1− + − +· · ·+
N →∞ 2 2N +2 2 3 4 2N +1
3
= (2n +1) −2 ln 2 ,
2
Next, we compute
∞ ∞ ∞ ∞
1 1
X
˚ 2i) = ˚ 2i).
X X X
`(n, 2i) = + `(n, + `(n,
i=1 i=1
2i(2i − 1) i=1
2i(2i − 1) i=1
1 1 1
= − .
j ( j − 1) j −1 j
This gives us
∞ N
1 1 1
X X
= lim −
i=1
2i(2i − 1) N →∞ i=1 2i − 1 2i
1 1 1 1 1
= lim 1 − + − + · · · + − = ln 2,
N →∞ 2 3 4 2N − 1 2N
so
∞ ∞
˚ 2i).
X X
`(n, 2i) = ln 2 + `(n, (7)
i=1 i=1
∞ ∞
3n 1 X X
+ b(n, 2i) − `(n, 2i)
2 2 i=1 i=1
∞ ∞
3 1X X
˚ 2i).
= (3 − 2 ln 2)n + − 2 ln 2 + b̊(n, 2i) − `(n,
4 2 i=1 i=1
∞
X 1
lim b̊(n, 2i) =
n→∞
i=1
4
and
∞
˚ 2i) = 0.
X
lim `(n,
n→∞
i=1
5.1. Sum of b̊(n, 2i). First we determine recurrences for the b(n, j). If n = 1, then
the only standard deal is 1 1, which has a single block of length 2. So b(1, 2) = 1 and
b(1, j) = 0 for j 6 = 2. For n ≥ 2, as we saw before, any standard deal σ ∈ Mn can
be constructed from a standard deal σ 0 ∈ Mn−1 by inserting an n in one of the 2n − 1
possible positions in σ 0 , and then adding a second n at the end. Now, what happens to
the blocks of σ 0 when this is done? A block of length j in σ 0 either becomes a block
of length j + 1 (if the first n is inserted into that block) or length j (if the first n is
inserted somewhere else). Also, there is a new block at the end of length either 2 (if
both copies of n are at the end) or 1 (if not). So, for n ≥ 2:
(2n − 2)!
b(n, 1) = (2n − 2) · b(n − 1, 1) + (2n − 2) · ,
2n−1 (n − 1)!
(2n − 2)!
b(n, 2) = b(n − 1, 1) + (2n − 3) · b(n − 1, 2) + ,
2n−1 (n
− 1)!
and
Finally, we use the definition of b̊(n, j) (equation (4)) to convert these recurrences into
recurrences for b̊.
(2n − 2) · b̊(n − 1, 1) − 1
b̊(n, 1) = ,
2n − 1
b̊(n − 1, 1) + (2n − 3) · b̊(n − 1, 2) + 1
b̊(n, 2) = ,
2n − 1
and
j
1 2 3 4 5 6 7
1 −1 3/4 −1/10 −1/20 −1/35 −1/56 −1/84
2 −1 1/4 1/2 −1/12 −1/21 −5/168 −5/252
3 −1 3/20 3/10 17/60 −1/15 −1/24 −1/36
n 4 −1 3/28 3/14 1/4 1/7 −3/56 −1/28
5 −1 1/12 1/6 53/252 11/63 31/504 −11/252
6 −1 3/44 3/22 71/396 17/99 85/792 7/396
7 −1 3/52 3/26 89/572 23/143 425/3432 3/52
8 −1 1/20 1/10 107/780 29/195 439/3432 53/660
Lemma 9.
1. For n ≥ 1,
b̊(n, 1) = −1.
2. For n ≥ 1,
3
b̊(n, 2) = .
4(2n − 1)
3. For n ≥ 2,
3
b̊(n, 3) = 2b̊(n, 2) = .
2(2n − 1)
Proof. All three statements are proven by induction using Lemma 8, with the proof of
each statement after the first also making use of the previous statement.
3 2 · 4 · 6 · · · (2n − 2)
Cn = · .
4 3 · 5 · 7 · · · (2n − 1)
For n = 1, we interpret the products in the numerator and denominator of the last
fraction in the formula for
Cn as empty products, which are equal to 1. So the inequality
to be proven is b̊(1, j) ≤ 3/4, which follows easily from the formulas in Lemma 8.
Now suppose that n ≥ 2, and the lemma holds for n − 1. Then the inductive hy-
pothesis is that for all j ≥ 2, b̊(n − 1, j) ≤ Cn−1 . We now consider three cases.
b̊(n, 2) =
3 3 2
≤ ·
4(2n − 1) 4 2n − 1
3 2 4 · 6 · · · (2n − 2)
≤ · · = Cn .
4 2n − 1 3 · 5 · · · (2n − 3)
Case 2: 3 ≤ j ≤ 2n − 1. Then
( j − 1) · b̊(n − 1, j − 1) + (2n − 1 − j) · b̊(n − 1, j)
b̊(n, j) =
2n − 1
j −1 2n − 1 − j
· b̊(n − 1, j − 1) + · b̊(n − 1, j)
≤
2n − 1 2n − 1
j −1 2n − 1 − j 2n − 2
≤ · Cn−1 + · Cn−1 = · Cn−1
2n − 1 2n − 1 2n − 1
2n − 2 3 2 · 4 · · · (2n − 4)
= · · = Cn .
2n − 1 4 3 · 5 · · · (2n − 3)
2(2n + 1) 2(2n + 1)
b̊(n, j) = b(n, j) − =− .
j ( j + 1)( j + 2) j ( j + 1)( j + 2)
Thus
b̊(n, j) = 2(2n + 1) 2(2n + 1) 1 1
≤ = ≤
j ( j + 1)( j + 2) 2n(2n + 1)(2n + 2) n(2n + 2) 2n − 1
3 2 3 2 4 · 6 · · · (2n − 2)
≤ · ≤ · · = Cn .
4 2n − 1 4 2n − 1 3 · 5 · · · (2n − 3)
To get a better idea of the size of the bound in the last lemma, note that
2
2 · 4 · · · (2n − 2) 2 2 4 4 2n − 2 2n − 2 1
= · · · ··· · · .
3 · 5 · · · (2n − 1) 1 3 3 5 2n − 3 2n − 1 2n − 1
2 2 4 4 2n − 2 2n − 2
· · · ··· ·
1 3 3 5 2n − 3 2n − 1
increases as n increases, and it is well known that as n → ∞, it converges to π/2 (see,
for example, [7]). Therefore, for j ≥ 2 we have
b̊(n, j) ≤ 3 · π
r
. (9)
4 2(2n − 1)
In particular, it follows that b̊(n, j) → 0 as n → ∞, which justifies our claim that (3)
gives the asymptotic limits for the b(n, j).
The first sum on the right-hand side of equation (10) is the expected number of blocks
(of all lengths). But for every standard deal, the number of blocks is n, so this expected
number is n. We can evaluate the second sum on the right-hand side of (10) by using
the partial fractions decomposition (5):
∞
X 2(2n + 1)
j=1
j ( j + 1)( j + 2)
N
1 2 1
X
= (2n + 1) lim − +
N →∞
j=1
j j +1 j +2
2 1 1 2 1 1 2 1
= (2n + 1) lim 1 − + + − + + · · · + − +
N →∞ 2 3 2 3 4 N N +1 N +2
1 1 1 1
= (2n + 1) lim − + =n+ .
N →∞ 2 N +1 N +2 2
Thus,
∞ ∞ ∞
2(2n + 1) 1 1
X X X
b̊(n, j) = b(n, j) − =n− n+ =− .
j=1 j=1 j=1
j ( j + 1)( j + 2) 2 2
To separate out the contributions of the even- and odd-numbered terms in (11), we
will use the following fact, which is illustrated in the first graph in Figure 4.
0.12
0.025
0.10
0.020
0.08
0.015
0.06
0.010
0.04
0.005
0.02
0.000 j
0.00 j 5 10 15 20
5 10 15 20 − 0.005
− 0.02
˚
Figure 4. Plots of the values of b̊(n, j) (left) and `(n, j) (right) for j from 2 to 21 for n equal to 10 (circles),
15 (squares), and 20 (diamonds).
and
b̊(n, m n ) ≤ b̊(n, m n + 1) ≤ · · · .
Proof. We use induction on n. It is easy to verify that the lemma holds for n = 1 and
n = 2 (with k1 = 2, m 1 = 3, k2 = 3, and m 2 = 4). Now suppose that n ≥ 3, and the
lemma holds for n − 1. To prove that the lemma holds for n, we will show that
We can therefore set kn to be one of kn−1 or kn−1 + 1, and set m n to be one of m n−1 or
m n−1 + 1.
To prove (12), suppose that 2 ≤ j < kn−1 ; we must verify that b̊(n, j) ≤ b̊(n, j +
1). If j = 2, this follows immediately from Lemma 9. Now suppose that j ≥ 3, and
note that j < m n−1 ≤ n + 1 ≤ 2n − 2. Then by the inductive hypothesis, we have
and therefore
( j − 1) · b̊(n − 1, j − 1) + (2n − 1 − j) · b̊(n − 1, j)
b̊(n, j) =
2n − 1
(2n − 2) · b̊(n − 1, j)
≤ (15)
2n − 1
j · b̊(n − 1, j) + (2n − 2 − j) · b̊(n − 1, j + 1)
≤ = b̊(n, j + 1).
2n − 1
Similar reasoning can be used to prove (13). To prove (14), suppose that j ≥
m n−1 + 1. If j ≤ 2n − 2, then we can repeat the reasoning in (15) to show that
b̊(n, j) ≤ b̊(n, j + 1). If j ≥ 2n − 1, then j ≥ n + 2, so b(n, j) = b(n, j + 1) = 0,
and therefore
2(2n + 1) 2(2n + 1)
b̊(n, j) = − ≤− = b̊(n, j + 1).
j ( j + 1)( j + 2) ( j + 1)( j + 2)( j + 3)
|B − A| ≤ 2δ. (16)
A = a1 + a3 + · · · + ak−1 ≤ a2 + a4 + · · · + ak = B
and
A − a1 = a3 + a5 + · · · + ak−1 ≥ a2 + a4 + · · · + ak−2 = B − ak .
We are finally ready to evaluate the first limit needed to complete our proof of
Theorem 7.
Lemma 13.
∞
X 1
lim b̊(n, 2i) = .
n→∞
i=1
4
Proof. By Lemma 11, for N > 0 the sequence b̊(n, 2), b̊(n, 3), . . . , b̊(n, 2N + 1) con-
sists of at most three monotonic subsequences.
√ We can therefore apply Lemma 12 to
each subsequence with the bound δ = (3/4) π/(2(2n − 1)) from (9) to show that
N
X N
X N
X
b̊(n, 2i + 1) − 6δ ≤ b̊(n, 2i) ≤ b̊(n, 2i + 1) + 6δ.
i=1 i=1 i=1
PN
Adding i=1 b̊(n, 2i), we can rewrite this as
2N
X +1 N
X 2N
X +1
b̊(n, j) − 6δ ≤ 2 b̊(n, 2i) ≤ b̊(n, j) + 6δ.
j=2 i=1 j=2
or in other words,
∞
1 9 π 1 9 π
r X r
− ≤ b̊(n, 2i) ≤ + .
4 4 2(2n − 1) i=1
4 4 2(2n − 1)
(2n − 3) · `(n − 1, 2) + 1
`(n, 2) =
2n − 1
and
˚
Finally, applying the definition of `(n, j) we get
˚ 2) = 1 ,
`(1,
2
˚ 1
`(1, j) = − ( j ≥ 3),
j ( j − 1)
and for n ≥ 2,
˚
˚ 2) = (2n − 3) · `(n − 1, 2)
`(n,
2n − 1
and
˚ − 1, j − 1) + (2n − 1 − j) · `(n
( j − 2) · `(n ˚ − 1, j)
˚
`(n, j) = (for j ≥ 3).
2n − 1
˚
`(n, 1
.
j) ≤
2(2n − 1)
Also, for n ≥ 2,
˚ 2) = `(n,
˚ 3) = 1
`(n, .
2(2n − 1)
Note that the first equation in Lemma 15 indicates that the expected number of
adjacent matching pairs in a random deal is 1.
1 1 1 1 1
= lim 1 − + − + · · · + −
N →∞ 2 2 3 N −1 N
1
= lim 1 − = 1.
N →∞ N
Thus ∞
˚
X
`(n, j) = 1 − 1 = 0.
j=2
˚ 2), `(n,
Proof. By Lemma 16, for N > 0 the sequence `(n, ˚ 3), . . . , `(n,
˚ 2N + 1) con-
sists of at most two monotonic subsequences. As in the proof of Lemma 13, we apply
Lemma 12 with δ = 1/(2(2n − 1)) to show that
N N N
˚ 2i + 1) − 4δ ≤ ˚ 2i) ≤ ˚ 2i + 1) + 4δ.
X X X
`(n, `(n, `(n,
i=1 i=1 i=1
1 1 1
O √ + =O √ .
n n n
ACKNOWLEDGMENTS. The authors would like to thank Dan Archdeacon for useful conversations leading
to this paper, and the anonymous referees for several helpful suggestions.
REFERENCES
1. E. Alfthan, Optimal strategy in the childrens [sic] game memory (2007), available at http://www.math.
kth.se/xComb/x1.pdf.
2. Classic memory game (2012), available at http://classicmemorygame.com/.
3. Concentration memory game (2012), available at http://www.mathsisfun.com/games/memory/
index.html.
4. P. Flajolet, R. Sedgewick, Analytic Combinatorics, Cambridge University Press, Cambridge, 2009.
5. R. L. Graham, D. E. Knuth, O. Patashnik, Concrete Mathematics, second edition. Addison-Wesley, Read-
ing, MA, 1994.
6. J. C. Lagarias, A. M. Odlyzko, D. B. Zagier, On the capacity of disjointly shared networks, Com-
put. Networks ISDN Systems 10 (1985) 275–285, available at http://dx.doi.org/10.1016/0169-
7552(85)90070-4.
7. J. Wästlund, An elementary proof of the Wallis product formula for pi, Amer. Math. Monthly 114 (2007)
914–917.
DANIEL J. VELLEMAN received his B.A. from Dartmouth College in 1976 and his Ph.D. from the Univer-
sity of Wisconsin–Madison in 1980. He taught at the University of Texas before joining the faculty of Amherst
College in 1983. He was the editor of the American Mathematical Monthly from 2007 to 2011. In his spare
time he enjoys singing, bicycling, and playing volleyball.
Department of Mathematics, Amherst College, Amherst, MA 01002
djvelleman@amherst.edu
GREGORY S. WARRINGTON received his B.A. from Princeton University in 1995 and his Ph.D. from
Harvard University in 2001. He is now an algebraic combinatorialist at the University of Vermont. He and his
family enjoy playing games together; the inspiration for this paper came while he was playing the game of
memory with his daughter. This work was partially supported by a grant from the Simons Foundation (grant
number 197419) and National Science Foundation grant DMS-1201312.
Department of Mathematics and Statistics, University of Vermont, Burlington, VT 05401
gregory.warrington@uvm.edu
Abstract. We revisit the classical calculus problem of describing the flow of brine in a sys-
tem of tanks connected by pipes. For various configurations involving an arbitrary number of
tanks, we show that the corresponding linear system of differential equations can be solved
analytically. Finally, we analyze the asymptotic behavior of solutions for a general closed sys-
tem of tanks. It turns out that the problem is closely related to the study of Laplacian matrices
for directed graphs.
1. INTRODUCTION. Consider two tanks filled with brine, which are connected by
a pair of pipes. One pipe brings brine from the first tank to the second tank at a given
rate, while the second pipe carries brine in the opposite direction at the same rate. This
situation suggests the following problem. Assuming that the initial concentrations in
both tanks are known and that we have a perfect mixing in both tanks, find the concen-
trations in both tanks after a given period of time. Does the problem sound familiar?
Similar mixing problems appear in many differential equations textbooks (see, e.g.,
[3], [10], and especially [5], which has an impressive collection of mixing problems).
Most authors restrict themselves to mixing problems involving two or three tanks ar-
ranged in various configurations (a cascade with brine flowing in a single direction
only, a linear arrangement of tanks connected by pairs of pipes, a cyclic arrangement
of tanks, etc.). The problem then leads to a linear system of differential equations for
the unknown concentrations, which is solved by calculating the eigenvalues and eigen-
vectors of the corresponding matrix.
An exercise in [5, p. 380] asks the reader to consider a cascade of n tanks and to
solve the corresponding differential equations numerically in the case when n = 10.
In this article, we discuss a variety of mixing problems with n tanks (including the one
from [5]) and show that they can be solved exactly. Not only is it satisfying to obtain
analytic solutions of these problems, but we will also have the opportunity to review
a number of other interesting topics such as the Gershgorin circle theorem, recurrence
relations, the heat equation, circulant matrices, and Laplacian matrices.
T3 T7
T1
T4 T6
T5
Without loss of generality (by taking suitable time units), we may assume that f = V .
The system of differential equations can be written in the form x 0 (t) = Ax(t), where
x(t) = (x1 (t), . . . , xn (t))T and
−(n − 1) 1 · · · 1
1 −1 · · · 0
.
A= .. .. .. ..
. . . .
1 0 ··· −1
−(n − 1) − λ 1 ··· 1
1 −1 − λ · · · 0
B = A − λI = .
.. .. .. ..
. . . .
1 0 ··· −1 − λ
By definition,
X
det B = sgn(π )b1π(1) · · · bnπ(n) ,
π∈Sn
where the summation runs over all permutations π of {1, . . . , n}. Clearly, the only
permutations that contribute nonzero terms are the following.
• The identity permutation, which contributes (−(n − 1) − λ)(−1 − λ)n−1 .
• Permutations where π(1) = i for a certain i ∈ {2, . . . , n}, π(i) = 1, and π( j) = j
for j ∈ {2, . . . , n}\{i}. Each of these permutations is a transposition, and con-
tributes −(−1 − λ)n−2 to the final sum.
It follows that
1 1 ··· 1
1 −1 + n ··· 0
A − λI = .
.
.. .
.. .. ..
. .
1 0 ··· −1 + n
The eigenvector components must satisfy vi = v1 /(1 − n), for 2 ≤ i ≤ n, and we can
take v = (1 − n, 1, . . . , 1). Finally, if λ = −1, then
−n + 2 1 · · · 1
1 0 ··· 0
A − λI = .
.. .. . . .. .
. . .
1 0 ··· 0
T1 T2 T3 T4 T5 T6 T7 T8
Again, the flow through each pipe is f gallons per unit of time. Using the same
notation as before, we obtain the following system of differential equations:
x1 (t) x2 (t)
x10 (t) = − f + f ,
V V
xi−1 (t) xi (t) xi+1 (t)
xi0 (t) = f −2f + f , for 2 ≤ i ≤ n − 1,
V V V
and
xn−1 (t) xn (t)
xn0 (t) = − f − f .
V V
1 0 ··· 0 0 0
−1
1 −2 1 ··· 0 0 0
0 1 −2 · · · 0 0 0
. .. .. . . .. .. ..
A= .
. . . . . . ..
0 0 0 · · · −2 1 0
0 0 0 ··· 1 −2 1
0 0 0 ··· 0 1 −1
In our case, we have a11 = ann = −1 and r1 = rn = 1, while aii = −2 and ri = 2 for
i ∈ {2, . . . , n − 1}. Consequently, all eigenvalues of A are contained in the interval
[−4, 0].
Let us now return to the calculation of det(A − λI ). We start by expanding the
determinant with respect to the first column:
−1 − λ 1 0 ··· 0 0
1 −2 − λ 1 ··· 0 0
0 1 −2 − λ · · · 0 0
det(A − λI ) = det
.. .. .. .. .. ..
. . . . . .
0 0 0 ··· −2 − λ 1
0 0 0 ··· 1 −1 − λ
−2 − λ 1 ··· 0 0
1 −2 − λ · · · 0 0
= (−1 − λ) det
.. .. .. .. ..
. . . . .
0 0 ··· −2 − λ 1
0 0 ··· 1 −1 − λ
1 0 ··· 0 0
1 −2 − λ · · · 0 0
. .. .. .. ..
− det . .
. . . . .
0 0 · · · −2 − λ 1
0 0 ··· 1 −1 − λ
The second determinant remains unchanged if we omit the first row, together with the
first column. Consequently,
−2 − λ 1 0 ··· 0 0
1 −2 − λ 1 ··· 0 0
0 1 −2 − λ ··· 0 0
.
.. .. .. .. .. ..
. . . . . .
0 0 0 ··· −2 − λ 1
0 0 0 ··· 1 −1 − λ
The value of Dk can be calculated by the same method that works for tridiagonal
Toeplitz matrices (see [8, p. 133]). Expanding this determinant with respect to the first
column, we obtain the recurrence relation
The initial values are D1 = −1 − λ and D0 = 1. We apply the standard procedure for
solving second-order linear homogeneous recurrence relations (see [6]). The auxiliary
equation
x 2 + (2 + λ)x + 1 = 0
has discriminant (2 + λ)2 − 4 = λ(λ + 4), which is nonpositive for λ ∈ [−4, 0]. (We
restrict our attention to λ ∈ [−4, 0] because we are trying to find the roots of det(A −
λ), and we already know they are contained in [−4, 0].)
√
−λ − 2 −λ(λ + 4)
x1,2 = ±i .
2 2
Performing the substitution λ + 2 = u, we obtain
√
−u 4 − u2
x1,2 = ±i = e±iγ ,
2 2
√
4−u 2
where γ ∈ [0, π] satisfies cos γ = −u/2, sin γ = 2 . Thus Dk is a linear combi-
nation of x1k = eikγ and x2k = e−ikγ , and Dk = α cos(kγ ) + β sin(kγ ) for some con-
stants α, β and every k ≥ 0. Using the initial conditions D0 = 1 and D1 = −1 − λ =
1 − u = 1 + 2 cos γ , we find α = 1 and β = (1 + cos γ )/ sin γ = cot(γ /2), provided
that γ 6= 0 (this assumption will be justified shortly). Therefore, Dk = cos(kγ ) +
cot(γ /2) sin(kγ ) and
kπ
λk = −2 cos − 2, for k ∈ {1, . . . , n},
n
are the eigenvalues of A; since we found n distinct eigenvalues, there is no need to
investigate the case γ = 0, which we excluded earlier.
The eigenvectors corresponding to λk are solutions of the linear homogeneous sys-
tem with the matrix
1 + 2 cos kπ 1 0 0 0
n
···
1 2 cos kπ
n
1 ··· 0 0
kπ
0 1 2 cos n · · · 0 0
.. .. .. . . . .
. . . . . .
. .
.
kπ
0 0 0 · · · 2 cos n 1
0 0 0 ··· 1 1 + 2 cos kπ
n
kπ
v2 = − 1 + 2 cos v1 , (1)
n
and
kπ
vi = −2 cos vi−1 − vi−2 , for i ∈ {3, . . . , n}, (2)
n
kπi kπ kπi
i i
vi = (−1) cos − (−1) cot sin , for i ∈ {1, . . . , n},
n 2n n
∂f ∂2 f
(t, x) = k 2 (t, x),
∂t ∂x
which models the conduction of heat in a one-dimensional rod (see [3], [10]); in par-
ticular, f (t, x) is the temperature at the point x on the rod at time t. The same equation
also describes the one-dimensional diffusion in a homogeneous medium; in this case,
f (t, x) corresponds to the concentration of a chemical at the point x at time t. Consider
∂f ∂2 f
yi0 (t) = (t, xi ) = k 2 (t, xi ),
∂t ∂x
it follows that
. k
y00 (t) = (−y0 (t) + y1 (t)),
(1x)2
. k
yi0 (t) = (yi−1 (t) − 2yi (t) + yi+1 (t)), for i ∈ {1, . . . , n − 1},
(1x)2
and
. k
yn0 (t) = (yn−1 (t) − yn (t)).
(1x)2
The right-hand side of this system is the same as in our mixing problem for n + 1 tanks
in a row. Thus, we have discovered that this particular mixing problem is equivalent
to the spatial discretization of the one-dimensional heat equation. Conversely, we can
start with the mixing problem as a toy model for heat conduction or diffusion, and
derive the one-dimensional heat equation by letting n → ∞ and 1x → 0 (see also [6,
p. 167] for a related model corresponding to both spatial and temporal discretization
of the heat equation).
T2 T3
T1 T4
T8 T5
T7 T6
This configuration is quite similar to the linear arrangement and we leave it to the
reader to verify that the matrix of the resulting linear system of differential equations is
1 0 ··· 0 0 1
−2
1 −2 1 ··· 0 0 0
0 1 −2 · · · 0 0 0
. .. .. . . .. .. ..
A= .. . . . . . ..
0 0 0 · · · −2 1 0
0 0 0 ··· 1 −2 1
1 0 0 ··· 0 1 −2
This is an example of a circulant matrix, i.e., a matrix whose rows are obtained by
cyclically shifting the first row. Since the problem of calculating the eigenvalues and
eigenvectors of a circulant matrix is well known, we state only the results and refer
the reader to appropriate sources (for the general theory of circulant matrices, see [9]
and the references given there; for a direct calculation of the eigenvalues of A without
circulant matrix theory, see Lemma 4.9 in [2]).
1. All circulant matrices of order n share the same set of eigenvectors, namely
for j ∈ {1, . . . , n − 1}. Thus, for odd values of n, the spectrum of A consists of the
simple eigenvalue λ0 and the eigenvalues λ1 , . . . , λ(n−1)/2 having multiplicity 2. For
even values of n, we have the simple eigenvalues λ0 , λn/2 (with (−1, 1, . . . , −1, 1) be-
ing the eigenvector corresponding to λn/2 ) and the eigenvalues λ1 , . . . , λ(n/2)−1 having
multiplicity 2. In any case, all eigenvalues except λ0 are negative, and thus all solu-
tions of our system approach the equilibrium state with all tanks containing the same
amount of salt.
Again, we remark that the matrix A occurs quite frequently in situations where we
discretize second-order ordinary differential equations subject to suitable boundary
conditions; see [12] for a detailed discussion.
T1 T4
T8 T5
T7 T6
deg (vi ) if i = j,
+
li j = −1 if i 6 = j and (vi , v j ) ∈ E,
0
otherwise
(where deg+ (v) stands for the outdegree of the vertex v) is known as the Laplacian
matrix of the directed graph G.
What is the relation between mixing problems and Laplacian matrices? A mixing
problem can be represented by a directed graph G with vertices T1 , . . . , Tn correspond-
ing to tanks and edges representing the pipes. It follows from our previous discussion
that such a mixing problem leads to the linear system x 0 (t) = Ax(t), where L = −A
is the Laplacian matrix of G.
Clearly, λ is an eigenvalue of A if and only if −λ is an eigenvalue of L (in this case,
both eigenvalues share the same eigenvectors). For an arbitrary directed graph, the sum
of each row in L is zero, and we see that (1, . . . , 1) is an eigenvector corresponding to
the zero eigenvalue. To prove that all solutions of our general mixing problem tend to
the equilibrium state with all tanks containing the same amount of salt, it is sufficient
to show that the zero eigenvalue is a simple one, and that all remaining eigenvalues
of L have positive real parts. The second assertion is an immediate consequence of
the Gershgorin circle theorem, which says that P all eigenvaluesP are contained in the
union of the n disks with centers lii and radii j6=i |li j | = − j6=i li j = lii , where
i ∈ {1, . . . , n}. To prove that the zero eigenvalue is a simple one, note that the graphs
arising from our mixing problems have two important properties: They are connected
and balanced, i.e., the indegree deg− (v) of an arbitrary vertex v equals its outdegree
deg+ (v). However, a balanced connected graph is also strongly connected, i.e., there
is a directed path from each vertex in the graph to every other vertex. (Assume that the
graph has more than one strongly connected component. The number of edges leaving
a certain component equals the number of edges entering that component, and we can
find an oriented cycle passing through different components—a contradiction.)
We now proceed to show that every balanced connected directed graph has a simple
zero eigenvalue. For undirected graphs, this result is well known (see [2], [7]); the
usual proof is based on the observation that L = M M T , where L is the Laplacian
matrix and M is the incidence matrix. Such a decomposition is possible only when
L is symmetric, and thus we have to use a different approach. Unfortunately, the lit-
1 X
xT Lx = (xi − x j )2 .
2 (v ,v )∈E
i j
Proof. We have
n n n
T
lii xi2 l j j x 2j
X X X X
2x L x = 2 li j x i x j = +2 li j x i x j +
i, j=1 i=1 i, j∈{1,...,n}, j=1
i6= j
n n
deg+ (vi )xi2 − 2 deg+ (v j )x 2j .
X X X
= xi x j +
i=1 (vi ,v j )∈E j=1
Observing that
n
deg+ (vi )xi2 = xi2 ,
X X
n n
deg+ (v j )x 2j = deg− (v j )x 2j
X X
j=1 j=1
x 2j ,
X
=
(vi ,v j )∈E
we obtain
Theorem 2. The null space of the Laplacian matrix of a connected balanced directed
graph G = (V, E) has dimension 1.
1 X
0 = xT Lx = (xi − x j )2 .
2 (v ,v )∈E
i j
Consequently, all terms in the last sum must vanish and xi = x j whenever (vi , v j ) ∈ E.
It follows that xi = x j if there is a directed path between vi and v j . Since our graph is
strongly connected, we conclude that x1 = · · · = xn , i.e., the null space of L is spanned
by the single vector (1, . . . , 1).
Theorem 2 was already proved in [11], but our proof is different and shorter. We
now proceed to show that the algebraic multiplicity of the zero eigenvalue equals its
geometric multiplicity.
Theorem 3. For the Laplacian matrix of a connected balanced directed graph, the
zero eigenvalue is a simple one.
where a0 , . . . , an−1 are certain constants and λ1 , . . . , λn are the eigenvalues of L, in-
cluding multiplicity. Without loss of generality, assume that λ1 is the zero eigenvalue.
Then
n
a1 = (−1)n
X Y
(−λ ) = −λ2 · · · λn ,
i
k=1 i∈{1,...,n},
i6 =k
l11 − λ · · · l1n
.. .. ..
det(L − λI ) = det . . .
ln1 ··· lnn − λ
remains unchanged if we first add all rows except the ith one to the ith row, and then
add all columns except the jth one to the jth column. Using the fact that both the rows
and columns of L have zero sums, we see that
Corollary 4. Consider a general mixing problem with n tanks such that conditions
(1)–(3) are satisfied. Then
x1 (0) + · · · + xn (0)
lim xi (t) = , for i ∈ {1, . . . , n}.
t→∞ n
Proof. The mixing problem leads to a linear system x 0 (t) = Ax(t), where A has a
simple zero eigenvalue and all remaining eigenvalues have negative real parts. The
eigenvectors corresponding to the zero eigenvalue have all components identical. Con-
sequently, for t → ∞, any solution of the system tends toward the state where all tanks
contain the same amount of salt. Since the total amount of salt is preserved, all com-
ponents of the solution must approach the arithmetic mean of the initial conditions.
Note that for disconnected balanced graphs, we have the following result.
Theorem 5. For the Laplacian matrix of a balanced directed graph, the algebraic
multiplicity of the zero eigenvalue equals the number of connected components.
Proof. After a suitable permutation of the vertices, the Laplacian matrix of the given
graph is the block diagonal matrix
L1 0 · · · 0
0 L2 · · · 0
L= ... .. . . . ,
. . ..
0 0 ··· Lk
and the conclusion follows from the fact that each L i has a simple zero eigenvalue.
T1
T2
T3
T4
T5
T6
T7
T8
and therefore
n
Y
det(A − λI ) = (−ki − λ).
i=1
0 0 0 ··· 0 0
λ 0 0 ··· 0 0
0 λ 0 ··· 0 0
A − λI = .. .. .. . . . .
. .. ..
. . .
0 0 0 ··· 0 0
0 0 0 ··· λ 0
has the form of a nilpotent Jordan block. The nullspace of A − λI has dimension 1
and is spanned by the vector v 1 = (0, . . . , 0, 1). Since we need n linearly independent
solutions, we have to look at the nullspaces of higher powers of A − λI . With each
successive power, the only nonzero band containing the values λ shifts one position
lower. It follows that the vector v k = (0, . . . , 0, 1, . . . , 1), whose first n − k compo-
nents are zero and the remaining equal to one, satisfies (A − λI )k v k = 0. This gives a
system of n linearly independent solutions of the form
e At v k = e(A−λI )t eλt v k
2 k−1
t 2t k−1 t
= I + (A − λI ) + (A − λI ) + · · · + (A − λI ) eλt v k ,
1! 2! (k − 1)!
where k ∈ {1, . . . , n}. Since λ < 0, all solutions approach the state with no salt in the
tanks.
7. CONCLUSION. The aim of this article was to present a topic that is accessible
to undergraduate students and displays a nice interplay between differential equations,
linear algebra, and graph theory. We remark that similar mathematical problems occur
in a completely different setting, namely in the study of coordination of multiagent
systems (see [11], [4] and the references there).
A possible project for the interested reader is to find other configurations (closed
or not) of n tanks leading to systems of differential equations that can be solved ana-
lytically. A slightly more ambitious project is to consider mixing problems where we
take into account the time necessary for the transport of brine between two tanks. The
corresponding mathematical model then becomes a linear system of delay differential
equations, which is naturally more difficult to analyze.
ACKNOWLEDGMENTS. I am grateful to Stan Wagon, who brought my attention to mixing problems, and
to the anonymous referees, whose suggestions helped to improve the paper.
REFERENCES
1. R. Agaev, P. Chebotarev, On the spectra of nonsymmetric Laplacian matrices, Linear Algebra Appl. 399
(2005) 157–168, available at http://dx.doi.org/10.1016/j.laa.2004.09.003.
2. R. B. Bapat, Graphs and Matrices, Universitext, Springer, London, 2010.
3. W. E. Boyce, R. C. DiPrima, Elementary Differential Equations and Boundary Value Problems, ninth
edition, Wiley, New York, 2009.
4. P. Yu. Chebotarev, R. P. Agaev, Coordination in multiagent systems and Laplacian spectra of di-
graphs, Autom. Remote Control 70 no. 3 (2009) 469–483, available at http://dx.doi.org/10.1134/
S0005117909030126.
ANTONÍN SLAVÍK is an assistant professor at the Charles University in Prague, where he received his
Ph.D. in 2005. His professional interests include differential equations, integration theory, history of math-
ematics, and computer science.
Charles University, Faculty of Mathematics and Physics, Sokolovská 83, 186 75 Praha 8, Czech Republic
slavik@karlin.mff.cuni.cz
REFERENCES
1. C. Alsina, M. S. Tomás, A geometrical proof of a new inequality for the Gamma function, J. Ineq.
Pure Appl. Math. 6 (2005) Art. 48.
2. F. Gao, Volumes of generalized balls, Amer. Math. Monthly 120 (2013) 130.
3. X. Wang, Volumes of generalized unit balls, Math. Mag. 78 (2005) 390–395.
—Submitted by Esther M. Garcı́a–Caballero and Samuel G. Moreno,
Departamento de Matemáticas, Universidad de Jaén, 23071 Jaén, Spain, and
Michael P. Prophet, Department of Mathematics, University of Northern Iowa,
Cedar Falls, IA, USA
http://dx.doi.org/10.4169/amer.math.monthly.120.09.821
MSC: Primary: 33B15, Secondary: 26D07
Abstract. We first provide a short survey of reciprocal sums. We discuss some of the history
of their computation and application, show how they are applied in various modern contexts,
and discuss some ways that their values are computed. We give an example of computing a
reciprocal sum by providing (we believe) the first computation of the sum of the reciprocals
of perfect numbers. Second, we introduce a new use for reciprocal sums; that is, they can be
used as a knowledge metric to classify the current state of number theorists’ understanding of
a given class of integers.
1. INTRODUCTION. Most math majors learn early in their education that there are
at least two “sizes” of infinity: countable and uncountable. Since all countably infinite
sets can be placed in one-to-one correspondence, such sets are said to have the same
cardinality. This beautiful fact, however, can obscure important distinctions among
these sets.
For example, the set of perfect (integer) squares, which we will call S, is an infinite
subset of the integers. Similarly, the set of primes P is also an infinite subset of the in-
tegers, and hence both sets are countable. However, in a very real sense, these two sets
have different “sizes”. Specifically, say we fix a positive real number x and consider
the integers of size at most x. Given a set A of integers, its counting function A(x) is
the number of elements of A up to x; thus S(x) represents the number of squares less
than or equal to x, and P(x) represents the number of primes less than or equal to x.
We can describe the growth of S(x) approximately by writing
√
S(x) = O( x),
which indicates
√ that for sufficiently large x, the value of S(x) is less than a constant
multiple of x. (Of course it is possible to get more precise bounds on S(x), but we
do not need those at this time.)
Similarly, it is known that
x
P(x) = O ,
log x
where log x denotes the natural logarithm of x (see, for example, the discussion and
proof in Chapter 3 of [24]). While both underlying sets are countably infinite, these
bounds let us see that there are more primes than squares up to x (at least for suffi-
ciently large x).
Comparing counting functions is a useful exercise. However, the comparison does
not take into account computational work on finding and enumerating the small ele-
ments of a set. For this, we need a different metric. One interesting measure comes
http://dx.doi.org/10.4169/amer.math.monthly.120.09.822
MSC: Primary 11A99
not only do we find that this sum converges, but we also know that its precise value is
π2
6
. In contrast, the reciprocal sum of the primes,
X1
n∈P
n
diverges. In this case, the best bounds that apply for x ≥ 3 [13] are
X1
log log x < < log log x + 1.
n∈P
n
n≤x
1 1 1 1 1 1
+ + + + + + ···
2 3 5 7 11 13
is infinitely great, but is infinitely smaller than the sum of the harmonic series
1 1 1 1
1+ + + + + ··· .
2 3 4 5
Also, the sum of the former is the logarithm of the latter.
Neither the proof, nor even the statement of the theorem, are rigorous by modern
standards, but it is not too difficult to see a way to modernize Euler’s claim to get a
deep (and correct!) result, namely that
1
P
log n≤x n
lim P 1
= 1.
x→∞
p≤x p
diverges [10]. This allowed him to answer one of the most important open questions
of his time; namely, that there are infinitely many prime numbers in any arithmetic
progression.
Interest in reciprocal sums continued throughout the 20th century. Viggo Brun fa-
mously proved [5] that the sum of the reciprocals of the twin primes converges, giving
birth to sieve theory—a major tool in the modern arsenal of analytic number theorists.
His sieve, now called Brun’s Sieve, has inspired many other sieve methods with fa-
miliar names, including the Large Sieve and Selberg’s Sieve [7, Chs. 7 and 8]. Even
more recently, reciprocal sums have motivated research on amicable numbers [1, 25]
and on prime-indexed-primes [2]. Thus, reciprocal sums have continued to influence
mathematics for at least 250 years.
Definition 2. Given a (nonempty) set of integers with a convergent reciprocal sum, its
Reciprocal Sum Metric is the quotient of the best-known upper bound and the best-
known lower bound on this reciprocal sum.
where A(t) is again the counting function of A up to t and n 0 is the smallest member
of A. Occasionally, the bounds available only apply for very large values of x or have
extremely large constants involved, and thus are only useful for very large x. When
this happens, it is usually a good idea to use some other method to bound the sum over
“medium” values.
We demonstrate this process by computing the reciprocal sum of the perfect num-
bers. To our knowledge, this has never appeared in print. It is known that all even
perfect numbers have the form
2 p−1 (2 p − 1)
where both p and 2 p − 1 are prime. This was shown by Euclid and Euler, and in their
honor we denote the set of perfect numbers by E. As sets go, E is relatively small.
However, the set of odd perfect numbers is probably even smaller. There are none
known, though we do know that the smallest odd perfect number must be greater than
10300 . Furthermore, we can show
√ that the number of odd perfect numbers up to some
(large) value of x is at most x. Since the proof of this is fairly straightforward, we
recall it here. Though this was first proven by Hornfeck [18], we shall follow the clear
exposition in [24, p. 251].
√
Theorem 3. The number of odd perfect numbers up to x is at most x.
Proof. We start by recalling a result of Euler, from his posthumous Tractatus [15], that
an odd perfect number n can be written as n = p k m 2 with p prime and gcd( p, m) = 1.
(Euler’s result itself is proven as Theorem 8.2 in [24].) √ We seek the number of such
n less than or equal to x. Certainly we must have m ≤ x, so we fix such an m and
consider the prime powers p k for which gcd( p, m) = 1. Since n is perfect and σ (n) is
a multiplicative function, we have σ ( p k m 2 ) = σ ( p k )σ (m 2 ) = 2 p k m 2 , and thus
σ ( pk ) 2m 2
= . (1)
pk σ (m 2 )
Since the fraction σ (ppk ) is already in lowest terms, we have that the values of σ (ppk ) are
k k
all distinct as p k ranges over prime powers. Thus,√there is at most one prime power p k
(with p - m) satisfying (1). Therefore,
√ each m ≤ x generates at most one odd perfect
n ≤ x, and there are at most x odd perfect numbers up to x.
To sum the reciprocals of the perfect numbers, we first calculate the sum over known
small values. Thanks to centuries of searching, many even perfect numbers have been
In this case, the “≈” symbol indicates that this sum is accurate to 150 decimal
places. It now remains for us to put bounds on the reciprocal sum of perfect numbers
greater than 10300 . We do this by splitting them into two cases, considering even and
odd perfect numbers separately.
We begin with the sum of reciprocals of the odd perfect numbers.
Theorem 4. The sum of the reciprocals of all odd perfect numbers greater than 10300
is less than 2 × 10−150 .
In order to sum the reciprocals of even perfect numbers greater than 10300 , we recall
the Euclid–Euler criterion that all such numbers are of the form 2 p−1 (2 p − 1) for prime
p. We use this result in the following.
Theorem 5. The sum of the reciprocals of all even perfect numbers greater than 10300
is less than 2 × 10−300 .
Proof. If an even perfect number m is larger than 10300 , then m = 2n−1 (2n − 1) for
some n ≥ 499. We derive an upper bound on the sum of the even perfect reciprocals
by assuming that in fact, the expression 2n−1 (2n − 1) gives a perfect number for all
n ≥ 499. We sum all such values, finding
X 1 X 1 X 1
= < < 2 × 10−300 ,
n≥499
(2 n−1 )(2n − 1)
n≥499
22n−1 − 2n−1
n≥499
2 2n−2
Theorem 6. The sum of the reciprocals of the perfect numbers greater than 10300 is
less than 3 × 10−150 .
This gives us the pleasing conclusion that the explicitly calculated value of S above
gives the reciprocal sum of all perfect numbers with an accuracy of at least 149 decimal
places.
Theorem 7. The sum of the reciprocals of the perfect numbers lies in the range
[S, S + 3 × 10−150 ].
Table 1. The Reciprocal Sum Metric (RSM) on the current knowledge of some arithmetic sequences
5.1. Prime-indexed primes. The prime-indexed primes are primes whose index in
the list of primes is itself prime. This sequence was first considered in 1975, and since
then mathematicians have improved their knowledge of the behavior of these integers
[2, 4, 12]. Using extensive computations and the best-available explicit bounds on
primes [13, 27], the best bounds on this reciprocal sum have been strengthened [2] to
X1
1.04299 < < 1.04365,
q
q
5.2. Twin primes. When Brun proved that the reciprocal sum of the twin primes
converges, his result was notable for the fact that in order to show this, he gave, for the
first time, an effective upper bound on the density of twin primes (here we can think
of the “density” of a set as its counting function). The prime numbers have asymptotic
density x/ log(x), and heuristically we expect that twin primes have density x/ log2 x.
Although Brun was not able to show this precisely, he was able to show that π2 (x), the
number of twin primes up to x, satisfies
5.3. Amicable numbers. Amicable numbers have been studied at least since the
time of the Pythagoreans. (The reader who has not thought lately about elementary
number theory or ancient Greek mathematics may wish to be reminded that two pos-
itive integers m, n are amicable if s(m) = n and s(n) = m, where s(n) is the sum
of the divisors of n other than n itself.) Amicable numbers remain fairly mysterious
objects. It was only in 1981 that Pomerance showed that the reciprocal sum of the
amicable numbers converges [25]. This sum is known as the Pomerance constant, and
is denoted P. No explicit bound was known for this sum until 2009, when the authors
showed [1] that
Note that the large gap between the lower and upper bounds on P reflects a deep
lack of understanding about the behavior of amicable numbers, particularly for small
values, which may have been difficult to notice without considering reciprocal sums.
All of the insights above about the accuracy of bounding these sums can be effec-
tively captured by the reciprocal sum metric from Definition 2.
X1
L(x) ≤ ≤ U (x)
n∈A
n
n≤x
U (x)
lim .
x→∞ L(x)
To illustrate this definition, we note that the reciprocal sum metric can tell us some-
thing interesting about the history of our knowledge of primes. Using the methods
described in Section 4, it is straightforward to show that if we know constants c1 , c2
such that c1 logx x < π(x) < c2 logx x , then the reciprocal sum metric is c2 /c1 . Since the
Prime Number Theorem was proven in 1896 by de la Valeé Poussin [26] and Hadamard
[16], the Reciprocal Sum Metric for the primes has been precisely 1. Before the proof,
π(x) π(x)
.92129 ≤ lim inf ≤ 1 ≤ lim sup ≤ 1.10555.
x→∞ x/ log x x→∞ x/ log x
< 1850 ∞
1850 1.200
1896 1
8. CONCLUSION. Humans love to break records. Being able to break a record nec-
essarily implies that something relating to the record is measurable. In sport, this is
often easy—running 100 meters faster than anyone else neatly proves that the runner
is the fastest. Even in computational mathematics, records are easy to measure and
break. There is a record for the largest prime yet discovered, a record for the best
bound on the density of abundant numbers [20], and a record for the highest level to
which the Goldbach Conjecture has been verified [22]. These records seem to be good
for us; their very existence pushes the mathematical community to greater efforts. It is
our hope that the reciprocal sum metric may help motivate somebody to continue the
centuries’ long tradition of exploring the values and applications of reciprocal sums.
conjecture. If Shinichi Mochizuki’s proof of the ABC conjecture [23] is verified, one of the many consequences
will be to render Wieferich primes a bit less mysterious.
REFERENCES
1. J. Bayless, D. Klyve, On the sum of reciprocals of amicable numbers, Integers 11A (2011) Article 5.
2. J. Bayless, D. Klyve, T. Oliviera e Silva, New bounds and computations on prime-indexed primes, to
appear in Integers.
3. R. Brent, Irregularities in the distribution of primes and twin primes, Math. Comp. 29 (1975) 43–56,
available at http://dx.doi.org/10.1090/S0025-5718-1975-0369287-1.
4. K. A. Broughan, A. R. Barnett, On the subsequence of primes having prime subscripts, J. Integer Seq. 12
(2) (2009).
5. V. Brun, La serie 1/5 + 1/7 + 1/11 + 1/13 + 1/17 + 1/19 + 1/29 + 1/31 + 1/41 + 1/43 + 1/59 +
1/61 + . . ., les dénominateurs sont nombres premiers jumeaux est convergente oú finie, Bull. Sci. Math.
43 (1919) 124–128.
6. P. Chebyshev, Mémoire sur les nombres premiers. Académie Impériale des Sciences, 1850.
7. A. Cojocaru, M. R. Murty, An Introduction to Sieve Methods and Their Applications, London Mathemat-
ical Society Texts, Vol. 66. Cambridge University Press, Cambridge, 2006.
8. R. Crandall, C. Pomerance, Prime Numbers: A Computational Perspective. second edition. Springer,
New York, 2005.
9. J. Derbyshire, Prime Obsession: Bernhard Riemann and the Greatest Unsolved Problem in Mathematics,
Plume, New York, 2004.
10. P. G. L. Dirichlet, Über eine neue Anwendung bestimmter Integrale auf die Summation endlicher oder
unendlicher Reihen. Abh. Königl. Pr. Wiss. Berlin (1835) 391.
11. F. Dorais, D. Klyve, A Wieferich Prime Search up to 6.7 × 1015 , J. Integer Seq. 14 (2011) Article 11.9.2.
12. R. E. Dressler, S. T. Parker, Primes with a prime subscript, J. Assoc. Comput. Mach. 22 (1975) 380–381,
available at http://dx.doi.org/10.1145/321892.321900.
13. P. Dusart, Autour de la fonction qui compte le nombre de nombres premiers. Ph.D. thesis, Université de
Limoges, Limoges, France, 1998.
14. L. Euler, Variae observationes circa series infinitas. (E72) Commentarii academiae scientiarum Petropoli-
tanae 9, 1744, 160–188. Republished in Opera Omnia: Series 1, Vol. 14, 217–244. A scan of the orig-
inal paper and an English translation by P. Viader Sr., L. Bibiloni, and P. Viader Jr. are available at
www.eulerarchive.org.
15. , Tractatus de numerorum doctrina capita sedecim quae supersunt, Commentationes arithmeticae
2, 1849, 503–575. Republished in Opera Omnia: Series I, Vol. 5, 182–283. Also available online at
eulerarchive.maa.org.
16. J. Hadamard, Sur la distribution des zéros de la fonction ζ (s) et ses conséquences arithmétiques, Bull.
Soc. Math. France 24 (1896) 199–220.
17. J. Havil, The Harmonic Series, in Gamma: Exploring Euler’s Constant. Princeton University Press,
Princeton, NJ, 2003, 21–25.
18. B. Hornfeck, Zur Dichte der Menge der vollkommenen Zahlen, Arch. Math. (Basel) 6 (1955) 442–443,
available at http://dx.doi.org/10.1007/BF01901120.
19. D. Klyve, Explicit Bounds on Twin Primes and Brun’s Constant. Ph.D. Thesis, Dartmouth College, 2007.
20. M. Kobayashi, On the Density of Abundant Numbers. Ph.D. Thesis, Dartmouth College, 2010.
21. P. Ochem, M. Rao, Odd perfect numbers are greater than 101500 , Math. Comp., 81 (2012) 1869–1877,
available at http://dx.doi.org/10.1090/S0025-5718-2012-02563-4.
22. T. Oliveira e Silva, S. Herzog, S. Pardi, Empirical Verification of the Even Goldbach Conjecture up to
4 · 1018 , to appear in Math Comp.
23. S. Mochizuki. Inter-universal teichmuller theory I-IV, Available on the author’s webpage, http://www.
kurims.kyoto-u.ac.jp/~motizuki/top-english.html.
24. P. Pollack, Not Always Buried Deep, American Mathematical Society, Providence, RI, 2009.
25. C. Pomerance, On the distribution of amicable numbers. II, J. Reine Angew. Math. 325 (1981) 183–188.
26. C. J. de la Vallée Poussin, Recherches analytiques sur la théorie des nombres (3 parts). Ann. Soc. Sci.
Bruxelles 20, Part II (1896) 183–256, 281–397.
27. J. B. Rosser, L. Schoenfeld, Sharper bounds for the Chebyshev functions θ(x) and ψ(x), Math. Comp.
29 (1975) 243–269.
JONATHAN BAYLESS is an assistant professor and mathematics coordinator at Husson University in Ban-
gor, Maine. In addition to number theory, his interests include philosophy and inquiry-based learning. He
enjoys logic puzzles, playing basketball, and spending time with his family.
Husson University, 1 College Circle, Bangor, ME 04401
baylessj@husson.edu
DOMINIC KLYVE is an associate professor at Central Washington University, where he also serves as the
Director of the Math Honors Program. He works in number theory and the history of mathematics, and is
always pleased to have the chance to combine these fields in a single work. He also enjoys learning about
new (non-mathematical) fields, and looking for opportunities to use mathematics to help him understand them
better.
Central Washington University, 400 E University Way, Ellensburg WA 98926
klyved@cwu.edu
Let pi denote the ith prime number, and suppose the sum converges. Then there is an index k
such that
∞
X 1
< 1.
p
i=k+1 i
Take A to be the set of positive integers whose prime factors are all ≤ pk , and let B be the
positive integers whose prime factors are all ≥ pk+1 . Note that 1 is vacuously a member of
both A and B since it has no prime factors. By the fundamental theorem of arithmetic, every
positive integer can be uniquely expressed as a product ab for some a ∈ A and b ∈ B. We
exploit the convergence of the geometric series to get
X1 ∞ ∞ ∞ ∞
X X 1 X 1 X 1 < ∞.
= ··· n n =
n
··· n
a∈A
a n =0
p 1 · · · pk k
n =0 1
p1
n =0 1 n =0 k
pk
1 k 1 k
Letting Bm denote the members of B with exactly m prime factors (not necessarily distinct),
we also get
∞ X ∞ ∞
!m
X1 X 1 X X 1
= ≤ < ∞.
b∈B
b m=0 b∈B
b m=0 i=k+1 pi
m
http://dx.doi.org/10.4169/amer.math.monthly.120.09.831
MSC: Primary 11A41
Over the centuries, this function and its various generalizations have turned up in many
central problems in number theory.
One place where the function has appeared is in countingPthe number Pof lattice
points below a hyperbola. It is a simple matter to show that x y≤c 1 = n≤c d(n),
where the former sum can be interpreted as the number of integer lattice points (x, y)
lying on or below the hyperbola x y = c, which are in the open first quadrant region
x > 0 and y > 0.
In 1849, Dirichlet [1] proved that
X
d(n) = x log x + (2γ − 1)x + q(x), (1.1)
n≤x
where | q(x) |≤ 4x 1/2 and γ is the Euler–Mascheroni constant. Since then, there have
been many papers devoted to exploring q(x) (we refer the reader to [4]–[10]).
Another noteworthy result is the paper written by J. B. Friedlander and H. Iwaniec
[3], in which asymptotic formulas for the sum of the divisor function applied to arith-
metic progressions are established.
InP this brief note, we establish some unanticipated inequalities among sums of the
n
type i=1 d(kq + a), using elementary q-series identities. The proofs reduce to show-
ing that the coefficients of certain infinite series are nonnegative. Such inequalities
seem to have no precedents in the mathematical literature.
Our main results are given in the following theorem.
http://dx.doi.org/10.4169/amer.math.monthly.120.09.832
MSC: Primary 11A25, Secondary 26D15
n
X n
X
d(24i + 1) ≥ d(6i + 1), (1.2)
i=1 i=1
n
X n
X
d(24i + 17) ≥ d(6i + 5), (1.3)
i=1 i=1
n
X n
X
d(3(8i + 3)) ≥ d(3(2i + 1)), (1.4)
i=1 i=1
n
X n
X n
X
d(8i + 1) ≥ d(4i + 1) ≥ d(2i + 1), (1.5)
i=1 i=1 i=1
n
X n
X
d(8i + 5) ≥ d(4i + 3), (1.6)
i=1 i=1
and
n
X n
X
d(72i + 1) ≥ d(18i + 1). (1.7)
i=1 i=1
∞ ∞ ∞
2 1 + qn X qn
qn d(n)q n .
X X
= = (2.1)
n=1
1 − qn n=1
1 − q n
n=1
Lemma 2.1. Let n be a positive integer and q be a complex number such that |q| < 1.
It follows that
∞ ∞
1 + q 2n+1
d(2n + 1)q n = q 2n(n+1)
X X
, (2.2)
n=0 n=0
1 − q 2n+1
∞ ∞
n 1 + q 2n+1
q n(n+1)
X X
d(4n + 1)q = , (2.3)
n=0 n=0
1 − q 2n+1
and
∞ ∞
X
n
X n(n+1) 1 + q 2n+1
d(8n + 1)q = q 2 . (2.4)
n=0 n=0
1 − q 2n+1
Next, we divide both sides of (2.7) by q, and replace q 2 by q to obtain (2.2). The iden-
tity (2.2) is obtained by first replacing q by −q in (2.2) and then adding the resulting
identity to (2.2). The final result is
∞ ∞ ∞
1 + q 2n+1
d(2n + 1)q n + (−1)n d(2n + 1)q n = q 2n(n+1)
X X X
from which (2.2) follows. The proof of (2.3) is identical to that of (2.2), and so we
leave this to the reader.
Since
∞
1
q 3n =
X
,
n=0
1 − q3
we find that
∞
! ∞ ! ∞
3n n 1−q 3n(n+1)/2 1+q 2n+1
q n(n+1)/2
X X X
q (d(8n +1) − d(2n +1))q = .
n=0 n=1 n=1
1−q 3 1−q 2n+1
(3.2)
and
m
X
(d(8(3i + 2) + 1) − d(2(3i + 2) + 1)),
i=1
on the right-hand side of (3.2) have Taylor series expansions with nonnegative coeffi-
cients. We complete the proofs of (1.2), (1.3), and (1.4) using these observations.
Following the above method, we subtract (2.2) from (2.3) to deduce that
∞ ∞
n 1 + q 2n+1
q n(n+1) (1 − q n(n+1) )
X X
(d(4n + 1) − d(2n + 1)) q = . (3.3)
n=1 n=1
1 − q 2n+1
n=0
1 − q2
(3.4)
and
m
X
(d(4(2i + 1) + 1) − d(2(2i + 1) + 1)) ≥ 0
i=1
1
now follow. Another possibility is to multiply both sides of (3.3) by 1−q
to obtain the
right side of (1.5).
m=1
1 − q 3(2m+1)
∞ 1 + q 6m+5
3(3m+2)(m+1) 9(3m+2)(m+1)
X
+ q 2 1−q 2 . (4.1)
m=1
1 − q 6m+5
In the second series on the right side, all the exponents of q are of the form 3n + 1.
Therefore, only the first and second sum on the right-hand side of (4.1) will contribute
to the coefficient of q 9k .
Thus, if
∞
1
q 9i =
X
i=0
1 − q9
m=1 i=0
∞ 9m(3m+1)
X 3m(3m+1) 1−q 2 1 + q 6m+1
= q 2 · ·
m=1
1 − q9 1 − q 6m+1
∞ 3(3m+2)(3m+1)
X 9m(m+1) 1−q 2 1 + q 6m+3
+ q 2+1 · ·
m=1
1 − q9 1 − q 6m+3
9(3m+2)(m+1)
∞
!
X 3(3m+2)(m+1) 1−q 2 1 + q 6m+5
+ q 2 · · . (4.2)
m=1
1−q 9 1 − q 6m+5
is found.
In the second series on the right side, the Taylor series expansion of
3(3m+2)(3m+1)
1−q 2
1 − q9
does not consist solely of positive coefficients. However, the coefficient of q 9m in (4.2)
is always positive or zero. Therefore,
m
X
(d(72n + 1) − d(18n + 1)) ≥ 0
n=1
now follows.
REFERENCES
1. P. G. L. Dirichlet, Über die Bestimmung der mittleren Werthe in der Zahlentheorie, Abhandlungen der
Königlich Preussischen Akademie der Wissenchaften 2 (1849) 69–83.
2. N. J. Fine, Basic Hypergeometric Series and Applications. Mathematical Surveys and Monographs, 27,
American Mathematical Society, Providence, RI, 1988.
3. J. B. Friedlander, H. Iwaniec, The Divisor Problem for Arithmetic Progressions, Acta Arithmetica 45 (3)
(1985) 273–277.
4. M. N. Huxley, Exponential Sums and Lattice Point, Proc. London Math. Soc. 60 (1990) 471–502.
5. , Exponential Sums and Lattice Points II, Proc. London Math. Soc. 66 (1993) 279–301.
6. , Exponential Sums and Lattice Points III, Proc. London Math. Soc. 87 (2003) 591–609, available
at http://dx.doi.org/10.1112/S0024611503014485.
7. G. A. Kolesnik, An Improvement of the Remainder Term in the Divisor Problem, Mat. Zametki 6 (1969)
545–554.
8. J.G van der Corput, Zum Teilerproblem, Math. Ann. 98 (1928) 697–716, available at http://dx.doi.
org/10.1007/BF01451619.
9. I. M. Vinogradov, Anzahl der Gitterpunkte in der Kugel, Traveaux Inst. Phys. Math. Stekloff 9 (1935)
17–38.
10. G. Voronoı̈, Sur un probleme du calcul des fonctions asymptotiques, Journal für die reine und ange-
wandte Mathematik 126 (1903) 241–282.
lim ein j α
j→∞
does not exist for almost all α ∈ R. The result is then applied to provide an alternative proof
that L 1 (R) is not weakly sequentially compact.
1. INTRODUCTION. It is well known that the sequence {einα }, for α a real number,
is divergent when α is an irrational multiple of 2π. More generally, consider the se-
quences {ein j α } with {n j } a subsequence of natural numbers. It is the purpose of this
note to explore the divergence properties of such sequences.
As a first example, consider the sequence {ei(an+b)α } with a and b positive integers.
For α an irrational multiple of 2π , this sequence is divergent. To see this, assume that
lim ei(an+b)α = L
n→∞
http://dx.doi.org/10.4169/amer.math.monthly.120.09.837
MSC: Primary 30A99, Secondary 28A99
Theorem. If {n j } is any subsequence of natural numbers, then the lim j→∞ ein j α fails
to exist for almost all α ∈ R.
for g integrable on [0, 2π ] (see [4, p. 2]). Applying it with g = χ E (α), the character-
istic function of the set E, we find that
Z Z 2π
inα
lim e dα = lim einα χ E (α) dα = 0;
n→∞ E n→∞ 0
as well. Consequently,
Z
f (α) dα = 0.
E
holds for any such subset. From this it follows that the subsets of E on which
Re f (α) and Im f (α) are positive or negative have measure zero, and therefore
we conclude that f (α) = 0 almost everywhere on E. But this is impossible, since
| f (α)| = lim j→∞ |ein j α | = 1 on E; hence lim j→∞ ein j α does not exist on any set of
positive measure in [0, 2π).
A similar result holds for the sequences {sin(n j α)} and {cos(n j α)}. We sketch
the proof for the sine sequence. Assuming that lim j→∞ sin(n j α) exists on a set E ⊂
[0, 2π ) of positive measure, the same reasoning as used above shows that this is only
possible if lim j→∞ sin(n j α) = 0 almost everywhere on E. But then
and
lim ei2n j α = 1
j→∞
by choosing an α for which lim j→∞ ein j α does not exist, provided that the last integral
on the right, the Fourier integral of f , never vanishes for α ∈ R. The latter can be
arranged for by (for example) taking f (x) = e−x for x ≥ 0, and f (x) = 0 for x < 0,
in which case
1
Z
e−iαx f (x) d x = 6= 0
R 1 + iα
for α ∈ R.
This takes care of the complex case. The real case is taken care of by observing that
at least one of
Z Z
cos(−αx) f (x + n j ) d x or sin(−αx) f (x + n j ) d x
R R
must diverge as j → ∞.
Another direct proof of the failure of weak sequential compactness for L 1 (R) can
be found in [1, p. 173]. An indirect proof of this failure can be based on the equivalence
of weak sequential compactness with reflexivity (see [2, p. 119 and p. 251] as well as
[5, p. 141]). Since L 1 (R) is not reflexive [3, p. 23], it follows immediately that it is not
weakly sequentially compact.
ACKNOWLEDGMENTS. I wish to thank the reviewers for suggestions that greatly improved the exposition.
REFERENCES
1. P. M. Fitzpatrick, H. L. Royden, Real Analysis, fourth edition, Prentice Hall, Boston, 2010.
2. R. E. Megginson, An Introduction to Banach Space Theory, Springer-Verlag, New York, 1998.
3. R. Shakarchi, E. M. Stein, Functional Analysis, Princeton University Press, Princeton, NJ, 2011.
4. E. M. Stein, G. Weiss, Fourier Analysis on Euclidean Spaces, Princeton University Press, Princeton, NJ,
1971.
5. K. Yosida, Functional Analysis, sixth edition, Springer-Verlag, Berlin, 1980.
Editor’s Note: Dr. Charles S. Kahane and his wife, Claire, were killed in an automobile accident in
May of this year. Charles received his Ph.D. from New York University in 1962, under the direction
of Professor Louis Nirenberg. Before coming to Vanderbilt University in 1969, he was an Assistant
Professor at the University of Minnesota. He retired from Vanderbilt in 1998 as Professor of Math-
ematics. His seminal work was on nonlinear parabolic partial differential equations. We extend our
deepest condolences to Dr. and Mrs. Kahane’s family.
1. INTRODUCTION. In [7, p. 82], just after his now famous triangle theorem,
Routh writes:
The author has not met with these expressions for the area of two triangles which
often occur. He has therefore placed them here in order that the argument in the
text may be more easily understood.
In this note, we revisit “these expressions” of Routh and show that they can be thought
of as special cases of only one expression. To our surprise, this unifying expression
relating the areas of two triangles seems to be missing from the literature. Thus, para-
phrasing Routh, we deemed it useful to place it here. We hope that the reader will find
our generalization of Routh’s triangle theorem not only natural but also intrinsically
beautiful in its symmetry.
Let ABC be a triangle determined by three non-collinear points A, B, C in the Eu-
clidean plane. A line joining a vertex to a point on the line containing the opposite side
is called a cevian. Given a point D on the line BC, the line AD is a cevian through
the vertex A. If D 6 = C, then this cevian is uniquely determined by x ∈ R \ {−1} such
−→ −→ −
→ −→
that BD = x DC. Conversely, for x ∈ R \ {−1}, the equality BD = x DC determines
uniquely a point D 6 = C on the line BC. In this note, for such a point D, we write
D = A x . It follows directly from the definition that A0 = B. We adopt the notation
A∞ = C. The line through A, which is parallel to BC, will also be considered as a
cevian through the vertex A. We denote it by AA−1 , thinking of A−1 as the “point at
infinity” on the line BC. In this way, we establish a bijection between the points of the
line BC and the set R ∪ {∞}. This bijection associates positive numbers to the points
between B and C; it associates numbers in (−1, 0) to the points between the point at
infinity and B; and it associates numbers less than −1 to the points between the point
at infinity and C. We use an analogous notation for the cevians through vertices B
−−→ −−→
and C. We will denote by B y the unique point on the line CA such that CB y = y B y A,
−→ −−→
and by C z the unique point on the line AB such that ACz = z C z B. By definition,
B0 = C, B∞ = A, C0 = A, and C∞ = B.
Bv Ax
P
By
Q
Au
R
A Cw Cz B
AB, BB0 = BC, CC0 = CA, and (3) yields P = A x , Q = B y , R = C z . In this case, the
triangle PQR is the cevial triangle A x B y C z ; see Figure 3.
C C
Bv
Au Ax
Ax P
R
By By Q
Bv
P
Q
R Au
A B A B
Cz Cw Cw Cz
Theorem. With the points P, Q, R defined in (3), the ratio between the area 11 of the
triangle PQR and the area 1 of the triangle ABC is given by
The natural domain of (5) is the set of all (x, y, z, u, v, w) ∈ R6 that satisfy (4).
However, it is again an exercise in multivariable limits, this time with six variables,
to check that formula (5) extends by continuity to all (x, y, z, u, v, w) ∈ (R ∪ {∞})6
that satisfy (4). Beautifully, with u = x, v = y, w = z, (5) simplifies to (2), and with
u = v = w = 0, it simplifies to (1). Also, as x, y, z → ∞, formula (5) becomes (1),
with x in (1) substituted by u, y by v, and z by w.
It is quite possible that any of the many proofs of Routh’s theorem (see, for ex-
ample, [2, Section 13.7], [4], [6], to mention a few) can be modified to prove this
generalization. We will prove it by using two standard undergraduate tools: linear
algebra and analytic geometry. First, we observe that the ratio of the areas remains
z zu
R= , . (6)
1 + z + zu 1 + z + zu
Let 2 stand for the left-hand side of (4). Since we assume that each pair of cevians
(AAx , BBv ), (BB y , CCw ) and (CCz , AAu ) intersects at exactly one point, we have
2 6= 0. As the area of the triangle ABC is 1/2, the ratio in the theorem is given by the
determinant
1 yw z
1 yw z
1+x+xv 1+y+yw 1+z+zu
x 1 zu
1
x 1 zu
=
1+x+xv 1+y+yw 1+z+zu 2
1 1 1 1 + x + xv 1 + y + yw 1 + z + zu
1 yw z 1 yw z 1 yw z
1 1 1
= x 1 zu + x 1 zu + x 1 zu
2 2 2
1 1 1 x y z xv yw zu
1
= (1 + ywzu + zx − z − zu − x yw)
2
1
+ (z + ywzux + zx y − zx − zuy − zx yw)
2
1
+ (zu + ywzuxv + zx yw − zxv − zuyw − zux yw)
2
1
= (1 − x yw − xvz − uyz + x yz + uvwx yz).
2
This proves (5). As a consequence of our theorem, we also obtain the following
unification of the theorems of Ceva and Menelaus.
Corollary. The points P, Q, and R defined in (3) are collinear if and only if
1 − x yw − xvz − uyz + x yz + x yzuvw = 0.
C C C
Au
Ax Au
R R
Bu P Au Bx Ax Bx Ax
R
Bx Bu P Q
Bu
Q P Q
A B A B A B
Cu Cx Cx Cu Cx Cu
C C
Bx Au Ax
Bu
P
Q
Au
R
Bu P Bx R
Ax Q
A B A B
Cx Cu Cu Cx
REFERENCES
Abstract. We prove a maximum principle for high-order derivatives under initial conditions.
Definition 1.1. The maximum principle holds for f (x) on [a, b] if f (x) ≤ max{ f (a),
f (b)} on [a, b].
It is well known that f 0 (x) ≥ 0 implies the maximum principle because of nonde-
creasingness, and that f 00 (x) ≥ 0 implies the maximum principle because of noncon-
cavity. It is also well known that f (n) (x) ≥ 0 where n ≥ 3 does not necessarily imply
the maximum principle. For example, consider f (x) = ±x 2 and its third derivative.
We present conditions under which f (n) (x) ≥ 0 where n ≥ 3 does imply the maxi-
mum principle. Let I = [a, b] be a closed interval of the real line, and let C n (I ) be the
http://dx.doi.org/10.4169/amer.math.monthly.120.09.846
MSC: Primary 26A06
Theorem 1.1. Let f (x) ∈ C n ([a, b]) for some n ≥ 2. If f (n) (x) ≥ 0 on [a, b], then
f (x) ≤ vn (x) on [a, b], where
The following theorem is a corollary of Theorem 1.1. It shows that the maximum
principle holds under initial conditions.
40
30
20
10
−3 −2 −1 1 2 3
It is false that f 0 (x), f 00 (x) ≥ 0 on [0, 3], but it is true that f 000 (x) ≥ 0 on [0, 3].
Also, f (3) = 40 ≥ P1 (3) = 4. Therefore, by Theorem 1.2, the maximum principle
holds for f (x) on [0, 3].
Proof of Theorem 1.1. Suppose that f (x) ∈ C n ([a, b]) for some n ≥ 2 and f (n) (x) ≥
0 on [a, b]. By Taylor’s theorem,
1
f (x) − Pn−2 (x) 1
Z
g(x) := = (1 − t)n−2 f (n−1) (a + t (x − a)) dt.
(x − a) n−1 (n − 2)! 0
Since f (n) (x) ≥ 0 on [a, b], g 0 (x) ≥ 0 on [a, b], so g(x) ≤ g(b) on [a, b], which
implies
on [a, b].
f (b) − P1 (b)
f (x) ≤ v3 (x) = P1 (x) + (x − a)2
(b − a)2
on [a, b]. Because P100 (x) = 0 and f (b) − P1 (b) ≥ 0, we have v300 (x) ≥ 0, which, by
convexity, implies that v3 (x) ≤ max{v(a), v(b)} on [a, b]. It is obvious that v3 (a) =
f (a) and v3 (b) = f (b), so f (x) ≤ v3 (x) ≤ max{ f (a), f (b)} on [a, b]. Therefore, the
theorem holds in the initial case n = 3.
Now, we assume that the theorem holds in the case n = k where k ≥ 3, and we
want to show that the theorem holds in the case n = k + 1. To do this, suppose that
for some f (x) ∈ C k+1 ([a, b]), we have f (k+1) (x) ≥ 0 on [a, b] and f (b) ≥ Pi (b) for
i = 1, . . . , k − 1. By Theorem 1.1, we have
j j (i)
f (i) (a) vk+1 (a)
(b − a)i = (b − a)i
X X
vk+1 (b) = f (b) ≥ P j (b) =
i=0
i! i=0
i!
on [a, b]. Therefore, f (x) ≤ vk+1 (x) ≤ max{ f (a), f (b)} on [a, b]. This concludes
the mathematical induction, and the proof is complete.
Abstract. We survey results on equiangular n-vertex polygons with edge lengths in arithmetic
progression. Such a polygon exists if and only if n has at least two distinct prime factors.
We tell a tale of mathematics, old and aged, and rooted deeply in the last century.
Indeed, let us go back, all the way to the year 1983, when the proud nation of Sweden
proposed the following problem for the 24th International Mathematical Olympiad in
Paris; see page 167 in the problem collection [3].
Let n be a positive integer having at least two different prime factors. Show that
there exists a permutation a1 , a2 , . . . , an of the integers 1, 2, . . . , n such that
n
X 2πak
k · cos = 0.
k=1
n
Alas, the Swedish proposal was only one out of 25 strong candidates on the problem
short-list, and the international jury decided not to select it for the competition. The
problem fell into oblivion until 1986, when Murray Klamkin published his collection
[4] of IMO problems together with 40 supplementary problems. The first one of the
40 supplementary problems was the Swedish proposal from 1983.
Klamkin’s solution [4, p. 61]) writes n = pq as the product of two relative prime
integers p, q > 1, and considers the unit vectors v k = e2kπi/n for k = 1, . . . , n. If o
denotes the zero vector, then Klamkin observes that
vr + vr + p + vr +2 p + · · · + vr +(q−1) p = o (1)
holds for r = 1, 2, . . . , p, as the vectors in each of these sums form a (closed!) regular
q-gon. Analogously, he observes that
This does not only settle the Swedish problem, but also yields the following strength-
ening.
http://dx.doi.org/10.4169/amer.math.monthly.120.09.849
MSC: Primary 12D10, Secondary 52B12
The plot thickened again in fall 2005, after two decades of endless silence, when
Brendan McKay posted the following problem to the SEQFAN mailing list.
Suppose you have n objects with weights 1, 2, 3, . . . , n. How many ways are
there to place these objects evenly spaced around the circumference of a disk so
that the disk will exactly balance on the center point?
In algebraic terms, how many permutations π ∈ Sn are there such that the
polynomial π(1) + π(2)x + · · · + π(n)x n−1 has e2πi/n as a zero?
McKay [5] said that his question was based on a recent puzzle in the New Scientist,
and he also wrote that he didn’t even know which values n have a solution. Within a
couple of days, the positive result of Murray Klamkin was rediscovered, and William
Edwin Clark from the University of South Florida showed that all remaining cases are
unsolvable.
Clark’s argument [1] is as elegant as it is simple: Let n = p k be a prime power. If
a polynomial P(x) = π(1) + π(2)x + · · · + π(n)x n−1 has the nth primitive root of
unity e2πi/n as a zero, then it is divisible by the nth cyclotomic polynomial 8n (x) =
P p−1 i pk−1
i=0 x . In other words. P(x) = 8n (x)Q(x) for some polynomial Q ∈ Z[x] of
degree p k−1 − 1. But then the coefficients of P will just be the p k−1 coefficients of Q
repeated p times, and can never form a permutation of the integers 1, . . . , p k .
Theorem 2. [1] If n is a prime power, then there is no equiangular n-gon whose edge
lengths form a permutation of 1, . . . , n.
Although Theorems 1 and 2 provide the full picture on such equiangular n-gons,
there are three further chapters to this fascinating tale. The first chapter added se-
quence A118887 to the On-Line Encyclopedia of Integer Sequences [6], the sequence
described in McKay’s posting. The second chapter was written by Robert Dawson [2],
who rediscovered the above results for even n and for n = 3, 5, 7. The third and last
chapter is the current article, summarizing the history of this problem. And that’s the
end of this story.
REFERENCES
Abstract. It has been observed many times, both in the M ONTHLY and elsewhere, that the
set of all quotients of prime numbers is dense in the positive real numbers. In this short note
we answer the related question: “Is the set of all quotients of Gaussian primes dense in the
complex plane?”
Figure 1. Gaussian primes a + bi satisfying |a|, |b| ≤ 50 and |a|, |b| ≤ 100, respectively
http://dx.doi.org/10.4169/amer.math.monthly.120.09.851
MSC: Primary 11A41, Secondary 11A99
π3 (x) 1
lim = ,
x→∞ x/ log x 2
whence
π3 (xa)
lim [π3 (xb) − π3 (xa)] = lim π3 (xb) 1 −
x→∞ x→∞ π3 (xb)
xa log xb
= lim π3 (xb) 1 −
x→∞ xb log xa
a
= 1− lim π3 (xb)
b x→∞
= ∞,
where b > 0 is an absolute constant [9] (see also [6, Thms. 2,3]).
ρ N K ρ N K
100 50 53 1,000 0 5
500 946 940 5,000 0 100
1,000 3,327 3,346 10,000 369 367
5,000 66,712 66,651 50,000 7,823 7,732
10,000 245,085 245,200 100,000 28,964 28,971
25,000 1,384,746 1,385,602 250,000 167,197 167,099
50,000 5,168,740 5,167,941 500,000 632,781 631,552
2π 2π
(a) π
24
≤ arg z ≤ 47
(b) π
31415
≤ arg z ≤ 31415
Figure 2. The number N of Gaussian primes in the specified sector with |z| < ρ, along with the corresponding
estimate K (rounded to the nearest whole number) provided by (2)
|γ | |γ |
<q< .
R r
Since q is real and positive, it follows that r < | γq | < R and α < arg γq < β so that
γ /q is a quotient of Gaussian primes which belongs to the desired region (1).
ACKNOWLEDGMENTS. We thank the anonymous referees for several helpful suggestions. This work was
partially supported by National Science Foundation Grant DMS-1001614.
REFERENCES
1. J. Bukor, J. T. Tóth, On accumulation points of ratio sets of positive integers, Amer. Math. Monthly 103
(1996) 502–504, available at http://dx.doi.org/10.2307/2974720.
2. J.-M. DeKonick, A. Mercier, 1001 Problems in Classical Number Theory, American Mathematical So-
ciety, Providence, RI, 2007.
3. B. Fine, G. Rosenberger, Number Theory: An Introduction via the Distribution of Primes, Birkhäuser,
Boston, 2007.
4. S. R. Garcia, V. Selhorst-Jones, D. E. Poore, N. Simon, Quotient sets and diophantine equations, Amer.
Math. Monthly 118 (2011) 704–711.
5. G. H. Hardy, E. M. Wright, An Introduction to the Theory of Numbers, sixth edition, Oxford University
Press, Oxford, 2008. Revised by D. R. Heath-Brown and J. H. Silverman, with a foreword by Andrew
Wiles.
6. G. Harman, P. Lewis, Gaussian primes in narrow sectors, Mathematika 48 (2001) 119–135, available at
http://dx.doi.org/10.1112/S0025579300014388.
7. S. Hedman, D. Rose, Light subets of N with dense quotient sets, Amer. Math. Monthly 116 (2009) 635–
641, available at http://dx.doi.org/10.4169/193009709X458618.
8. D. Hobby, D. M. Silberger, Quotients of primes, Amer. Math. Monthly 100 (1993) 50–52, available at
http://dx.doi.org/10.2307/2324814.
9. I. Kubilyus, The distribution of Gaussian primes in sectors and contours, Leningrad. Gos. Univ. Uč. Zap.
Ser. Nauk 137(19) (1950) 40–52.
10. A. Nowicki, Editor’s endnotes, Amer. Math. Monthly 117 (2010) 755–756.
11. P. Pollack, Not Always Buried Deep: A Second Course in Elementary Number Theory, American Math-
ematical Society, Providence, RI, 2009.
12. P. Ribenboim, The Book of Prime Number Records, second edition, Springer-Verlag, New York, 1989.
13. P. Starni, Answers to two questions concerning quotients of primes, Amer. Math. Monthly 102 (1995)
347–349, available at http://dx.doi.org/10.2307/2974957.
PROBLEMS
where the product runs over all primes, taken in increasing order. Evaluate M(2).
SOLUTIONS
A Singular Matrix
11593 [2011, 747]. Proposed by Peter McGrath, Brown University, Providence, RI.
For positive integers k and n, we let T (n, k) be the n × n matrix with (i, j)-entry
((i − 1)n + j)k . Prove that for n > k + 1, det(T (n, k)) = 0.
Solution by Moubinool Omarjee, Paris, France. We prove a more general result. Let
K be a commutative field with characteristic 0. Let Q be a polynomial of degree k
with k > 0, and let a1 , . . . , an be distinct scalars in K . For n > k + 1, we show that
the determinant of the matrix M whose (i, j)-entry is Q(ai + j − 1) is 0.
An Integral Product
11594 [2011, 747]. Proposed by Harm Derksen and Jeffrey Lagarias, University of
Michigan, Ann Arbor, MI. Let
n k−1
Y Y j ,
Gn =
k=1 j=1
k
is a borrow for some j, and p divides G n (see E. Kummer, Jour. für Math. 44, 1852,
115–116). Theorems by Lucas and N.J. Fine also can be used.
Also solved by M. Bataille (France), A. Bostan (France), P. Budney, B. S. Burdick, N. Caro (Brazil), R. Chap-
man (U. K.), W. ChengYuan (Singapore), P. P. Dályay (Hungary), A.-M. Ernvall-Hytönen (Finland), D. Fleis-
chman, J. Freeman, O. Geupel (Germany), M. Goldenberg and M. Kaplan, J.-P. Grivaux (France), D. Hender-
son, E. Hysnelaj & E. Bojaxhiu (Australia & Germany), E. J. Ionascu, Y. J. Ionin, B. Karaivanov, J. C. Kieffer,
O. Kouba (Syria), H. Kwong, J. H. Lindsey II, L. Lipták, O. P. Lossers (Netherlands), R. Martin (Germany),
Á. Plaza (Spain), P. Pongsriiam & T. Pongsriiam (U. S. A. & Thailand), C. R. Pranesachar (India), R. E. Prather,
B. Schmuland (Canada), A. Stenger, R. Stong, R. Tauraso (Italy), M. Tetiva (Romania), D. B. Tyler, Z. Vörös
A Fibonacci Congruence
11602 [2011, 846]. Proposed by Roberto Tauraso, Università di Roma “Tor Vergata,”
Rome, Italy. Let p be a prime. Let Fn denote the nth Fibonacci number. Show that
X Fi
≡0 (mod p).
0<i< j<k< p
i jk
(A rational number is deemed congruent to 0 mod p if, when put in reduced form, the
numerator is a multiple of p.)
Solution by Yury J. Ionin, Champaign, IL. The statement holds vacuously for p = 2
and p = 3, and it can be easily verified for p = 5 (the left side simplifies to 5/12),
so assume that p > 5. Consider the field F p of the residue classes modulo p. The
discriminant of the quadratic equation x 2 − x − 1 = 0 is 5, so the equation has distinct
roots α and β. These roots are in F p if 5 is a quadratic residue modulo p; otherwise,
they are in a quadratic extension of F p . In either case, the Fibonacci numbers are given
by Fn = (α n − β n )/(α − β), and they lie in F p . The congruence to be shown is then
equivalent to the following identity in a field K containing α, β, and all of F p :
p−1 k−1 j−1 i
X X X α − βi
= 0. (1)
k=3 j=2 i=1
i jk
and let T (x) = S(1 − x).P Since α + β = 1, identity (1) holds if S = T . Clearly S(0) =
0, and T (0) = S(1) = 16 1
i jk
, where the summation extends over all ordered triples
(i, j, k) of distinct nonzero elements of F p . Choose an element l ∈ F p such that l 3 6 = 1
and l 6= 0. Since Tl(0) = 16 1
= T (0), we obtain T (0) = 0. Since S and T
P
3 (li)(l j)(lk)
are polynomials of degree at most p − 3, identity (1) holds if S 0 = T 0 . We perform
and
p−1 k−1
X X (1 − x) j−1 − 1
T 0 (x) = −S 0 (1 − x) = .
k=3 j=2
jkx
Let V (x) = x(x − 1)U 0 (x). Since U 0 is a polynomial, V (0) = 0. The degree of V is
at most p − 1, so (1) holds if V 0 (a) = 0 for all a ∈ F p . Since k=1
P p−1 1 P p−1
k
= k=1 k = 0
P p−1 2
in F p , we have k=3 k = −3. Thus
p−1
X x p−1 − x 2 (1 − x) p−1 − (1 − x)2
V 0 (x) = 3 + x k−1 + (1 − x)k−1 = 3 + .
−
k=3
x −1 x
Also solved by O. Geupel (Germany), O. P. Lossers (Netherlands), Ellington Management Problem Solving
Group, and the proposer.
4-cycle. Independence of the three square roots is ruled out, since p only has degree
4 over the rationals. Thus, P has exactly one rational (and hence integral) root, which
we may assume is pr + qs. Henceforth call it m.
(b) We have a 2 + 4(m − b) = ( p − q + r − s)2 ; suppose that the value is the
nonzero square k 2 . Note that p + r = (a + k)/2 and q + s = (a − k)/2, so both are
rational, and hence they are preserved by any σ ∈ G. Thus, σ ( p) + σ (r ) = p + r 6 =
q + s, and so {σ ( p), σ (r )} and { p, r } cannot be disjoint. Since these two-element sets
overlap and have the same sums, they must agree. Hence, G preserves both { p, r } and
{q, s}. This again forces G to be a subgroup of Z22 , again a contradiction.
(c) Note that G is a subgroup of the group h( p q r s), ( p r )i of permutations. We
can verify that
2( p − r )(q − s)( p − q + r − s)
is fixed by ( p q r s) and is negated by ( p r ). Therefore, G = h( p q r s)i is cyclic
if and only if X is a square, where X = 4( p − r )2 (q − s)2 ( p − q + r − s)2 . But X =
(4m − 4b + a 2 )(2m + 2b − a 2 )2 = (a 3 − 4ab + 8c)2 , so X is a square. In the special
case where a = 0, we obtain
X/16 = (m − b)(m + b)2 − 4c2 ,
so the characterization is as claimed.
Editorial comment. The Ellington Management Problem Solving Group also changed
the problem statement in the same way, and for the same reason.
n−1
X X Y1
(2n + 1 − 2k) = (n + 1)((n + 2) − H (n + 1)).
k=0 A∈S (n) j∈A
j
k
we have
n
Y n
j +1 X j
g (1) =
0
3+2
j=2
j j=2
j +1
n
n+1 X 1
= 3+2 1−
2 j=2
j +1
3 1 1 1
= (n + 1) + n − 1 − − − ··· −
2 3 4 n+1
= (n + 1)(n + 2 − H (n + 1)),
as desired.
Also solved by T. Amdeberhan & A. Straub, D. Beckwith, N. Caro (Brazil), R. Chapman (U.K.), P. P. Dályay
(Hungary), E. S. Eyeson, O. Geupel (Germany), Y. J. Ionin, B. Karaivanov, O. Kouba (Syria), R. Leroy,
J. H. Lindsey II, O. P. Lossers (Netherlands), G. Martin (Canada), M. A. Prasad (India), R. Pratt, J. Schlosberg,
E. Schmeichel, B. Schmuland (Canada), J. H. Steelman, A. Stenger, R. Stong, R. Tauraso (Italy), M. Tetiva
(Romania), D. B. Tyler, Z. Zhang, GCHQ Problem Solving Group (U.K.), NSA Problems Group, and the
proposer.
Calculus: Modeling and Application, second edition. By David A. Smith and Lawrence
C. Moore. Mathematical Association of America, Washington, D.C., 2010. http://maa.
pinnaclecart.com. Price: $35.00. ISBN 978-1-6144-610-1.
REFERENCES
1. B. Darken, R. Wynegar, S. Kuhn, Evaluating calculus reform: a review and a longitudinal study, CBMS
Issues in Mathematics Education, Vol. 8. Edited by E. Dubinsky, A. Schoenfeld, and J. Kaput. American
Mathematical Society, Providence, RI, 2000, 16–41.
2. D. Smith, L. Moore, The Calculus Reader. D.C. Heath, Lexington, MA, 1994.
The book is intended as the primary text for an introductory course in proving
theorems, as well as for self-study or as a reference. Throughout the text, some pieces
(usually proofs) are left as exercises; Part V gives hints to help students find good
approaches to the exercises. Part I introduces the language of mathematics and the
methods of proof. The mathematical content of Parts II through IV were chosen so as
not to seriously overlap the standard mathematics major. In Part II, students study
sets, functions, equivalence and order relations, and cardinality. Part III concerns
algebra. The goal is to prove that the real numbers form the unique, up to isomor-
phism, ordered field with the least upper bound; in the process, we construct the real
numbers starting with the natural numbers. Students will be prepared for an abstract
linear algebra or modern algebra course. Part IV studies analysis. Continuity and dif-
ferentiation are considered in the context of time scales (nonempty closed subsets of
the real numbers). Students will be prepared for advanced calculus and general topol-
ogy courses. There is a lot of room for instructors to skip and choose topics from among
those that are presented.
www.cambridge.org/mathematics
800.872.7423
@cambUP_maths