Beruflich Dokumente
Kultur Dokumente
Editorii
S. Axler
F.W. Gehring
K.A. Ribet
The Fundamental
Theorem of Algebra
Springer
Benjamin Fine Gerhard Rosenberger
Department of Mathematics Department of Mathematics
Fairfield University University of Dortmund
Fairfield, cr 06430 Dortmund
USA Germany
Editorial Board
S. Axler F.W. Gehring K.A. Ribet
Mathematics Department Mathematics Department Department of
San Francisco State East HalI Mathematics
University University of Michigan University of California
San Francisco, CA 94132 Ann Arbor, MI 48109 at Berkeley
USA USA Berkeley, CA 94720-3840
USA
Mathematics Subject Classification (1991): 11-01, 30-01, 54-01
9 876 543 2
ISBN 978-1-4612-7343-1
1b our families:
These notes grew out of two courses, one given in the United States and
one given in Germany on the Fundamental Theorem of Algebra. The
purpose of these courses was to present a great deal of nonelementary
mathematics, all centered on a single topic. The Fundamental Theorem
ofAlgebra was ideal for this purpose. Analysis, algebra and topology each
have developed different techniques which surround this result. These
techniques lead to different proofs and different views of this impor-
tant result. It is startling how much mathematics can be introduced and
learned in this manner.
In the United States it was presented as a "capstone" course for upper
level undergraduates. Many of the topics were familiar to the students
but many were new. The goal of continually returning to a proof of the
Fundamental Theorem of Algebra gave a focus to a large body of (what is
at first glance) seemingly unrelated material. In addition, many nice ap-
plications, such as the insolvability of the quintic and the transcendence
of e and 7T: could be introduced. We feel that undergraduates in such a
capstone course are an ideal audience for the book. Many departments
in the U.S. are adopting the idea of a summary course. In addition, the
book could serve as a foundation reference for beginning graduate stu-
dents. We also feel that the algebra sections, Chapters 2,3,6, 7, could be
used, with some additions from outside sources, as an alternative version
of an undergraduate algebra course or as a supplement for such a course.
The United States version of the course covered in one semester, with
some omissions, most of the material in Chapters 1 through 7. The whole
book could be covered at a relatively moderate pace in two semesters.
viii Preface
Preface vii
2 Complex Numbers 5
2.1 Fields and the Real Field 5
2.2 The Complex Number Field 10
2.3 Geometrical Representation of Complex Numbers 12
2.4 Polar Form and Euler's Identity 14
2.5 DeMoivre's Theorem for Powers and Roots 17
Exercises 19
Index 205
Introduction
and Historical
CHAPTER Remarks
particular the definitions ofgroups,rings and fields, and has studied some
linear algebra, in particular the abstract definition of a vector space over
a general field. Beyond this we have tried to make these notes as self-
contained as possible. However, our goal is to arrive at an understanding
of the proofs of the Fundamental Theorem. Therefore, along the way we
have proved only those results that can be obtained directly and have left
more difficult results (such as the proofofthe fundamental theorem ofGa-
lois theory) to the references. A note here about terminology. In standard
usage equations have roots, and functions and polynomials have zeros.
Historically however, the word root has been most often connected with
the Fundamental Theorem of Algebra. Therefore we use the term root
throughout these notes, and do not really differentiate between the Zero
of the polynomial F(z) and the root of the polynomial equation F(z) = O.
The first mention ofthe Fundamental Theorem ofAlgebra, in the form
that every polynomial equation of degree n has exactly n roots, was given
by Peter Roth of Nurnberg in 1608. However its conjecture is generally
credited to Girard who also stated the result in 1629. It was then more
clearly stated by Descartes in 1637 who also distinguished between real
and imaginary roots. The first published proof of the Fundamental The-
orem of Algebra was then given by D'Alembert in 1746. However there
were gaps in D'Alembert's proof and the first fully accepted proof was
that given by Gauss in 1797 in his Ph.D. thesis. This was published in
1799. Interestingly enough, in reviewing Gauss's original proof, modern
scholars tend to agree that there are as many holes in this proof as in
D'Alembert's proof. Gauss, however, published three other proofs with no
such holes. He published second and third proofs in 1816 while his final
proof, which was essentially another version of the first, was presented
in 1849.
In the main part of these notes we do not touch on Gauss's orig-
inal proof, which in outline went as follows. Since F(z) is a complex
number for any z E C and since z = x + iy with x, y E lR we have
F(z) = u(x, y) + iv(x, y). The equations u(x, y) = 0 and vex, y) = 0 then
represent curves in the plane lR 2 . Bya careful examination ofthe possible
functions u(x, y), vex, y) for a complex polynomial P(z), Gauss showed that
the curves u(x, y) = 0, vex, y) = 0 must have a common solution (Xo, Yo).
The complex number Zo = Xo + iyo is then a root of p(z). In Appendix A
we present a version of Gauss's original proof.
The Fundamental Theorem of Algebra is actually part of a general
development in the theory of equations. The ability to solve quadratic
equations and in essence the quadratic formula was known to the Baby-
lonians some 3600 years ago. With the discovery of imaginary numbers,
the quadratic formula then says that any degree two polynomial over C
has a root in C. In the sixteenth century the Italian mathematician Niccolo
Thrtaglia discovered a similar formula in terms of radicals to solve cubic
equations. This cubic formula is now known erroneously as Cardano's
4 1. Introduction and Historical Remarks
EXAMPLE 2.1.1
Consider the set
Q(../2) = {a + b../2; a, bE Q}.
On Q(,J2) define addition and multiplication via algebraic manipulation;
that is, if x = a + b,J2, y = e + d,J2 then
x + y = (a + e) + (b + d)../2
and
xy = (ae + 2bd) + (be + ad)../2.
1b verify that Q(,J2) is a field we must show that is satisfies properties
(1) through (9). We leave most of this to the exercises and only show that
nonzero elements have multiplicative inverses.
First of all we can identify Q with {a + o,J2; a E Q} so that Q can
be considered as a subset of Q( ,J2). From the definitions of addition and
multiplication it is clear that 0 E Q and 1 E Q are respectively the additive
and multiplicative identities in Q( ,J2). Now suppose x = a + b,J2 with
x # o. Then a and b are not both zero and since ,J2 is irrational, a 2 - 2b2 #
O. Let x = a - b,J2. Then by a straightforward computation we have
xx = a 2 - 2b 2 .
Now let
x a b,J2
y=
a2 - 2b2 a2 - 2b2 a2 - 2b2 •
If F} is a subset ofa field F that is also a field under the same operations
and identities as F we say that F} is a subfield of F and F is an extension
2.1. Fields and the Real Field 7
Theorem 2.1.1
F l ~ F is a subfield if and only if Fl =f: 0 and F l is closed under addition,
subtraction, multiplication, and division.
EXAMPLE 2.1.2
Show that Q(./2) is a subfield of JR and therefore a field.
In example 2.1.1 and in the exercises we showed directly that Q(./2)
is a field. Here we use Theorem 2.1.1 to accomplish the same thing.
Now, Q C Q(./2), so Q(./2) =f: 0. Therefore, to show that it is a field
we must show that Q(./2) closed under the four arithmetic operations.
Suppose then that x = a + b./2 and y = c + d./2 with y =f: 0 and
a, b, c, d E Q. Then:
x ± y = (a ± c) + (b ± d)h E Q(h)
since a ± CEQ, b ± d E Q
c d.J2 r;:;.
1/Y = c2 _ 2d2 - c2 _ 2d2 E Q( v 2)
c tr1I d
since E'l EQ
c2 - 2d2 ' c2 - 2d2
From the last two facts it follows that x/y is in Q(./2). Therefore Q(./2) is
nonempty and closed under the arithmetic operations so it is a subfield
~~ 0
(5) 1V = V for V E V.
If FI is a subfield of F then multiplication ofelements of F by elements
of F I are still in F. Since F is an abelian group under addition, F can
be considered as a vector space over Fl. Thus any extension field is a
vector space over any of its subfields. If F is an extension field of F I , the
dimension of F, as a vector space over F I , is called the degree of the
extension.
EXAMPLE 2.1.3
(1) Q( viz) has degree 2 over Q since {I, viz} form a basis.
(2) JR is infinite dimensional over Q, so JR is an extension ofinfinite degree
o~rQ. 0
EXAMPLE 2.1.4
Q, as well as any other subfield of JR, is also an ordered field. However,
the integers modulo a prime, Zp, is not. Having a set of positive elements
closed under addition and multiplication and satisfying the trichotomy
law forces a field to be infinite. Therefore, ordered fields must be infinite,
and since Zp is finite it cannot be an ordered field. 0
LUB Property
If 8 C JR is bounded above, then 8 has a least upper bound So (s :::: So
for all s E 8, and if s :::: SI for all s E 8 then So :::: SI).
Corollary 2.1.1
Suppose f:[a,b] ~ JR is continuous and f(a)f(b) < O. Then there exists a
value c with a < c < band fCc) = o.
1b see the corollary notice that f(a)f(b) < 0 implies that f(a) and f(b)
have different signs. Withoutlossofgeneralitysupposef(a) < O,f(b) > O.
Then 0 is between f(a) and f(b), and the intermediate value theorem
applies.
These special properties completely determine JR up to isomorphism.
If Rand S are rings, a function f : R ~ S is a homomorphism if it
satisfies:
(1) f(rI + r2) = f(rI) + f(r2) for rIt r2 E R.
(2) f(rIr2) = f(rI)f(r2) for rI, r2 E R.
If f is also a bijection, then f is an isomorphism, and Rand S are
isomorphic. Isomorphic algebraic structures are essentially algebraically
th~ same. We have the following theorem.
1Mneorem 2.1.3
Let F be a complete Archimedean ordered field. Then F is isomorphic to R
EXAMPLE 2.2.1
Let z = 3 + H, W =7- 2 i then:
(1) z + W = 10 + 2 i.
(2) z - W = -4 + 6 i.
(3) zw = 29 + 22 i. o
It is easy to verify [see the exercises]that under these definitions C
forms a commutative ring with an identity and that JR is a subring of C.
The zero is 0 E JR, identified as 0 + 0 i, and the multiplicative identity is
1 E JR identified with 1 + 0 i. In order for C to be a field we must have
multiplicative inverses. We now show how to construct these.
Definition 2.2.1
If z E C with z = x + i y, then the complex conjugate of z, denoted by
Z, is z = x - iy, and the absolute value, or modulus, of z, denoted by
Jx
Izl, is Izl = 2 + yZ.
EXAMPLE 2.2.2
Let z = 3 + H. Then z = 3 - Hand Izi = ../9 + 16 = 5. o
The following lemmas give the properties ofthe complex conjugate and
the absolute value, and from these we will be able to construct inverses.
Lemma 2.2.1
Let z,w
E C. Then
Lemma 2.2.2
E C. Then
Let z,w
(l)z+w=z+w.
(2) ~ = (Z)(W).
(3) z= z.
(4) Izl = Izl·
(5) z = z if and only if z is real.
Properties (1) through (4) are computations. For (5) supposez = x+iy.
Then z = x - i y, and z = z if and only if y = -Yo But this is possible if
and only if y = 0, and then z = x + 0 i and z is real.
12 2. Complex Numbers
Now consider
zz = (x + iy)(x - iy) = x 2 - i 2 y2 = x2 + y2 = Iz1 2 •
Therefore we have the following result.
Lemma 2.2.3
lfz e C, then zz = Iz1 2 •
Definition 2.2.2
t = 1~2 for any z e C with z ¥= O.
EXAMPLE 2.2.3 o
Let z = 3 + 4 i and w = 7 - 2 i. Then
(1) Z =
1
= 2s
3-4i
32 H2
3 4.
- 2s 1·
(2) 1.. - z.l - (3 + 4 .) 7+2 i _ 13+34 i _ 13 + 34 •
w- w- 1 53 - 53 -53 53 1 .
Theorem 2.2.1
C is a field. lR is a subfield ofC, and the degree ofC as an extension field oflR
is 2. Further, C is not an ordered field.
The degree being 2 comes from the easily observed fact that as a vector
space over lR, {I, il forms a basis. C cannot be an ordered field for the
following reason. In any ordered field F, 12 = 1 so 1 is positive. But then,
from the trichotomy law -1 must be negative and therefore, as explained
in the last section, cannot have a squareroot.
EXAMPLE 2.3.1
(see figure 2.1)
(x,y) -z
(x,-y) - z
Figure 2.1. Geometrical Representation o
We can describe the arithmetic operations in terms ofthis geometrical
interpretation. Since addition and subtraction are done componentwise,
addition and subtraction of complex numbers corresponds to the same
vector operations as pictured in the diagram below.
EXAMPLE 2.3.2
Here we restrict 9 to be in the range [0, 21r') so that each point in ]R2 has
only one set of polar coordinates. The rectangular coordinates (x, y) of a
point P are related to its polar coordinates (r, 9) by the formulas:
(i) x = r cos 9.
(ii) y = r sin 9.
(2.4.1)
(iii) r = Jx 2 + y2.
(iv) 9 = arctan(y/x) chosen in the appropriate quadrant.
If z = x + iy corresponds to the point (x,y) with polar coordinates
(r, 9), then from (2.4.1), z can be written as
z = r(cos9 + isin9) (2.4.2.)
This is called the polar form for z. The angle 9 is the argument of z,
denoted by Arg z, and in this context Izi is called the modulus of z.
EXAMPLE 2.4.1
Suppose z = 1 - i; then Izi = ../2 and Arg z = arctan(-l) = 7rr/4
since (1,-1) is in the fourth quadrant. Therefore, z = ../2(cos(7rr/4) +
i sin(7rr/4)). 0
There is a very nice exponential way to express the polar form due to
Euler. Before we describe this, we must look more closely at the powers
ofi.
Now, i 2 = -1 so i 3 = i 2 i = - i. Then i 4 = i 3 i = - i 2 = 1 and
therefore is = i. From this it follows that the powers of i repeat cycli-
cally as {I, i, -1, - i}, and i m = in if and only if n == m (modulo 4).
For example, i S1 = i 3 = - i . Further, the multiplicative inverse of any
power of i is another power of i, and so these powers form a group under
multiplication.
Lemma 2.4.1
The powers of i form a cyclic group oforder 4 under multiplication.
16 2. Complex Numbers
This result is actually a special case ofa result about primitive nth roots
of unity that we will discuss in the next section.
If t is a variable, recall that the functions et , sin t, cos t have the
following power series expansions.
(1) et = 1 + t + t2/2! + ... + tnln! + ....
(2) sint = t - tJ/3! + t5 /5! - + (-Itt 2n+l/(2n + I)! + .... (2.4.3)
(3) cos t = 1 - t /2! + t /4! -
2 4 + (-Itt 2n l(2n)! + ....
Now consider t = i () with () real, and substitute into (2.4.3) to find ei D.
(Although t is not a real variable, we do this formally.)
eiD = 1 + (iO) + (i0)2/2! + ... = 1 + (iO) - ()2/2! - i()3/3! + ....
using the rules for the powers of i. Then
eiD = (1 - ()2/2! + ()4/4! + ...) + i(() - ()3/3! + ()5/5! + ...)
= cos(O) + i sin(O).
This is known as Euler's identity.
Euler's Identity
eiD = cos (()) + i sin (()) for () E IR.
Lemma 2.4.2
Ifz,w
E C, then Izwl = Izllwl and Arg(zw) = Arg z + Arg w.
Notice that Arg i = rr/2, and multiplication i z rotates z by 900 • That
is, Arg (i z) = rr/2 + Arg z = Arg i + Arg z, which follows directly from
the lemma.
Euler's identity leads directly to what is called Euler's magic formula.
Suppose () = rr. Then
eill" = cos(rr) + i sin(rr) = -1 + 0 i = -1.
Put succintly, eill" +1 = O.
equation. If one thinks about how diversely these five numbers appear -
oas the additive identity, 1 as the multiplicative identity, e as the natural
exponential base, 1r as the ratio of the circumference to the diameter of
any circle, and i as the imaginary unit, this result is truly amazing.
EXAMPLE 2.5.1
Let z = 1 - i and let us find Z1O.
Now, Izi = ../2 and Arg z = 7lr/4, so z = ../2ei 71f/4. Therefore,
Z10 = (../2yoei701f/4. Now, (../2)10 = 32, while 70lr/4 = 16lr + 3lr/2 ==
3lr/2 (as angles). It follows than that ZIO = 32ei31f/2 = 32(cos(3lr/2) +
i sin(3lr/2)) = -32 i . 0
Notice that since Arg(ztl) = n Arg z, the function fez) = ztl winds the
complex number z around the origin. In Chapter 8 we give a proof of the
Fundamental Theorem of Algebra based on this observation.
If n E N, z E C and w tl = z then w is an nth root of z which we
denote by Zl/tl. If r E JR, r > 0 then there is exactly one real nth root
of r. If z = rei 9, then w = rl/tlei9/tl satisfies w tl = z and is one nth
root of the complex number z. However, if 9+~1fk < 2lr for kEN, then
Wk = rl/tlei O+~"k is also an nth root of z and is different from w. There are
precisely n values of k that produce different nth roots of z for example
k = 0, I, ... , n - 1. The n different nth roots of z are then
I/tl 1. O+2"k
Wk = r e n , k = 0, I, ... , n - 1.
We have therefore proved the following theorem, which is known as
DeMoivre's theorem for roots.
EXAMPLE 2.5.2
Let z = 1 + i. Find the sixth roots of z.
Now Izl =../2, Argz = arctan 1 = 1l'/4 so z = ../2ei1r / 4. The sixth roots
ofz are then
(1) WI = 21/12 ei R~4 = 21/12(COS(1l'/24) + i sin(1l'/24)).
(2) W2 = 21/ 12ei R/4t = 21/12(cos(91l'/24) + i sin(91l'/24)).
2R
(6) W6 =
21/12 ei R/4t
IOR
= 21/ 12 (COS(4I1l'/24) + i sine4I1l'/24)). 0
Corollary 2.5.1
There are exactly n distinct nth roots ofunity given by
21l'k )
Wk =e 1• 2Rk
" = cos (
-;;- + i sin ( 21l'k )
-;;- , k = 0,1, ... , n - 1.
Geometrically then, nth roots of unity all fall on a circle of radius 1 and
are located at the vertices of an inscribed regular n-gon with one vertex
on the positive real axis. In the Figure 2.5 we picture the six, sixth roots
of unity forming the vertices of a regular hexagon. For z ::f:. 0 with Izi ::f:. 1
the nth roots are at the vertices of a regular n-gon on a circle of radius
Izl1/n.
-I-w4 I -wI
Let w = ei 21r/n, the nth root of unity with the smallest positive angle.
This is called the principal primitive nth root of unity. The other nth
roots are the distinct powers of w, that is, {w, w 2 , U?, ... , w n = I}. Further
these nth roots of unity form a group under multiplication. (Notice that
w is not the only primitive nth root of unity - see the exercises.)
Corollary 2.5.2
The nth roots of unity form a cyclic group of order n under multiplication.
They are generated by w, the nth root ofunity with the smallest positive angle.
Exercises
2.1. Verify that Q( viz) is a field.
2.2. Solve the equation 3x + 7 = 6 in Z13. (Hint: Find the value of 1/3 in Z13).
2.3. Prove that Zn is a field if and only if n is a prime.
2.4. If d E Q and d is not a perfect square show that Q( v'd) is a subfield of R.
(If d is a perfect square then Q( v'd) = Q.)
2.5. Letu? = 1, w =I- 1.Showthenthatl+w+ul = O. (Hint: Factoru?-1 = 0.)
2.6. Letu? = 1, w =I- 1, so that from Exercise 2.5, 1 +w+w2 = 0, orul = -1-
w. Let Q(w) = (a + bw; a, b E Q}. Define arithmetic on Q(w) by algebraic
manipulation. Show that Q(w) is a subfield oflC. (Hint: Use ul = -1 - w.)
What is the degree ofQ(w) over Q?
2.7. This exercise is to show that there are infinitely many transcendentals.
(i) For n E N let P n =[all rational polynomials of degree:::: n ]. Show that
P n is countable. (Hint: P n can be considered as all (n + I)-tuples with
entries from Q. Recall that if A, B are countable then the Cartesian
product A x B is also countable.)
(ii) For n E N let Qn =[all roots for polynomials in P n ]. Show that Qn
is countable. (Hint:Use the facts that a polynomial of degree n has at
most n roots and countable unions of countable sets are countable.)
(iii) Let A be the set ofalgebraic numbers over Q. Show that A is countable.
(Hint: A = U~ Qn.)
(iv) Let T be the set of transcendental numbers over Q. Show that T is
uncountable. (Hint: lR = AuT. Suppose T were countable.)
2.8. Prove that the LUB property for lR is equivalent to the nested intervals
property.
2.9. Let z = 4 + 7 i, w = 6 - i . Find
20 2. Complex Numbers
(1) z, w,lzl,lwl.
(ii) z + w, zw, z/w.
(iii) .;z (2 solutions).
(iv) z in polar form.
(v) zs.
(vi) Solve the equation for Z: zZ + 6w =I - i..
2.10. Prove Lemma 2.2.1.
2.11. Prove Lemma 2.2.2.
2.12. In Exercise 2.9 multiply zw = 4w+ i 7w and trace out the geometrical steps
onw.
2.13. In Exercise 2.9 multiply z and w by putting them in polar form. Estimate
Arg z and Arg w by calculator.
2.14. Letz = 1 + J3i. Find:
(i) Z26
(ii) The five distinct fifth roots of z
(iii) Plot the answers to (ii) geometrically.
2.15. Let I, W, w2 , ... , w"-l form a cyclic group G of order n. Show that tJr also
generates G if and only if (k, n) = 1. (Hint: If (k, n) = 1 that is k and n are
relatively prime then 1 can be written as a linear combination ofk and n.)
2.16. Let w = ,r1r/8 be the principal primitive eighth root of unity. What powers
of w also generate the group of eighth roots?
Polynomials
and Complex
CHAPTER Polynomials
We will see that F[x) becomes a ring with much the same properties as
the integers Z. We first define addition, subtraction, and multiplication
on F[x) by algebraic manipulation. That is, suppose p(x) = ao + alx +
... + anxn, Q(x) = bo + blx + ... + bmxm then
p(x) ± Q(x) = (ao ± bo) + (al ± bl)x + ... (3.4)
that is, the coefficient of Xi in P(x) ± Q(x) is ai ± bi.
p(x)Q(x) = (aobo) + (alb o + aObl)x + (a Ob2 + alb l + a2bO)x2+
... + (anbm)x n+m (3.5)
that is, the coefficient of Xi in P(x)Q(x) is (aOb i + albi- I + ... + aibO)'
EXAMPLE 3.1.1
Let P(x) = 3x2 + 4x - 6 and Q(x) = 2x + 7 be in Q[x). Then
p(x) + Q(x) = 3x2 + 6x + 1
and
p(x)Q(x) = (3x 2 + 4x - 6)(2x + 7) = 6x3 + 29x2 + 16x - 42.
o
From the definitions the following degree relationships are clear. The
proofs are in the exercises.
Lemma 3.1.1
Let P(x) i- O,Q(x) i- 0 E F[x], Then:
(1) deg P(x)Q(x) = deg P(x) + deg Q(x).
(2) deg (P(x) ± Q(x)) ~ Max(deg P(x), deg Q(x)).
Lemma 3.1.2
If R is a commutative ring with an identity and r E R is a unit, then r is not
a zero divisor. In particular, if R is a field, then R is an integral domain.
Proof
Suppose r E R is a unit and rs = O. Since r is unit there exists r- I such
that r-Ir = 1. Then r-I(rs) = r-IO = 0 while r-I(rs) = (r-lr)s = Is = s.
Hence s = 0, and r is not a zero divisor.
3.1. The Ring of Polynomials over a Field 23
11teorenn 3.1.1
If F is a field, then F[x] forms a commutative ring with identity (actually an
integral domain). F can be naturally embedded into F[x] by identifying each
element of F with the corresponding constant polynomial. The only units in
F[x] are the nonzero elements of F. Fn[x] forms a vector space of dimension
n + lover F.
Proof
1b verify the basic ring properties is solely computation and is left to the
exercises. Since deg p(x)Q(x) = deg P(x) + deg Q(x), it follows that if
neither p(x) 1= 0 nor Q(x) 1= 0 then p(x)Q(x) 1= 0 and therefore F[x] is an
integral domain.
IfG(x) is a unit inF[x], then there exists anH(x) E F[x] with G(x)H(x) =
1. From the degrees we have deg G(x) + deg H(x) = 0 and since deg
G(x) ::: O,deg H(x) ::: O. This is possible only if deg G(x) = deg H(x) = O.
Therefore G(x) E F.
Finally, from Lemma 3.1.1, Fn[x] is closed under addition and subtrac-
tion and thus is an abelian group. Multiplication by elements ofF doesn't
raise the degree, so Fn[x] admits scalar multiplication from F and there-
fore forms a vector space over F. The set {I, x, x2 , ••. , xn} constitutes a
basis, so dim(Fn[x]) = n + 1. •
Definition 3.2.1
If rex), g(x) E F[x] with g(x) # 0 then g(x) divides rex), or g(x) is a factor
of f(x), if there exists a polynomial q(x) E F[x] such that f(x) = q(x)g(x).
We denote this by g(x)lrex).
If 0 # rex) has no nontrivial, nonunit factors (it cannot be factorized
into polynomials of lower degree); then rex) is an irreducible poly-
nomial, or prime polynomial. Clearly, if deg g(x) = 1 then g(x) is
irreducible.
The fact that F[x] is a UFO follows from the division algorithm for
polynomials, which is entirely analogous to the division algorithm for
integers.
EXAMPLE 3.2.1
(a) Letf(x) = 3x4 - 6x 2 + 8x - 6, g(x) = 2x2 + 4. Then
3x4 -6x2 +8x-6 3
2 = _x 2 - 6 with remainder 8x + 18.
2x +4 2
Thus, here q(x) = ~ x 2 - 6, rex) = 8x + 18.
(b) Let f( x) = 2x s + 2x4 + 6x3 + lOx 2 + 4x, g(x) = x 2 + x. Then
2xs + 2x4 + 6x3 + lOx2 + 4x
2
x +x
= 2x3 + 6x + 4.
Thus here q(x) = 2x3 + 6x + 4 and rex) = o. o
Using the division algorithm, the development of unique factorization
follows as in Z. We need the idea of a greatest common divisor, or gcd,
and the following lemmas.
Definition 3.2.2
(1) If f(x), g(x) E F[x], then d(x) E F[x) is the greatest common divisor,
or gcd, of f(x), g(x) if d(x) is monic, d(x) divides both g(x) and f(x),
and if d 1 (x) divides bothg(x) andf(x) then dl(x) divides d(x). We write
d(x) = (g(x), f( x)). If (f(x), g(x)) = I, then we say that f( x) and g(x) are
relatively prime.
(2) An expression of the form f(x)h(x) + g(x)k(x) is called a linear
combination off(x), g(x).
Lemma 3.2.1
Given f(x),g(x) E F[x], then the gcd exists,is unique, and equals the monic
polynomial of least degree that is expressible as a linear combination of
f(x),g(x).
Since each division reduces the degree, and the degree is finite, this
process will ultimately end. Let rk(x) be the last nonzero remainder poly-
nomial and suppose c is the leading coefficient ofrk(x). Then C-1rk(X) is
the gcd. We give an example.
EXAMPLE 3.2.2
InQ[x] find thegcdofthe polynomialsf(x) = x3 -1 andg(x) = x2 - 2x+1
and express it as a linear combination of the two. 0
x
2
- 2x + 1 = (3x - 3) ( ~ x - ~) .
Therefore, the last nonzero remainder is 3x - 3, and hence the goo is
x-I. Working backwards we have
3x - 3 = (x3 - 1) - (x 2 - 2x + 1)(x + 2)
so
Lemma 3.2.2
(Euclid's Lemma) If p(x) is an irreducible polynomial and p(x) divides
f(x)g(x), then p(x) divides fcx) or p(x) divides g(x).
Proof
Suppose p(x) does not divide f(x). Then since p(x) is irreducible, p(x) and
f(x) must be relatively prime. Therefore, there exist hex), k(x) such that
f(x)h(x) + p(x)k(x) = l.
Multiply through by g(x) to obtain
g(x)f(x)h(x) + g(x)p(x)k(x) = g(x).
Now, p(x) divides each term on the left-hand side since p(x)/g(x)f(x) and
therefore p(x)lg(x). •
EXAMPLE 3.2.3
Show that x2 + x + 1 is irreducible over Q[x] and lR.[x] but not irreducible
overC[x].
Suppose f(x) = x 2 + X + 1 had a nontrivial factor in lR.[x]. Since deg
f(x) = 2, this factor must have degree 1 and f(x) would factor into two
3.3. Roots of Polynomials and Factorization 27
linear factors
f(x) = X2 + X + 1 = (ax + b)(cx + d).
Then f(x) has a real root, namely -bla. However, from the quadratic
formula the roots are
-1 + .J3i -1 - .J3i
WI =
2
,W2 = ----2
which are both nonreal. Therefore, f(x) must be irreducible over lR.[x) and
also over Q[x).
A computation shows that in qx), f(x) = (x - wI)(x - Wz) and therefore
it factors in qx]. 0
Theorem 3.2.1
If 0 #- f(x) E F[x), then f(x) has a factorization into irreducible polynomials
that is unique up to ordering and unit factors. In other words F[x] is a UFD.
The proof is almost identical to the prooffor Z, and we sketch it. First
we use induction on the degree of f(x) to obtain a prime factorization. If
degf(x) = I, thenf(x) is irreducible, so suppose degf(x) = n > 1. Iff(x) is
irreducible, then it has such a prime factorization. If f(x) is not irreducible,
thenf(x) = h(x)g(x) with degg(x) < nand deg hex) < n. By the inductive
hypothesis, both g(x) and hex) have prime factorizations, and so f(x) does
as well.
Now suppose thatf(x) has two prime factorizations
f(x) = PI (x)", .. ,Pk(X)"k = qI(X)m, ... qt(x)m,
where Pi(X), i = I, ... , n, qix),j = I, ... , t are prime polynomials. Con-
siderpi(x). Then Pi(x)lqI (xT' ., . qt(x)m" and hence from Euclid's lemma,
pi(x)lqj(x) for some j. Since both are irreducible, Pi(X) = c%(x) for some
unit c. By repeated application of this argument we get that ni = mj. Thus
we have the same primes with the same multiplicities but perhaps unit
factors, proving the theorem.
This whole development could be done starting withjust a ringR rather
than a field F. It can be proved that if R is a UFO, R[x) is also a UFO. Thus,
for example, Z[x) is a UFO. The proof is different and somewhat more
complicated than that for fields.
Lemma 3.3.1
If c is a root of P(x) then (x - c) divides P(x) - that is, P(x) = (x - c)Q(x)
with deg Q(x) = deg P(x) - 1.
Proof
Suppose P(c) = O. Then from the division algorithm p(x) = (x - c)Q(x) +
rex) where rex) = 0 or rex) = f E F, since deg rex) < deg (x - c) = 1.
Therefore
= (x - c)Q(x) + f
P(x)
Substituting, we have P(c) = 0 + f = 0, and so f = O. Hence P(x) =
(x - c)Q(x). •
Corollary 3.3.1
An irreducible polynomial of degree greater than one over a field F has no
roots in F.
Theorem 3.3.1
A polynomial ofdegree n in F[x] can have at most n distinct roots.
Proof
Suppose p(x) has degree n and suppose CI, ... ,Cn are n distinct roots.
From repeated application of Lemma 3.3.1,
P(x) = k(x - CI) ... (x - cn)
wherek E F. Supposecisanyotherroot. ThenP(c) = 0 = k(C-CI) ... (C-
cn). Since a field F has no zero divisors, one of these terms must be zero:
c - Ci = 0 for some i, and hence c = Ci. •
Corollary 3.3.2
If P(x) E F[x] with deg P(x) = 2, then P(x) is irreducible if and only if P(x)
has no root in F.
Proof
Suppose P(x) is irreducible. Since deg P(x) > I, P(x) can have no root in
F. Conversely suppose P(x) has no root in F. If P(x) were not irreducible
then it would factor into two linear factors,
P(x) = (ax + b)(cx + d).
But then P(x) would have the root -b/a E F.
•
3.4 Real and Complex Polynomials
We now consider the underlying field to be IR or C and consider real and
complex polynomials - that is, polynomials in lR[x] and C[x] respectively.
We first need the following important result.
Theorem 3.4.1
A real polynomial ofodd degree has a real root.
Proof
Suppose p(x) E lR[x] with deg P(x) = n = 2k + 1 and suppose the leading
coefficient an > 0 (the proof is almost identical if an < 0). Then
P(x) = anxn + (lower terms) and n is odd.
Then
(1) limx ooP(x) = limx ooanx n = 00 since an > O.
(2) limx _ooP(x) = limx _ooanx n = -00 since an > 0 and n is odd.
From (1) P(x) gets arbitrarily large positively so there exists an Xl with
P(XI) > O. Similarly, from (2) there exists an X2 with P(X2) < O.
A real polynomial is a continuous real-valued function for all X E R
Since P(XI)P(X2) < 0, it follows from the intermediate value theorem that
there exists an X3, between Xl and X2, such that P(X3) = o. •
As an immediate consequence we have the following corollary:
Corollary 3.4.1
If P(x) E lR[x] is irreducible and nonlinear, then its degree is even. (We will
see later that it must be 2.)
Lemma 3.4.1
Every degree 2 complex. polynomial has a root in C.
Proof
This is just the quadratic formula. If p(x) = ax 2 + bx + c, then the roots
formally are
-b + Jb 2 - 4ac -b - Jb 2 - 4ac
Xl = 2a I x2 = 2a
From DeMoivre's theorem every complex number has a square root;
hence Xl, X2 exist in C. They of course may be the same, if b2 - 4ac = O.
•
1b go further we need the concept of the conjugate of a polynomial
and some straightforward consequences of this idea.
Definition 3.4.1
If p(x) = ao + ... + anxn is a complex polynomial then its conjugate
is the polynomial F(x) = ao + ... + anxn. That is, the conjugate is the
polynomial whose coefficients are the conjugates of those of p(x).
Lemma 3.4.2
For any P(x) E C[x]
(1) P(z) =
Fez) ifz E C
(2) P(x) is a real polynomial if and only if P(x) F(x) =
(3) If P(x)Q(x) = H(x) then H(x) = (F(x))(Q(x))
Proof
(1) Suppose z E = ao + ... + anzn. Then
C and p(z)
p(z) = ao + ... + anz n = ao + alz + ... + anz n = Fez).
(2) Suppose P(x) is real then ai = ai for all its coefficients and hence
p(x)= F(x). Conversely suppose p(x) = F(x). Then ai = aj for all
its coefficients and hence aj E IR for each aj and so p(x) is a real
•
polynomial.
(3) The proof is a computation and left to the exercises.
Lemma 3.4.3
Suppose G(x) E C[x]. Then H(x) = G(x)G(x) E lR[x].
Proof
H(x) = G(x)G(x) =
G(x)G(x) = G(x)G(x) = G(x)G(x) = H(x). Therefore,
H(x) is a real polynomial. •
3.5. The Fundamental Theorem of Algebra· Proof One 31
Lemma 3.4.4
If f(x) E lR.[x] and f(zo) = 0 then f(ZO) = 0; the complex roots of real
polynomials come in conjugate pairs.
Proof
f(zo) = 0 implies that f(zo) = O. But then [(ZO) = O. Since f(x) is real,
f(x) = [(x), so f(ZO) = O. •
Notice that if Zo is a root of f(x) E lR.[x] then both (x - zo) and (x - ZO)
divide f(x).
Lemma 3.4.5
(x - z)(x - Z) E lR.[x] for any z E C.
We leave the proof for the exercises. Notice that this now implies that
any real polynomial of degree::: 3 completely factorizes over C.
Finally, to complete this section, we prove the following theorem,
which shows that to prove the Fundamental Theorem ofAlgebra we need
only to prove that real polynomials must have complex roots.
Theorem 3.4.2
Ifevery nonconstant real polynomial has at least one complex root, then every
nonconstant complex polynomial has at least one complex root.
Proof
Let P(x) E qx] and suppose that every nonconstant real polynomial
has at least one complex root. Let H(x) = P(x)P(x). From Lemma 3.4.3,
H(x) E lR[x]. By supposition there exists a Zo E C with H(zo) = 0. Then
.
P(zo)P(zo) = 0, and since C has no zero divisors, either p(zo) = 0 or
P(zo) = 0. In the first case Zo is a root of p(~. In the second case P(zo) = 0,
bu then from Lemma 3.4.2 0 = P(zo) = P(ZO) = P(ZO). Therefore Zo is a
~~~.
Lemma 3.5.1
If f:D -+ IR is continuous where D is a closed and bounded (compact) subset
oflR2 , then f(x,Y) has a minimum and maximum value on D.
Theorem 3.5.1
(The Fundamental Theorem ofAlgebra) Iff( x) E C[x], with f(x) nonconstant,
then f(x) has at least one complex root.
Lemma 3.5.2
Let f(x) E C[x]. Then lrex) I takes on a minimum value at some point Zo E C.
Proof
It is straightforward that as Ixl -+ oo,lf(x)1 -+ 00. Since If(x)I is large for
large Ixl it follows that the greatest lower bound m of If(z)1 for z E Cis
also the greatest lower bound in some sufficiently large disk Izl ::: r.
Since If(x)I is a continuous real-valued function it follows from Lemma
3.5.1 that If(x) I will attain its minimum value on this disk. •
Lemma 3.5.3
Suppose f(x) E C[x] with f(x) nonconstant. If f(Xo) =f:. 0, then If(xo) I is not
the minimum value oflf(x)l.
Proof
Let f(x) be a nonconstant complex polynomial and suppose Xo is a point
with f(Xo) =f:. O. Make the change of variable x + Xo for x. This shifts Xo to
the origin, so that we may assume that f(0) =f:. O. Next multiply f(x) by
f(O)-I so that f(0) = 1. We must then show that 1 is not the minimum
value of If(x)l.
Let k be the lowest nonzero power of x occurring in f( x). Then f(x) can
be assumed to have the form
f(x) = 1 + axk + terms of degree > k.
Now let a be a ~-th root of -a-I, which exists by DeMoivre's theorem.
Make the final change of variable ax for x. Now f(x) has the form
f(x) = 1 - x k + xk+lg(x) for some polynomial g(x).
3.6. Some Consequences of the Fundamental Theorem 33
For small real x, xlg(x) I is small, sothenxo canbe chosen so that Xo Ig(xo) I <
1. It follows that x~(1 - xolg(xo)/) > 0, so then Ij{xo)/ < 1 = 1j(0) I
completing the proof. •
Proof
Let j{x) be a nonconstant complex polynomial. From Lemma 3.5.2, Ij{x) I
has a minimum value at some point Xo E C. Then from Lemma 3.5.3 it
follows that lj{xo)1 = 0, and hence j{xo) = 0 for otherwise it would not be
the minimum value. Therefore, j{x) has a complex root. •
Corollary 3.6.1
A complex polynomial completely factorizes into linear factors.
Proof
Let j{x) E C[x] and use induction on the degree. The corollary is clearly
true if deg f(x) = I, since thenf(x) is itself linear. Suppose degj{x) = n.
From the Fundamental Theorem of Algebra, there exists a root Xo, and
therefore (x-xo) dividesf(x). Hencej{x) = (x-xo)g(x) with degg(x) < n.
From the inductive hypothesis, g(x) factors into linear factors, so therefore
j{x) does also. •
Corollary 3.6.2
Suppose f(x) E C[x] with deg fCx) = n. Suppose the roots of f(x) are
Xl,X2, ... ,Xn (some may be repeated). Then
34 3. Polynomials and Complex Polynomials
Corollary 3.6.3
A real polynomial factorizes into degree 1 and degree 2 factors. Equivalently,
the only irreducible real polynomials are linear polynomials and quadratic
polynomials without real roots.
Proof
Suppose p(x) E lR[x], then p(x) E C(x). Suppose Zl, ... , Zn are its complex
roots, so that
p(x) = a(x - Zl) ... (x - zn)
where here ex E JR, since ex is the leading coefficient of P(x).
If Zj is real, then (x - Zi) is a real linear factor. If Zj e IR then its complex
conjugate Zi is also a root. But then (x - Zj)(x - Zi) is a real factor of degree
~o. •
Corollary 3.6.4
An irreducible real polynomial must be ofdegree 1 or 2.
Exercises
3.1. Prove Lemma 3.1.1 - that is, if 'p(x), Q(x) E F[x], then deg .P(x)Q(x) = deg
.P(x) + deg Q(x) and deg (.P(x) ± Q(x)) ~ Max(deg .P(x), deg Q(x)).
3.2. Verify that F[x] is a commutative ring by showing that the ring axioms hold.
3.3. Let S be a subring of the field F (such as Z in lR). Let S[x] consist of the
polynomials in F[x] with coefficients from S. Show that S[x] is a subring of
F[x]. Recall that to show a subset is a subring we must only show that it is
nonempty and closed under addition, subtraction, and multiplication.
3.4. The following theorem can be proved:
Theorem
Given (n + 1) distinct values XO,Xl, " . ,xn in a field F and (n + 1) other values
YO,Yl, ... ,Yn in F, then there exists a unique polynomial f(x) E F[x] of degree
~ n such that [(x;) = Yi for i = 0,1, ... ,n.
The proofis from linear algebra. Suppose fcx) = ao +alx+, .. +anxn with
the ai considered as variables. UsingfCxi) = Yi set up the (n + 1) x (n + 1)
Exercises 35
system of equations:
ao + alXo + + an~ = Yo
ao + alXl + + anx~ = Yl
The function u(z) is called the real part of ftz), denoted by Reftz), and
v(z) is the imaginary part of ftz), denoted by Imf(z). Analytic (calculus)
questions about ftz) will in many cases be referred back to questions about
u(x, y) and vex, y).
36
4.1. Complex Functions and Analyticity 37
EXAMPLE 4.1.1
Consider the complex function fez) = Z2. Determine its real and
imaginary parts.
Suppose z = x + iy. Then Z2 = (x + iy)2 = (x 2 - y2) + i(2xy). Hence
here Ref(z) = x2 - y2 while Imf(z) = 2xy. 0
Definition 4.1.1
limz->zoftz) = woifforallE > Othereexistsao > Osuchthatlftz)-wol <
Ewhenever 0 < Iz - zol < o.
Here, of course, the distances Iftz) - wol and Iz - zol are distances in
C, an<~ the approach is within a circular neighborhood of zoo Hence there
are infinitely many modes of approach.
All the basic limit theorems from elementary calculus - products,
sums, constants, etc. - carry over to this situation. 1b actually compute
limits we pass to the real and imaginary parts.
Lemma 4.1.1
Suppose fez) = u(z) + iv(z)j then
lim ftz) = lim u(z)
Z->Zo Z->Zo
+ i Z->Zo
lim v(z).
EXAMPLE 4.1.2
Letftz) = (x 2 + y2) + i(2xy). Find limz->1+iftz).
Then
lim ftz) lim (x 2 + y2) + i lim (2xy) = 2 + 2i.
= (x.y)->(l,l)
z->1+i (x,y)->(l,l)
Using limits we can continue on to continuity and differentiability. 0
Definition 4.1.2
w = ftz) is continuous at Zo if limz-.zo ftz) = ftzo). ftz) is continuous
on a region U ifit is continuous at all points of U.
All the basic results on continuity for one-variable real functions (sums,
products, etc.) carry over to complex functions. Further, as with limits,
4.1. Complex Functions and Analyticity 39
Lemma 4.1.2
fez) = u(z) + iv(z) is continuous at Zo = (xo,Yo) ifand only ifu(x,y),v(x,y)
are continuous real-valued functions at (xo,Yo).
Lemma 4.1.3
Let fez) E C[z] then:
(1) fez) is continuous everywhere in C.
(2) limz-. oo \f(z)\ = 00 iff(z) is non-constant.
(3) If(z) I is bounded on every compact region in C.
Defintion 4.1.3
If f( z) is any complex function, then its derivative f' (zo) at Zo E C is
l' f(zo + Llz) - f(zo)
f
f
(zo) = 1m
~z-.o Llz
whenever this limit exists. Iff'(zo) exists, thenf(z) is differentiable there.
f(z) is differentiable on a whole region if it is differentiable at each point
of the region.
As with limits and continuity, all the basic differentiation rules from
elementary calculus - sums, products, quotients, chain rule etc. - can be
shown to be valid for complex functions. In particular the power rule -
f(z) = zn implies f'(z) = nzn-l for n a natural number - follows purely
formally from the definition. Therefore, a complex polynomial must have
a derivative at each point of C.
Lemma 4.1.4
IffCz) = ao + alz + ... + anz n E C[z] then f'(z) exists at each point Zo E C
and f'(zo) = al + ... + nanz~-l. Formally, iffCz) E C[z] and deg fez) ::: I,
then f'(z) E C[z] and deg f'(z) = deg fez) - 1. IffCz) = ao is constant,
then f'(z) = o.
40 4. Complex Analysis and Analytic Functions
Recall that ify = fCx) is a single-variable real function, then [,(xo) gives
the slope of the tangent at xo. The complex derivative can also be inter-
preted geometrically. We will do this in Section 4.3. First we introduce
some important general ideas.
Definition 4.1.4
w = fCz) is analytic (also called regular or holomorphic) at Zo if fcz) is
differentiable in a circular neighborhood ofzoo fcz) is analytic in a region
U if it is analytic at each point of U. IffCz) is analytic throughout C, then
it is called an entire function.
Proof
From the definition,
' l' fCzo + ~z) - fCzo)
f (zo) = 1m
Llz~O ~z
.
Since fcz) is real-valued, we must have fcz) = u(z) its real part. Then
' l' u(xo + ~x, Yo + ~y) - u(xo, Yo)
f (zo) = (Llx,Lly)~(O.O)
1m
~z
EXAMPLE 4.1.3
Let fcz) = Iz1 2 • We show that ['(0) exists but that fcz) is not analytic at
z = o.
4.2. The cauchy-Riemann Equations 41
Therefore, f' (0) exists and f' (0) = O. We show, however, that it cannot be
analytic at O.
Now, if z = x + iy, Izl 2 = x2 + y2. If f'(zo) exists, then from Lemma
4.1.5, f'(zo) = ~ (zo) = 2xo = ~ (zo) = 2yo. This then is possible only
if xo = Yo, and thus the derivative can only possibly exist along the line
y = x and hence does not exist in a circular neighborhood ofO. Therefore,
it is not analytic at O. In the next section we will show that this function
is differentiable only at O. 0
' l' u(Xo + ~x, Yo) + iv(xo + ~x, Yo) - (u(xo, Yo) + iv(xo, Yo))
[ (zo) = ~x-+o
1m
~x
au &v
= ax (zo) + i ax (zo)
Now allow ~z to approach 0 along a line parallel to the imaginary axis.
In this case ~z = i~y and ~x = O. Then
'()
[ Zo
l'
= ~y-+o
1m
u(xo, Yo + ~y) + iv(xo, Yo + ~y) - (u(xo, Yo) + iv(Xo, Yo))
i~y
&v au
= ()y (zo) - i ay (zo)·
42 4. Complex Analysis and Analytic Functions
Definition 4.2.1
u(x, y), vex, y) satisfy the Cauchy-Riemann Equations if
au
-ax = -ay
av
and -
au = --.
au
ay ax
Theorem 4.2.1
[ff(Z) = u(z) + iv(z) is differentiable at zo, then ~' ~' ~' ~ all exist at Zo
and satisfy the Cauchy-Riemann equations. Further,
/
f(zo) =
au au
-(zo)+i-(zo)
av au
= -(zo)-i-(zo).
ax ax ay ay
More generally, if f(z) is analytic in some domain U then its real and
imaginary parts must satisfy the Cauchy-Riemann equations throughout
U. Further, if f(z) = u(z) + iv(z) and u(z), v(z) have continuous partials
in U and satisfy the Cauchy-Riemann equations throughout U, thenf(z)
is analytic in U, which we now prove.
Let Zo E U. We must then show thatf'(zo) exists. Consider
. u(zo
11m
+ ~z) + iv(zo + ~z) - (u(zo) + iv(zo))
~z-+o ~z
= ~z-+o
lim
~u + i~v = lim -~u. + t lim
~v
~z ~z-+o ~z ~z-+o ~z
Since u(x, y), vex, y) have continuous partials at (xo, Yo), we have
au au
~u = -~x+
ax
-~y+El~X+E2~y
ay
and
4.2. The Cauchy-Riemann Equations 43
lim
t.z~o
2- (au (L\x + iL\y) + i au
L\z ax ax
(L\x + iL\y) + 81 L\x + 82 L\y) .
where 81 , 82 -+ 0 as L\z -+ O.
Since jL\xl ::: lL\zl and lL\yl ::: lL\zl, we have I ~: I ::: I, I ~ I ::: 1,
and hence the final two terms in the limit above tend to zero with L\z. It
follows then that at Zo we must have
au au
+ i ax (zo),
I
f (zo) = ax (zo)
Theorem 4.2.2
(1) Suppose fez) = u(z) + iv(z). Iff'(zo) with Zo = (Xo,Yo) exists, then the
partials ofu(x,y),v(x,y) must exist at (Xo,Yo) and satisfy the Cauchy-Riemann
equations.
(2) Suppose fez) = u(z) + iv(z). If u(x,y),v(x,y) and their partials are
continuous at Zo = (xo,Yo) and satisfy the Cauchy-Riemann equations at
(xo,Yo), then f'(zo) exists, that is, fez) is differentiable at zoo
Corollary 4.2.1
Suppose fez) = u(z) + iv(z) with u(x,y),v(x,y), and their partials continuous
on some open domain U C C. Then fez) is analytic on U if and only ifu,v
satisfy the Cauchy-Riemann equations.
EXAMPLE 4.2.1
Let fez) = f! cosy + if! siny. Show that fez) is everywhere analytic and
f'(z) = fez).
Here u(x,y) = f! cosy, v(x,y) = f! siny. These are everywhere
continuous, differentiable, two-variable real-valued functions. There-
fore, to show that fez) is analytic, we must show that they satisfy the
Cauchy-Riemann equations.
Now,
au au av au
ax = f! cosy, (Jy = -f! siny, ax = e" siny, ay = e" cosy.
44 4. Complex Analysis and Analytic Functions
EXAMPLE 4.2.2
Using the Cauchy-Riemann equations show thatfCz) = z2 is everywhere
analytic and f'(z) = 2z.
Now, ifz = x + iy, thenfCz) = Z2 = (x + iy)2 = x 2 - y2 + i(2xy). Then
u(x, y) = x 2 - y2, vex, y) = 2xy. Computing the partial derivatives, we
have
au au av av
ax = 2x, Oy = -2y, ax = 2y, Oy = 2x.
Clearly, these are then continuous and satisfY the Cauchy-Riemann
equations throughout ((;, and hence fCz) is everywhere analytic. Further,
au au
f'(z) = -
ax
+ i-
ax
= 2x + i(2y) = 2(x + iy) = 2z.
o
Corollary 4.2.2
(1) The only real-valued analytic functions are constants.
(2) Iff' = 0 on a region U, then fez) is a constant.
Proof
(1) Suppose fez) is real-valued. Then fCz) = u(z) with v(z) = O. If fez)
is analytic, it must satisfY the Cauchy-Riemann equations, so that ~ =
~ = 0 and ~ = - ~ = O. Therefore, ~ = ~ = 0, and hence u(x, y)
is constant, and therefore so is fez).
(2) Iff' = 0, thenf' = ~ + i~ = ~ - i~ = O. This implies that
~ = ~ = ~ = ~ = O,andhencebothu(x,y)andv(x,y)areconstants.
Therefore, fCz) is constant. •
Definition 4.2.2
A real-valued function u(x, y) is a harmonic function ifit has continuous
second partials and satisfies Laplace's equation
a2u a2u
-2
ax
+ -&y2 =0.
Lemma 4.2.1
If fez) = u(z) + iv(z) is an analytic function, then u(x,y) and v(x,Y) are
harmonic functions.
Proof
We leave the continuity of the second partials until later. Suppose f(z) is
analytic; then it must satisfy the Cauchy-Riemann equations ~ = ~
and ~ = - ~ . Since there are continuous second partials, the order of
partial differentiation can be flipped, so that
a2u a2v a2v a2u
ax 2 = &yax = ax&y =- &y2 .
Th erelore,
&. azu iJy2u , or azu
ax2 = - a
2
ax2 + azu
iJy2 = 0 and so u ( x, Y).IS h armonlC.
I
. An
analogous argument shows that vex, y) is harmonic. •
EXAMPLE 4.2.3
Show that u(x, y) = y3 - 3x2y is a harmonic function and find a conjugate
harmonic function v(X, y) such that f( z) = u + iv is analytic.
Now ax au = - 6xy, ay
au = 3y 2 - 3X2 so aax2 2
u = - 6y, a
iJy2u = 6y. Th erelore,
2
&.
This implies thatg'(x) = 3x2, and now integrating with respect to x gives
g(x) = x3 + c. Any constant will work, so take c = 0, and therefore
v(x,y) = x 3 - 3xy2 is a conjugate harmonic to u(x, y), and
is analytic. o
Definition 4.3.1
A curve y in C is a continuous function y : [a, b] --+ C given by
with x(t), yet) real-valued functions on the real interval [a, b]. If x(t), yet)
are differentiable at to, then y is called a differentiable curve at to and
then y (to) = x' (to) + iy' (to). A curve is differentiable ifit is differentiable
for all t E [a, b]. A curve is continuously differentiable if it is differ-
entiable and the derivative is also continuous on [a, b]. The curve YCt) is
regular at to if y(to) ::f:. O. The direction of YCt) at a regular point to is
Arg veto). In general, a regular curve is a curve that is regular at all its
points. .~
EXAMPLE 4.3.1
The curve YCt) = r cos t + ir sin t = reit , 0 :5 t :5 2]'(, represents a circle
of radius r centered at the origin.
The derivative is y'(t) = -r sin t + ir cos t, which is never zero, so YCt)
is regular for all t.
At t = 0, yeO) = ir. Since this is purely imaginary, the argument is
]'(/2, as would be expected by looking at the circle in Figure 4.3.
4.3. Conformal Mappings and Analyticity 47
g'(O) - ir
(r, 0) - g(O)
Definition 4.3.2
Suppose U C C and F : U ~ Co Then F is conformal, or isogonal, at
Zo E U if for any curve Y, regular at to, where YCto) = zo, F 0 Y is also
regular at to and F preserves angles at zoo Preservation of angles means
that if YI (t), yz(t) are two curves with values in U and YI(tI) = Yz(tz) = zo,
then the angle from YI to yz at Zo is equal to the angle from F 0 YI to F 0 yz
at F(zo). If F is conformal throughout U, then it is called a conformal
mapping.
Theorem 4.3.1
(1) If a continuous complex function fez) on a domain U C C has a nonzero
complex derivative at Zo, then fez) is conformal at zoo
(2) Suppose fez) is continuous on a domain U C C and conformal at
Zo E U, and suppose the partials all exist and are continuous at zoo Then
f'(zo) exists and is not zero.
48 4. Complex Analysis and Analytic Functions
Proof
We prove part (1) and leave a sketch ofthe proofofpart (2) to the exercises.
Suppose fez) is a continuous complex function with a nonzero
derivative at zoo We show that f is conformal at zoo
Suppose that Ylt yz are two regular curves with values in V and YI (tl) =
yz(tz) = zoo Since f'(zo) =P 0,[ 0 YI,[ 0 yz are regular at tl, tz. Then the
angle fromf 0 YI to f 0 yz atfezo) is
Arg ((f 0 yz)'(tz)) - Arg ((f 0 yI)'(tI))
= Arg (f'(Yz(tz))y~(tz))
- Arg (f'(YI(tl))y~(tI))'
Recall that for z, W E C, Arg (zw) = Arg z + Arg w. Therefore, the above
becomes
Corollary 4.3.1
A continuous complex function fez) for which all the partials exist and are
continuous is a conformal mapping on a domain U C C ifand only iff(z) is
analytic on U and f'(z) =P 0 on U.
Proof
If fCz) is analytic and f'(z) =f:. 0, then at each point Zo E U, f'(zo) =P O.
Then from part (l) ofTheorem 4.3.1, fCz) is conformal at Zo and therefore
a conformal mapping on U.
Conversely, suppose fCz) is continuous and conformal at each point of
U. Then from part (2) of the theoremf'(zo) exists and is not zero for each
Zo E U. Therefore, fez) is analytic on V, andf'(z) =p O. •
Definition 4.3.3
Suppose f : U .... C, U C C, Zo E U, and M > O. Then fez) is a
magnification at Zo by M if
lim IfCzo + ~z) - fCzo)1 = M.
Az-+O I~zl
Exercises 49
Theorem 4.3.2
(1) Iff(z) is differentiable at zo, then fez) is a magnification at Zo by 1f'(zo)l.
(2) Let U be a domain in C and fez) a continuous complex function on
U. Suppose Zo E U,M ::: 0, and fez) is a magnification at Zo by M, and
suppose further that at Zo all the partials exist and are continuous and f 0 y is
differentiable at to for any curve y differentiable at to where Zo = y(to). Then
either fez) is differentiable at zo, or fez) is differentiable at zoo
EXAMPLE 4.3.2
Letftz) = ef. Show that this is a conformal mapping throughout Co
Now, ft z) = ff = ef cos Y + ief sin y is never zero in C. From our earlier
example we saw thatf'(z) = ef also. Therefore, f'(z) f. 0 in C, and hence
it is conformal.
Further, ftz) = ff maps the strip -rr < y < rr one-to-one and con-
formally onto the split plane that omits 0 and the negative real axis. We
indicate this in Figure 4.4. 0
Exercises
4.1. Letfez) = z3.
(a) Detennine the real and imaginary parts offez).
(b) Use the Cauchy-Riemann equations to show that fez) is everywhere
analytic andf'(z) = 3z 2 .
(c) Use the fonnal definition (as in elementary calculus) on z3 directly to
show thatf'(z) = 3z 2 .
50 4. Complex Analysis and Analytic Functions
4.2. Use induction and the product rule to show that if Jtz) = zn, n E N, then
f'(z) = nzn-I.
4.3. Show that the function j{z) = z is nowhere differentiable.
4.4. Let Jtz) be analytic. Show that g(z) = j{z) is not analytic unless Jtz) is
constant.
4.5. Let Jtz) = (2x z + y) + i(x3 _ y3).
(a) What is Jt1 + 2t)?
(b) Evaluate limz-->3-i j{z).
(c) At which points (if any) is Jtz) differentiable?
(d) Is j{z) analytic anywhere? Why?
4.6. Show that the following functions are entire.
(a) j{z) = (3x + y) + i(3y - x).
(b) Jtz) = e-Y cos x + ie-Y sin x.
4.7. Show that the following functions are nowhere analytic.
(a) j{z) = xy + iy.
(b) Jtz) = eY cos x + ieY sin x. Here note the distinction with exercise 4.6 and
with the complex exponential function.
4.8. Show that each of the following functions is harmonic and find a harmonic
conjugate.
(a) u(x,y) = 2x - 2xy.
(b) u(x,y) = 2x - x3 + 3xyz.
4.9. Describe pictorially what the following curves look like.
(a) y(t) = (3 + i) + 4e it , 0 ::: t ::: 211".
(b) y(t) = t + it Z, 0 ::: t ::: l.
(c) y(t) = t Z + i(ln(t)), 1 ::: t ::: 2.
4.10. Let YI(t) = t + it z, 0 ::: t ::: 1 and yz(t) = !
+ ieit,O ::: t ::: 211". Then
YI(l/2) = Yz(O) = zoo What is the angle between the two curves at this
common point?
4.11. (a) Prove formally that iflimz-->zoJtz) = WI and limZ-->zog(z) = Wz, then
limz-->zo(f{z) + g(z)) = WI + Wz.
(b) Use part (a) to show that if Jtz) and g(z) are continuous at Zo then Jtz) +
g(z) is also continuous at zoo
4.12. The complex trigonometric functions are defined by
eiz _ e- iz eiz + e- iz
sinz = 2i
, cosz = --:--
2
it
(a) Use Euler's identity e = cos t + i sin t applied to a complex variable z
to show that these definitions are what you would expect.
(b) Using the derivative of the exponential function find the derivatives,
and show that if Jtz) = sin z then f'(z) = cos z, and if j{z) = cos z then
f'(z) = - sinz.
The final exercises will sketch a proof of the second part of Theorem
4.3.1 - that if Jtz) is conformal at zo, then Jtz) is differentiable at Zo and
f'(zo) :f:. O.
Exercises 51
4> - 9 = 4> - 9 + Arg (Of + e- 2i¢ o~) _ Arg (Of + e-2iO o~) .
oz oz oz OZ
It therefore follows that
Definition 5.1.1
Suppose V is a domain in the xy-plane, y is a curve contained in V, and
f(x, y) is a real-valued function defined on V. Choose successive points
Po = (xo,Yo),P1 = (XI,YI), ... ,Pn = (Xn,Yn) on ypartitioning yasin
Figure 5.l.
Then [(Pi) = f(Xi, Yi) is defined. Let ~Xi = Xi - Xi-l and form the
Riemann sum L~=l f(Xi, Yi)~Xi' Then the line integral of f( X, y) over y
52
5.1. Line Integrals and Green's Theorem 53
g
Pz Ds; = arc length from
P j- 1 tOPj
d~Yj=Yi-Yj-l
P 1
j- DXj=Xj-Xj_l
1y
tex,y)dx = lim tf(Xi,yi)flXi
6x,-+O i=l
(5.1.1)
l y
tex,y)dY = lim tteXi,Yi)flYi.
0 6Yi-+ ;=1
(5.1.2)
Finally, if fls i is the arc length along y from Pi-l to Pi, then the line
integral of [(x, y) over y with respect to arc length is
1y
tex ,y)ds = lim tflXi,Yi)flSi .
6s;-+0 ;=1
(5.1.3)
Lemma 5.1.1
Iff(x,y) is continuous on U and y c U is continuously differentiable, then
the various line integrals exist.
ly
tex,y)dx = It'
to
f(xCt),yCt))x(t)dt. (5.1.1')
54 5. Complex Integration and Cauchy's Theorem
1 y
f(x, y)dy = i to
tl
f(x(t), y(t))y'(t)dt. (5.1.2')
1 y
[(x, y)ds = i to
tl
f(x(t), y(t))JX(t)2 + y'(t)2dt. (5.1.3')
i f(x,y)dx =i b
f(x,g(x))dx. (5.1.1")
i =i
b
[(x, y)dy f(x, g(x))g'(x)dx. (5.1.2")
i f(x,y)ds = i b
f(x,g(X))JI + g'(x)Zdx. (5.1.3")
EXAMPLE 5.1.1
Evaluate the line integral f y 2xydx + (x 2 - y2)dy, where y is the curve
given by x(t) = t 2 - I, yet) = t 2 + 1,0 :::: t :::: 1.
Here dx = 2tdt, dy = 2tdt, so
i 2xydx + (x 2 - y2)dy = 1 1
2(t2 - I)(t
2
+ I)2tdt
1
1 10
+ ((t2 - Ii - (t2 + I)2)2tdt = - -.
o 3 o
EXAMPLE 5.1.2
Evaluate the line integral fy(y3 - 3xy2)dy where y is the curve given by
y = 2x2, 0 :::: x :::: 1.
Here dy = 4xdx so,
i (y3 - 3xl)dy = 1 1
((2x i - 3x(2x2i)4xdx
2
1
1 7 6 20
= (32x - 48x )dx = - - .
o 7 o
5.1. Line Integrals and Green's Theorem 55
EXAMPLE 5.1.3
Evaluate the line integral Jy yds, where y is the curve given by y = g(x) =
,[X, 0 ::: x ::: 6.
Here ds = J 1-+-g-:-:,(-'x)"""""2dx =
C
! J1~4x dx so,
{ yds = ~ {6 .JXJ 1 + 4x dx
1y 2 10 x
= -21 1+0
6
(1 4xp1 dx = -.
31
3 0
acting in the plane. (Here i andj are the standard unit vectors). Then the
work done in pushing a particle along a curve y subject to this force is
df = of dx + of dy. (5.1.5)
ax ()y
Thus the total differential of a two-variable function is a first-order dif-
ferential form. Notice that if y is a curve with initial point Po and terminal
point PI, then by substitution in the definitions,
l df = f(PI) - f(Po).
That is, the line integral of a total differential depends only on the end-
J
points. We say that y df is independent of path, a concept we will return
to.
In general, a first-order differential form Pdx + Qfl.y is exact if it is the
total differential of some function f(x, y). In this case P(x, y) = ~ and
Q(x, y) = ~. Assuming P, Q are continuous we would then have
ap a2f a2f aQ
ay = axay = ()yox = ax
This condition turns out to be sufficient for exactness as well.
Lemma 5.1.2
If P(x,y),Q(x,y) are continuous, then Pdx
(lP _ £Q.
+ Qfl.y is exact if and only if
iIy - (Ix·
56 5. Complex Integration and Cauchy'8 Theorem
EXAMPLE 5.1.4
Show that (x 3 + 3x2y)dx + (x3 + y3)dy is exact and determine a function
ftx, y) for which it is the total differential.
Here P(x,y) = (x 3 + 3x2y), Q{x,y) = (x 3 + y3) and therefore ~ =
3x2, ~ = 3x2. Hence from Lemma 5.1.2 it is an exact first order
differential form.
Suppose
Then
Definition 5.1.2
A simple closed curve is a curve y(t) : [a, b] ~ C such that y(a) = y(b)
but y(tl) ;/; y(t2) for no other pair tl, t2 E [a, b]. Basically this is a curve tht
is the union oftwo curves having only their endpoints in common,that is,
there is only one point of self-intersection and it occurs at the endpoint
of the interval of definition. (See Figure 5.2.)
5.1. Line Integrals and Green's Theorem 57
o
Simple closed curve Non-simple closed
curve
The Jordan curve theorem says that a simple closed curve divides
the plane into two regions, an interior and an exterior. This seemingly
obvious fact is surprisingly difficult to prove.
Theorem 5.1.1
(Green's Theorem) Suppose R is a region in C whose boundary aR is a
simple closed curve. Suppose further that P(x,y),Q(x,y) are continuously
differentiable functions on a domain U containing R and aR. Then
1.
hR
Pdx + Qdy = f JR{
(aQ _ ap) dA,
ax ay
where the line integral is taken around aR in the counterclockwise direction.
1. Pdx + Qdy =
hG
f JG{ (aQ _ ap) dA,
ax ay
where the line integral over the boundary is defined as the sum over each
boundary curve, each directed so that the region G is on the left.
EXAMPLE 5.1.5
Green's theorem applies to the region R in Figure 5.3, where the line
integral would be counterclockwise around the outer boundary curve
and clockwise around the inner boundary curves. 0
58 5. Complex Integration and Cauchy's Theorem
Proof
Suppose R is the rectangular region a :::: x :::: b, C :::: y :::: d. This is pictured
in Figure 5.4.
(0. d) (b,d)
(0. c) (b, c)
1.
hR
P(x, y)dx. =- f[ oP dA
JR &y
and
1.
hR
Q(x,y)dy = f[ oQ dA.
JR ox
Consider first
f1 -oP dA
ROy
= II -
a
b
c
d
oP dydx.
&y
= I
a
b
(P(X, d) - P(x, c))dx..
5.1. Line Integrals and Green's Theorem 59
Now consider
~ kJ
r kz ~
r
j p(x,y)dx = p(x,y)dx + P(x,y)dx + P(x,y)dx + p(x,y)dx
k,
r r
where C1 is the side from (a, c) to (b, c), Cz is the side from (b, c) to (b, d),
C3 is the side from (b, d) to (a, d), and C4 is the side from (a, d) to (a, c).
Now, along Cz and C4 , dx = 0, while along CI,y = c and along C3 , y =
d. Therefore,
1.
hR
P(x, y)dx =1 Cj
P(x, y)dx + 1 C3
P(x, y)dx
=l +l l
b a b
p(x, c)dx p(x, d)dx = (p(X, c) - P(x, d))dx
= -l a
b
(P(x, d) - P(x, c))dx = - f JRr oP dA.
oy
An analogous argument works to show that
EXAMPLE 5.1.6
Verify Green's theorem when P(x, y) = 4y, Q(x, y) = 5x, and R is the
region bounded by the unit circle centered on the origin. Hence oR is
given by ret) = eit = cos t + i sin t, 0 :::: t :::: 2Jr.
Here ~ = 5, : = 4, so ~ - : = 1. Hence it follows that
Definition 5.1.3
Suppose P(x, y),Q{x, y) are defined on a domain U containing a curve y.
Then f y Pdx + Qfiy is independent of path if the value of the integral
depends only on the endpoints of y not on the curve itself.
Theorem 5.1.2
Suppose P(x,y),Q(x,y) are continuously differentiable functions on a domain
U. Then the following are equivalent:
Proof
Suppose P(x, y)dx + Q{x, y)dy is exact. Then there exists a functionftx, y)
with df = Pdx + Qdy. If Po, PI are two points in U and y is any curve in
U with endpoints Po, PI, then
gil
gl2
1YII
Pdx + Qfiy = 1
Yl2
Pdx + Qdy, or 1 YII
pdx + Qfiy -1Yl2
pdx + Qfiy = O.
5.2. Complex Integration and Cauchy's Theorem 61
1n
Pdx + Qdy = 1nl
pdx + Qdy -1 nz
pdx + Qdy = o.
Therefore, (2) implies (3).
Finally, suppose 1, pdx + Qdy = 0 for any simple closed curve yin U.
Then from Green's theorem
f[
Q
(a - ap) d.A = o.
Interiory ax ay
Since this holds for any simple closed curve, it holds for any region in U
bounded by a simple closed curve in U. This is possible only if
aQ ap aQ ap
---=Oor-=-.
ax ay ax ay
Therefore, Pdx + Qdy is exact, completing the proof.
•
5.2 Complex Integration and Cauchy's
Theorem
Using the theory of real line integration we can define the complex
integral.
Definition 5.2.1
(a) Suppose f(t) = u(t) + iv(t) is a continuous complex function defined
on the interval to :::: t :::: t 1 . Then we define
[
t
l
f(t)dt =
[tl u(t)dt + i
[tl v(t)dt. (5.2.1)
to to to
1 y
f(z)dz = l to
tl
fCYCt))y'(t)dt. (5.2.2)
1
YI
f(z)dz = 1
Y2
f(z)dz,
EXAMPLE 5.2.1
Evaluate f y Z3 dz, where y is the straight line segment from (0, 0) to (I, 1).
Now, z3 = (x + iy)3 = x3 - 3xy2 + i(3x 2y - y3), while the straight line
segment from (0, 0) to (1, 1) is given by y(t) = t + it, 0 :s t :s 1. Therefore,
+ (x 3 - 3xy2)dy
= 1
1
(t 3 - 3t3)dt - (3t 3 - t3)dt +i 11
(3t 3 - t3)dt + (t3 - 3t3)dt
= 1
1
3
-4t dt = -1.
o
From the basic results on line integration all the standard integration
properties carry over to the complex integral.
Lemma 5.2.1
Let f(z),g(z) be continuous complex functions, a,f3 E C. Then:
1 = 1(au
y
fCz)dz
yOx
dx - av
ox
dY ) + i
yOX
1
(av dx + au
ox
dY ) (5.2.4)
Lemma 5.2.2
Suppose F(z) is analytic in a region U and fez) = F'(z). Then if y is any
curve in U with endpoints ZO,Zl,
Corollary 5.2.1
Jy zndz = = ;.rr -
n+1 zn+1 Z"+I
~+l I;~ ~ for any natural number n and any curve y
with endpoints ZO,Zl.
EXAMPLE 5.2.2
Evaluate Jyz 3 dz, where y is the straight line segment from (0, 0) to (1,1).
Here the endpoints of yare 0 and 1 + i, and hence using Corollary
5.2.1 we have
1+i
{z3dz= ~
4
(1 + i)4
-- ---1 .
J
y 4
4
°
Notice that this of course agrees with what we computed directly in
Example 5.2.1.
As an immediate consequence of Lemma 5.2.2 we see that
i F'(z)dz =0
for any closed curve y. We now generalize this to analytic integrands.
i
Consider
au Ov au
ax = (jy'(jy =
Therefore, if these partials are continuous in U, it follows that each
of the differential forms in the integral above is exact, and hence
the line integrals are independent of path. It follows that under these
conditions
1 fCz)dz = 0 (5.2.6)
for any simple closed curve y contained in U. This result was orig-
inally given by Cauchy in the early 1800's. Goursat, later in the
century, proved that the analyticity of fCz) alone was sufficient (the
proof does not depend on the continuity of the partials and thus is
stronger than what can be obtained from Green's theorem). This is
then what is termed Cauchy's theorem or the Cauchy-Goursat the-
orem. In appendix B we give a formal proof of the extended Goursat
re~ 0
Theorem 5.2.1
(Cauchy's Theorem) Let fez) be analytic throughout a simply connected
domain U and suppose y is a closed contour entirely contained in U. Then
i fCz)dz = O.
Theorem 5.2.2
Let fez) be analytic throughout a simply connected domain U. Then fez) =
F'(z) for some function F(z) analytic in U.
Proof
Let Zo be a fixed point in U. For any curve y in U and beginning at Zo
it follows from Cauchy's theorem that the value of the integral JyfCz)dz
depends only on the endpoint of y. 1b see this suppose y, Yl are two curves
beginning at Zo and ending at Zl as in Figure 5.6.
5.2. Complex Integration and Cauchy's Theorem 65
Let yz be the closed path going out along y and back to Zo along YI.
Since f(z) is analytic throughout U we have from Cauchy's theorem
{ f(z)dz = ( f(z)dz.
Jy JY1
Now let F(z) = J~ f(w)dw. From the above discussion this is well-
defined, and now we show that F'(z) = f(z).
Zo
+
6Z
f(w)dw - 1z
Zo
f(w)dw )
= lim - 1
Llz-+O ~z
l
Z
Z LlZ
+ f(w)dw.
lim - 1
Llz-+O ~z
i
Z
Z
+
LlZ
f(w)dw = fez),
and hence F'(z) = f(z).
•
66 5. Complex Integration and Cauchy's Theorem
Theorem 5.3.1
(Cauchy Integral Formula) Let f (z) be analytic in a simply connected domian
u containing a simple closed contour y. If Zo is any point interior to y, then
f(zo) = -.
1
2m
1 fCz)
--dz,
y z - Zo
where the integral is taken in a counterclockwise direction around y.
Before giving the proof of Theorem 5.3.1, let us note that this theorem
implies that the values of a function that is analytic in U within the in-
terior of a curve y in U are completely determined by the values on the
boundary ofy. Thus analyticity forces this extremely strong relationship
between values on the boundary ofsuch curves and values in the interior.
Proof
(Theorem 5.3.1) Let Co be a circle of radius ro centered at zoo If ro is small
enough, Co will be interior to y as pictured in Figure 5.7.
r
1 fez) dz
y z - Zo
=
lco z -
fCz) dz.
Zo
5.3. The Cauchy Integral Formula and Cauchy's Estimate 67
1 1
Then
1 yZ-ZO
f(z) dz = f(zo)
CoZ-ZO
~ +
Co
f(z) - f(zo) dz.
z-zo
(5.3.2)
1 ~ =i
Co z - Zo
r dt = 211:i.
10
2tr
'1Co
f(z) - f(zo) dzl
Z - Zo
~ 1 Co
If(z) - f(zo)lldzl < .:. 211:Yo =
Iz - zol Yo
211:E.
Thus the absolute value of the second integral in (5.3.2) can be made
arbitrarily small. Its value must therefore be zero and hence
1y
f(z)
--dz
z - Zo
.
= 2mf(zo).
•
EXAMPLE 5.3.1
Evaluate f y zf... g , where y is any simple closed contour not containing
z = -3 in its interior.
Let f(z) = Z~3' If Y does not contain z = -3 in its interior, then f(z) is
analytic on y and its interior. Therefore, from the Cauchy integral formula
1 dz
y Z2 - 9 = 1
y (z
dz
+ 3)(z - 3) = 1 y
fez)
z _ 3 dz = 2:;rif(3).
Therefore, this value is 211:i/6.
Recall from advanced calculus that if g(x, t) is a differentiable
two-variable function and
f(t) = l b
g(x, t)dt
then
f'(t) = l
a
b
ag (x, t)dt.
at
That is, if a function is defined by integrating a two-variable differ-
entiable function with respect to one variable, then its derivative is the
integral of the partial derivative with respect to that variable.
Now let us apply this idea to the Cauchy integral formula to obtain an
expression for the derivative of an analytic function. We have
1
formula
Corollary 5.3.1
Let f (z) be analytic in a simply connected domain Ucontaining a simple closed
curve y. If Zo is interior to y then an expression for the n - th derivative is
fcn) Zo
()
=~
2m
1 fez)
y (z - zO)n+l
dz.
Corollary 5.3.2
If fez) is analytic in a simply connected domain U, then it has derivatives of
all orders in U.
Corollary 5.3.3
The derivative ofan analytic function is itselfanalytic.
We note here the contrast with real-valued functions on lit For func-
tions y = fC x) with x, y E lR. it is possible to be differentiable but not twice
differentiable. In general, a function f : [a, b] ~ lR. is of class en if it is
n-times differentiable. For each n, Cn +1 is a proper subset of en - that
is there are n-times differentiable functions that are not (n + 1)-times
differentiable.
The class Coo consists of those functions that are infinitely many times
differentiable. Examples include e' and sin x. Corollary 5.3.2 says that
for complex functions, being analytic in a region implies Coo in that
region.
5.3. The Cauchy Integral Formula and Cauchy's Estimate 69
Theorem 5.3.2
Let fez) be analytic at all points within a circle Co ofradius ro centered at zoo
Then within Co,f(z) is represented by a convergent Thylor series centered at
Zo- that is, for all z within Co,
(f(n) (zo) n
+ f (zo)(z - + ... + + ....
I
fez) = f(zo) zo) (z - zo)
n!
Theorem 5.3.3
(Cauchy's Estimate) Suppose fez) is analytic in a simply connected domain
U containing the circle Co of radius ro centered at zoo If M is the maximum
value of If(z) I on Co, then
Proof
From Corollary 5.3.1
Mn!
Yf •
70 5. Complex Integration and Cauchy's Theorem
Theorem 5.4.1
(Liouville's Theorem) Suppose f (z) is entire and If(z) I is bounded for all values
of z E C. Then f( z) is a constant.
(2) More generally, if Ifcn)(z) I is bounded throughout C, then fez) is a
polynomial ofdegree at most n + 1.
Proof
Suppose f(z) is entire and If(z) I :::: M for all z E C. Then from Cauchy's
estimate on a circle of radius r centered at the origin,
M
If'(z) I < - .
r
Since fez) is entire, we can let r ~ 00. Therefore, If'(z) I = 0 and hence
f' (z) = O. This of course implies that f(z) must be a constant, completing
part (1).
For part (2), if If cn )(z) I :::: M for all z E C, then again from Cauchy's
estimate
Ifcn+1)(z) I :::: M
r
on any circle of radius r centered at the origin. Again, by letting r ~ 00
we would obtain that fCn+I)(z) = 0 and hence fCn)(z) is a constant. Then
f(z) is a polynomial of degree at most n + 1 byantidifferentiation. •
Theorem 5.4.2
(Fundamental Theorem ofAlgebra) Let P(z) be a complexpolynomial ofdegree
2: 1. Then P(z) has at least one complex root.
Proof
Suppose P(z) is a complex polynomial, and let f(z) = ~Z)' If P(z) had no
complex root, thenf(z) would also be an entire function.
5.5. Some Additional Results 71
Since Ip(z) I -+ 00 as Izi -+ 00, there exists M, r > 0 such that IF(z) I >
M if Izi > r. This implies that for Izl > r, IJtz) I = I~Z)I < fJ·If Jtz)
were entire, it would be continuous and thus bounded on the compact set
Izl ~ r. It follows that ifJtz) were entire it would be bounded throughout C.
From Liouville's theorem it would then follow that f(z) must be a constant.
However, then p(z) would also be a constant, which is a contradiction.
Therefore, F(z) must be zero for at least one value of z E C. •
Theorem 5.5.1
(Morera'S Theorem) Suppose f(z) is continuous throughout a simply connected
domain U and fJ(z)dz = 0 for every simple closed curve Y interior to U.
Then fez) is analytic throughout U.
Proof
The proof is essentially the same as the proof of Theorem 5.2.2.
Let Zo be a fixed point in U. For any curve Yl beginning at zo, the value
of 1,n Jtz)dz depends only on the endpoint of Yl since 1,y Jtz)dz = 0 for
every simple closed curve Y interior to U. Therefore,
F(z) = l
Zo
z
Jtw)dw
The second result, which we just state, is called the maximum modu-
lus theorem. This says that for an analytic function on a compact domain
its maximum modulus must occur on the boundary.
Theorem 5.5.2
(Maximum Modulus Theorem) Iff(z) is nonconstant and analytic through-
out a bounded domain D and continuous on the boundary of D, then If(z) I
assumes its maximum value on the boundary ofD, never at an interior point.
Exercises
5.1. Evaluate the following line integrals:
(a) Jrex2 + y)dx + (x - y2)dy, where y is the straight line segment from (0,0)
to (2,3).
Exercises 73
1-- =
Co
dz
z - Zo
2m. and 1
Co
dz
(z - zor
= 0 lfn
. = 2,3, ....
if Co is the circle YCt) = Zo + re it , 0 :::: t :::: 21f. (See how this compares with
the Cauchy integral formula.)
5.7. Let ybe the boundary of the square region -2:::: x :::: 2, -2 :::: y :::: 2 taken
counterclockwise. What are the values of the following integrals and why?
(a) f y Z-~i/2 dz.
(b) f y z(z1~8) dz.
(c) f y 2z~1 dz.
(d) f y (Z-0~i))2 dz.
5.8. If f(z) is analytic within and on a curve y with Zo in its interior, show that
1 f'(z) dz
y z - Zo
-1
-
f(z)
y (z - zo)z
dz
.
J
5.9. Show that the real function f(x) = e-;I is Coo at x = 0 but not real analytic
atx = O.
Fields and Field
Extensions
CHAPTER
Lemma 6.1.1
If F C F' C F" are fields with F" a finite extension of F, then IF':FI and
IF":F'I are also finite, and IF":FI = IF":F'IIF':FI.
74
6.1. Algebraic Field Extensions 75
Proof
The fact that IF' : FI and IF" : F'I are also finite follows easily from
linear algebra since the dimension of a subspace must be less than the
dimension of the whole vector space.
If IF' : FI = n with al, ... , an a basis for F' over F, and IF" : F'I = m
with f3l, ... , 13m a basis for F" over F' then the mn products {aif3j} form
a basis for F" over F (see the exercises). Then IF" : FI = mn = IF" :
F'IIF': Fl. •
In the case ofthe lemma we say that F' is an intermediate field (when
F and F" are understood) and F is the ground field.
EXAMPLE 6.1.1
(See Example 2.1.3) C is a finite extension of JR, but JR is an infinite
extension of Q. 0
Definition 6.1.1
Suppose F' is an extension field of F and a E F'. Then a is algebraic over
F if there exists a polynomial 0 =I p(x) E F[x] withp(a) = O. (a is a root
of a polynomial with coefficients in F.) If every element ofF' is algebraic
over F, then F' is an algebraic extension of F.
If a E F' is nonalgebraic over F then a is called transcendental over
F. A nonalgebraic extension is called a transcendental extension.
Lemma 6.1.2
Every element of F is algebraic over F.
Proof
Iff E F thenp(x) =x- f E F[x] andpCf) = O. •
The tie-in to finite extensions is via the following theorem.
Theorem 6.1.1
If F' is a finite extension of F, then F' is an algebraic extension.
Proof
Suppose a E F'. We must show that there exists a nonzero polynomial
0=1 p(x) EF[x] withp(a) = O.
Since F' is a finite extension, IF' : FI = n < 00. This implies that
there are n elements in a basis for F' over F, and hence any set of (n + 1)
elements in F' must be linearly dependent over F.
76 6. Fields and Field Extensions
Lemma 6.1.3
If a E F' is algebraic over F, then there exists a unique monic irreducible
polynomial p(x) E F[x] such that pea) = o.
This unique monic irreducible polynomial is denoted by irr(a,F).
Proof
Suppose f(a) = 0 with 0 i= f(x) E F[x]. Then f(x) factors into irreducible
polynomials. Since there are no zero divisors in a field, one of these fac-
tors, say PI (x) must also have a as a root. If the leading coefficient ofPI (x)
is an thenp(x) = a;;-IpI(x) is a monic irreducible polynomial in F[x] that
also has a as a root.
Therefore, there exist monic irreducible polynomials that have a as a
root. Let p(x) be one such polynomial of minimal degree. It remains to
show that p(x) is unique.
Suppose g(x) is another monic irreducible polynomial with g(a) = O.
Since p(x) has minimal degree, deg p(x) ::s deg g(x). By the division
algorithm
g(x) = q(x)p(x) + rex) (6.1.2)
where rex) == 0 or deg rex) < degp(x). Substituting a into (6.1.2) we get
g(a) = q(a)p(a) + rea),
6.1. Algebraic Field Extensions 77
which implies that rea) = 0 since g(a) = pea) = O. But then ifr(x) is not
identically 0, a is a root of r(x) , which contradicts the minimality of the
degree of p(x). Therefore, r(x) = 0 and g(x) = q(x)p(x). The polynomial
q(x) must be a constant (unit factor) since g(x) is irreducible, but then
q(x) = 1 since both g(x),p(x) are monic. This says that g(x) = p(x), and
hence p(x) is unique. •
Suppose a E F' is algebraic over F and p(x) = irr(a, F). Then there
exists a smallest intermediate field E with FeE c F' such that a E E. By
smallest we mean that if E' is another intermediate field with a E E' then
E C E'. 1b see that this smallest field exists, notice that there are subfields
E' in F' in which a E E' (namely F' itself). Let E be the intersection all
subfields of F' containing a and F. E is a subfield of F' (see the exercises)
and E contains both a and F. Further, this intersection is contained in any
other subfield containing a and F.
This smallest subfield has a very special form.
Definition 6.1.2
Suppose a E F' is algebraic over F and p(x) = irr(a, F) = ao + alX + ... +
an_Ixn- 1 + x n . Let
F(a) = {fo + fia + ... + fn_Ian-l; fi E F}.
On F(a) define addition and subtraction componentwise and define mul-
tiplication by algebraic manipulation, replacing powers of a higher than
an by using
Theorem 6.1.2
F(a) forms a finite algebraic extension ofF with IF(a ):FI = deg irr(a,F). F(a)
is the smallest subfield of F' that contains the root a. A field extension of the
form F(a) for some a is called a simple extension of F.
Proof
Recall that Fn-l[x] is the set of all polynomials over F of degree:::: n -
1 together with the zero polynomial. This set forms a vector space of
dimension n over F. As defined in definition 6.1.2, relative to addition
and subtraction F(a) is the same as Fn-l[x], and thus F(a) is a vector
space of dimension deg irr(a, F) over F and hence an abelian group.
Multiplication is done via multiplication of polynomials, so it is
straightforward then that F(a) forms a commutative ring with an iden-
tity. We must show that it forms a field. 1b do this we must show that
every nonzero element of F(a) has a multiplicative inverse.
Suppose 0:1= g(x) E F[x]. Ifdegg(x) < n = deg irr(a, F), theng(a) :1= 0
since irr(a, F) is the irreducible polynomial of minimal degree that has a
as a root.
78 6. Fields and Field Extensions
If hex) E F[x] with deg hex) 2: n, then h(a) = hl(a), where hl(x)
is a polynomial of degree ~ n - I, obtained by replacing powers of a
higherthan an by combinations of lower powers using
an = -aD - ala - ... - an_la n- l
Now suppose g(a) E F(a), g(a) # O. Consider the corresponding poly-
nomialg(x) E F[x] ofdegree ~ n-l. Sincep(x) = irr(a, F) is irreducible, it
follows thatg(x) andp(x) must be relatively prime,that is, (g(x),p(x)) = l.
Therefore, there exist hex), k(x) E F[x] such that
g(x)h(x) + p(x)k(x) = l.
Substituting a into the above we obtain:
g(a)h(a) + p(a)k(a) = l.
However, pea) = 0 and h(a) = hl(a) E F(a), so that
g(a)hl(a) = l.
It follows then that in F(a), h1(a) is the multiplicative inverse of g(a).
Since every nonzero element of F(a) has such an inverse F(a) forms a
field.
F is contained in F(a) by identifying F with the constant polynomials.
Therefore, F(a) is an extension field of F. From the definition of F(a), we
have that {I, a, a 2, ... , a n- 1 } form a basis, so F(a) has degree n over F.
Therefore, F(a) is a finite extension and hence an algebraic extension.
If FeE c F' and E contains a, then clearly E contains all powers of a
since E is a subfield. E then contains F(a), and hence F(a) is the smallest
subfield containing both F and a. •
EXAMPLE 6.1.3
Consider p(x) = x3 - 2 over Q. This is irreducible over Q but has the root
a = 21/ 3 E R The field Q(a) = Q(2 l / 3) is then the smallest subfield oflR
that contains Q and 21/ 3 .
Here
Q(a) = {qo + qla + q2a2; qi E Q and a 3 = 2}.
We first give examples of addition and multiplication in Q(a).
Let g = 3 + 4a + Sa 2 , h = 2 - a + a 2. Then
2
g+h=S+3a+6a
and
gh = 6 - 3a + 3a 2 + 8a - 4a2 + 4a3 + lOa 2 - Sa 3 + Sa 4
= 6 + Sa + 9a2 - a3 + Sa4 •
But a 3 = 2, so a4 = 2a, and then
gh = 6 + Sa + 9a 2 - 2 + S(2a) = 4 + ISa + 9a2 •
6.1. Algebraic Field Extensions 79
Definition 6.1.3
Let F', F" be extension fields of F. An F-isomorphism is an isomorphism
a : F' -+ F" such that a(f) =[ for all [ E F. That is, an F-isomorphism is
an isomorphism of the extension fields that fixes each element of the
ground field. If F', F" are F-isomorphic, we denote this relationship by
F'~F".
F
Lemma 6.1.4
Suppose a,fJ E F' are both algebraic over F and suppose irr(a,F) = irr(fJ,F).
Then F(a) is F-isomorphic to F(fJ).
Proof
Define the map a : F(a) -+ F(fJ) by a(a) = fJ and a(1) = [ for all [ E F.
Allow a to be a homomorphism - that is preserve addition and multipli-
cation. It follows then that a maps [0 + fia + ... + [nan-1 E F(a) to [0 +
[lfJ + ... + [nfJ n- 1 E F(fJ). From this it is straightforward that a is an
F-isomorphism. •
same. We now show that the set of algebraic elements over a ground field
is closed under the arithmetic operations and from this obtain that the
algebraic elements then form a subfield.
Lemma 6.1.5
If a,/3 E F' are two algebraic elements over F, then a ± /3,a/3, and a/ /3 are
also algebraic over F.
Proof
Since a, /3 are algebraic, the subfield Fea, fJ) will be of finite degree over
F and therefore algebraic over F. Now, a, /3 E F(a, fJ) and since F(a, fJ) is
a subfield, it follows that a ± /3, a/3, and a/ /3 are also elements of F(a, fJ).
Since F(a, fJ) is an algebraic extension of F, each of these elements is
algebraic over F. •
Theorem 6.1.3
If F' is an extension field ofF, then the set ofelements ofF' that are algebraic
over F forms a subfield. This subfield is called the algebraic closure of F
in F'.
Proof
Let AF(F') be the set of algebraic elements over F in F'. AF(F') :f. 0
since it contains F. From the previous lemma it is closed under addi-
tion, subtraction, multiplication, and division, and therefore it forms a
subfield. •
We close this section with a final result, that says that every finite
extension is formed by taking successive simple extensions.
Theorem 6.1.4
IfF' is a finite extension ofF, then there exists a finite set ofalgebraic elements
ai, ... ,an such that F' = F(ai, ... ,an).
Proof
Suppose IF' : FI = k < 00. Then F' is algebraic over F. Choose an
al E F', al e F. Then Fe F(al) C F' and IF' : F(aI) I < k. If the degree
of this extension is I, then F' = F(al), and we are done. Ifnot, choose an
az E F', az e F(al). Then as above Fe F(al) C F(al, az) C F' with IF' :
F(al, az)1 < IF' : F(al)l. As before, if this degree is one we are done;
if not, continue. Since k is finite this process must terminate in a finite
number of steps. •
6.2. Adjoining Roots to Fields 81
Theorem 6.2.1
(Kronecker's Theorem) Let F be a field and rex) E F[x] an irreducible polyno-
mial over F. Then there exists a finite extension F' ofF where f(x) has a root.
Further, if a is a root in some extension F" with irr(a,F) = f(x), then F' is
F-isomorphic to F(a).
Proof
1b construct the field F' we essentially mimic the construction of F(a) as
in the last section.
Suppose f(x) = ao + alX + ... + anx n with an f. O. Define a to satisfy
F' = F(a) then forms a field of finite degree over F - the proof being
identical to that ofthe last section. The difference between this construc-
tion and the construction in Theorem 6.1.2 is that here a is defined to be
the root and we constructed the field around it, whereas in the previous
construction a was assumed to satisfy the polynomial and F(a) was an
already existing field that contained a. •
EXAMPLE 6.2.1
Let f(x) = x2 + 1 E lR[x]. This is irreducible over lR. We construct the field
in which this has a root.
82 6. Fields and Field Extensions
Definition 6.2.1
Let R be a commutative ring (*). A subring I C R is an ideal if rI C
I for all r E R.
(*) (Ideals can also be defined for noncommutative rings, but this is
not necessary for us here.)
EXAMPLE 6.2.2
In the integers Z, let nZ = {nz; z E Z} be the set of all multiples of n.
Then nZ is a subring and if m E Z, m(nzl) = n(mzl) E nZ, so nZ is an
W~. 0
EXAMPLE 6.2.3
Let R be a commutative ring and let r E R. Let (r) = {rlr; rl E R} = all
multiples of r in R. Then (r) forms an ideal (see the exercises) called the
principal ideal generated by r.
Note that in this language nZ is just the principal ideal (n) in Z. 0
EXAMPLE 6.2.4
Let R be a commutative ring, I C R, an ideal and let r E R. Define
Then (r, 1) forms an ideal (see the exercises) called the ideal generated
by rand I. 0
Definition 6.2.2
Let I C R be an ideal. A coset of I in R is a subset of the form r + I for r
a given element in R. We denote the set of all cosets of the ideal I C
R by RII.
Lemma 6.2.1
The set R/ I of cosets of I in R partition R.
6.2. Adjoining Roots to Fields 83
Proof
On R define the relation r1 r2 if r1 - r2 E I. This is an equivalence
'V
Theorem 6.2.2
Given a commutative ring R and an ideal I C R, then the set of cosets R/ I
forms a commutative ring under the operations defined above. The coset 0 + I
is the zero element of RII, while if R has a multiplicative identity 1 then the
coset 1 + I is the multiplicative identity for R/I. The ring RII is called the
quotient ring, or factor ring of R modulo the ideal I.
EXAMPLE 6.2.5
Consider the ring of integers Z and the ideal nZ. Let Z/nZ be the set of
cosets of nZ in Z.
If Xl, X2 E Z, then Xl X2 if Xl - X2 E nZ, that is if nlx1 - X2.
'V
Definition 6.2.3
An ideal I C R is a maximal ideal if I i= Rand (r, 1) = R for any r E R,
reI.
Lemma 6.2.2
Suppose R is a commutative ring with an identity. Then RII is a field if and
only if I is a maximal ideal.
Proof
Suppose I is a maximal ideal. We show that R/I is a field. RII is a commu-
tative ring with an identity, so we must show that each nonzero element
has a multiplicative inverse.
For r E R let r = r + I be its corresponding coset. Now let r E R/ I,
r i= O. Then r = r + I with reI since 0 = I is the zero element of R/ I.
Since I is maximal, (r, 1) = R and therefore 1 E (r, 1). It follows that there
exist r1 E R, i 1 E I such that r1 r + i 1 = 1. In terms of cosets then,
r1r E 1 + I or r1r = I = 1 in R/I.
84 6. Fields and Field Extensions
Now we show how this development relates to the field extension the-
orem. Suppose F is a field and suppose ftx) E F[x] is an irreducible
polynomial over F. Now, F[x] is a commutative ring with an identity.
Consider (ftx)), the principal ideal in F[x] generated by ftx).
Suppose g(x) e ([(x)), so that g(x) is not a multiple of ftx). Since ftx)
is irreducible, it follows that (ftx), g(x)) = 1. Thus there exist hex), k(x) E
F[x] with
h(x)ftx) + k(x)g(x) = 1.
The element on the left is in the ideal (g(x), (ftx)), so the identity, I, is in
this ideal. Therefore, the whole ring F[x] is in this ideal. Since g(x) was
arbitrary, this implies that the principal ideal (ftx)) is maximal.
Let F' = F[x]l(ftx)). From Lemma 6.2.2, F' is a field, and since F C
F[x], it follows that F C F'. Let x be the coset of x. Then f(X) = ftx) = 0
in F', so x is a root in F'. Here the overbars represent cosets. We have
therefore constructed a field F' in which the irreducible polynomial f(x)
has a root.
Definition 6.3.1
If 0 i: ftx) E F[x] and F' is an extension field of F, then ftx) splits in F',
(F' may be F) if f(x) factors into linear factors in F'[x]. Equivalently, this
means that all the roots offtx) are in F'.
F' is a splitting field for ftx) over F ifF' is the smallest extension field
ofFin whichf(x) splits. (A splitting field for ftx) is the smallest extension
field in which ftx) has all its possible roots.)
F' is a splitting field over F if it is the splitting field for some finite
set of polynomials over F.
6.3. Splitting Fields 85
Theorem 6.3.1
If 0 =I f(x) E F[x], then there exists a splitting field for f(x) over F.
Proof
The splitting field is constructed by repeated adjoining of roots. Suppose
without loss ofgenerality that f(x) is irreducible of degree n over F. From
Theorem 6.2.1 there exists a field F' containing a with f(a) = o. Then
f(x) = (x-a)g(x) E F'[x]withdegg(x) = n-l.Byaninductiveargument
g(x) has a splitting field and therefore so does f(x). •
Theorem 6.3.2
Let F be a field. Then the following are equivalent:
(1) F is algebraically closed.
(2) Every nonconstant polynomial f(x) E F[x] splits in F[x].
(3) F has no proper algebraic extensions,that is there is no algebraic field
extension E with FeE and F =I E.
Definition 6.3.2
An extension field F' of F is an algebraic closure of F if F' is algebraic
over F and F' is algebraically closed.
EXAMPLE 6.3.1
For this example we assume the Fundamental Theorem ofAlgebra - that
is, that ((; is algebraically closed. Note that ((; is not the algebraic closure
of Q since ((; is not algebraic over Q. However, the complex algebraic
numbers Ac. that is, the set of complex numbers which are algebraic
over Q, is the algebraic closure ofQ.
1b see this, notice that Ac is algebraic over Q by definition. Now we
show that it is algebraically closed. Let f(x) E Ac[x]. If a is a root of f(x),
then a E ((;, and then a is also algebraic over Q since each element of Ac
is algebraic over Q. Therefore, a E Ac and Ac is algebraically closed.
More generally, if K is an extension field of F and K is algebraically
closed, then the algebraic closure of F in K is the algebraic closure
ofF. 0
86 6. Fields and Field Extensions
Theorem 6.3.3
Every field F has an algebraic closure, and any two algebraic closures of F
are F-isomorphic.
Definition 6.4.1
A group G is a set with one binary operation which we will denote by
multiplication, such that
(1) The operation is associative, that is, (glgZ)g3 = gl (gzg3) for all
gl,gZ,g3 E G.
(2) There exists an identity for this operation, that is, an element 1 such
that Ig = g for each g E G.
(3) Each g E G has an inverse for this operation, that is, for each g there
exists a g-l with the property that gg-l = 1.
Groups most often arise from invertible mappings of a set onto itself.
Such mappings are called permutations.
Definition 6.4.2
1fT is a set, a permutation on T is a one-to-one mapping ofT onto itself.
We denote by ST the set of all permutations on T.
6.4. Permutations and Symmetric Polynomials 87
Theorem 6.4.1
For any set T, ST forms a group under composition called the symmetric
group on T. IfT,T I have the same cardinality (size), then ST ~ ST], 1fT is
a finite set with ITI = n, then ST is a finite group and ISTI = n!.
Proof
If ST is the set of all permutations on the set T, we must show that com-
position is an operation on ST that is associative and has an identity and
inverses.
Letf,g EST. Thenf,g are one-to-one mappings ofT onto itself. Con-
sider fog : T --+ T. If f 0 g(tl) = f 0 g(tz), then fCg(tI)) = fCg(tz)) and
g(tt) = g(tz), since f is one-to-one. But then tl = tz since g is one-to-one.
If t E T, there exists tl E T with fCtl) = t since f is onto. Then
there exists tz E T with g(tz) = tl since g is onto. Putting these together,
fCg(tz)) = t, and therefore f og is onto. Therefore, fog is also a permutation
and composition gives a valid binary operation on ST'
The identity function l(t) = t for all t E T will serve as the identity for
ST, while the inverse function for each permutation will be the inverse.
Such unique inverse functions exist since each permutation is a bijection.
Finally, composition of functions is always associative and therefore
ST forms a group.
If T, T I have the same cardinality, then there exists a bijection a :
T --+ T I . Define a map F : ST --+ ST] in the following manner: iff E ST,
let FCf) be the permutation on T I given by FCf)(tI) = acrea-I (tl)))' It is
straightforward to verifY that F is an isomorphism (see the exercises).
Finally, suppose ITI = n < 00. Then T = ttl, ... , tn}. Eachf E ST can
be pictured as
tl ... tn )
f= ( fCtt) ... fCt n ) .
For tl there are n choices for fCtl). For tz there are only n - 1 choices since
f is one-to-one. This continues down to only one choice for tn' Using the
multiplication principle, the number of choices for f and therefore the
size of ST is
n(n - 1) .. ·1 = n!.
EXAMPLE 6.4.1
Write down the six elements of S3 and give the multiplication table for
the group.
88 6. Fields and Field Extensions
Name the three elements 1,2,3. The six elements of 83 are then:
2 2 2
1= (~ 2 ~),a=(~ 3 ~),b=(~ 1 ~)
2 2 2
e = (~ 1 ~),d=(~ 2 ~),e=(~ 3 ~).
The multiplication table for 83 can be written down directly by doing
the required composition. For example,
ac = ( 12 2 3) (1 2 3)
3 1 2 1 3 = (13 2 3)
2 1 = d.
Th see this, note that a : 1 ~ 2, 2 ~ 3, 3 ~ 1; e : 1 ~ 2, 2 ~ I, 3 ~
3 and so ae : 1 ~ 3, 2 ~ 2, 3 ~ 1.
It is somewhat easier to construct the multiplication table if we make
some observations. First, a 2 = band a 3 = 1. Next, e2 = I, d = ac, e = a 2e
and finally ae = ea2 •
From these relations the following multiplication table can be con-
structed
1 a a2 e ae a 2e
1 1 a a2 e ae a 2e
2
a a a 1 ae a 2e e
a2 a2 1 a a 2e e ac
2 2
e e a e ae 1 a a
ac ae e a 2e a 1 a2
a 2e a 2e ac e a2 a 1
Th see this, consider, for example, (ac)a 2 = a(ea 2) = a(ae) = a 2e.
More generally, we can say that 83 has a presentation given by
83 =< a, e; a 3 = e2 = I, ae = ea 2 > .
An important result, the form of which we will see later in our work
on extension fields, is the following.
Lemma 6.4.1
Let T be a set and T 1 eTa subset. Let H be the subset of 8T that fixes
each element of T1 - that is, f E H if f(t) = t for all t E T1. Then H is a
subgroup.
6.4. Permutations and Symmetric Polynomials 89
Proof
H :f; 0 since I E H. Now suppose hI, h z E H. Let tl E Tl and consider
hI 0 hz(tl) = hI (hZ(tl))' Now hZ(tl) = tl since hz E H, but then hI (tl ) = tl
since hI E H. Therefore, hI 0 h z E Hand H is closed under composition.
If hI fixes tl then hI l also fixes tl so H is also closed under inverses and
is therefore a subgroup. •
Definition 6.4.3
Let Yl, . .. , Yn be (independent) indeterminates over a field F. A polyno-
mial fCYl, , Yn) E F[Yl' ... , Yn] is a symmetric polynomial in Yl, ... ,
Yn if fCYl, ,Yn) is unchanged by any permutation a of {Yl' ... ,Yn}, that
is, fCYl' ,Yn) = fCaCYl), ... , aCYn)).
If F C F' are fields and aI, ... , an are in F', then we call a poly-
nomial fCaI, ... , an) with coefficients in F symmetric in aI, ... , an if
fCaI, ... , an) is unchanged by any permutation a of {aI, ... , an} .
EXAMPLE 6.4.2
Let F be a field and fo, Ii E F. Let h(Yl, YZ) = fO(Yl + YZ) + Ii (YlYZ).
There are two permutations on {Yl, Yz}, namelyal : Yl -+ Yl, yz -+ yz
and az : Yl -+ Yz, yz -+ Yl·
Applying either one of these two to {Yl, YZ} leaves hCYl, yz) invariant.
Therefore, h(Yl, yz) is a symmetric polynomial. 0
Definition 6.4.4
Let x, Yl, ... , Yn be indeterminates over a field F(or elements of an
extension field F' over F). Form the polynomial
P(X,Yl, ... ,Yn) = (x - Yl)." (x - Yn).
The ith elementary symmetric polynomial 8i in Yl, ... ,Yn for i
I, ... , n, is (-1 )iai , where ai is the coefficient of xn- i in p(x, Yl, ... , Yn).
EXAMPLE 6.4.3
Consider Yl, yz, Y3· Then
P(X,Yl,Yz,Y3) = (x - Yl)(X - yz)(x - Y3)
Z
= x3 - (Yl + yz + Y3)X + (YlYZ + YlY3 + YZY3)X - YlYzY3·
Therefore, the three elementary symmetric polynomials in Yl, Yz, Y3
over any field are
(1) 81 = Yl + yz + Y3·
(2) 8z = YlYZ + Y1Y3 + YZY3·
(3) 83 = YlYzY3.
90 6. Fields and Field Extensions
In general, the pattern of the last example holds for YI, ... ,Yn. That is,
Sl = YI + yz + ... + Yn
Sz = YIYZ + YIY3 + ... + Yn-IYn
S3 = YIYZY3 + YIYZY4 + .... +Yn-ZYn-IYn
sn = YI .. ·Yn· o
The importance of the elementary symmetric polynomials is that any
symmetric polynomial can be built up from the elementary symmet-
ric polynomials. We makethis precise in the next theorem called the
fundamental theorem of symmetric polynomials. We will use this
important result several times, and a complete proof for it will be given
in Section 6.7.
Theorem 6.4.2
(Fundamental Theorem of Symmetric Polynomials) If P is a symmetric poly-
nomial in the indeterminates YI, ... ,Yn over F, that is, P E F[YI, ... ,Yn]
and P is symmetric, then there exists a unique g E F[YI, ... ,Yn] such
that fCYl, . .. ,Yn) = gCSl,'" ,sn). That is, any symmetric polynomial in
YI, ... ,Yn is apolynomial expression in the elementary symmetric polynomials
inYI, ... ,Yn·
From this theorem we obtain the following two lemmas, which will be
crucial in our next proof of the Fundamental Theorem of Algebra.
Lemma 6.4.2
Let p(x) E F[x] and suppose p(x) has the roots aI, ... ,an in the splitting field
F'. Then the elementary symmetric polynomials in aI, ... ,an are in F.
Proof
Suppose p(x) = fo + fIX + ... + fnxn E F[x]. In F'[x], p(x) splits, with roots
aI, ... , an, and thus in F'[x],
Lemma 6.4.3
Let p(x) E F[x] and suppose p(x) has the roots aI, ... ,an in the splitting field
F'. Suppose fUrther thatg(x) = g(x,al, ... ,an) E F/[x]. Ifg(x) is a symmetric
polynomialinal," .,an, theng(x) E F[x).
Proof
Ifg(x) = g(x, aI, ... , an) is symmetric in aI, ... , an, then from Theorem
6.4.2 it is a symmetric polynomial in the elementary symmetric polyno-
mials in aI, ... , an. From Lemma 6.4.2 these are in the ground field F, so
the coefficients of g(x) are in F. Therefore, g(x) E F[x]. •
Theorem 6.5.1
(Fundamental Theorem ofAlgebra) Any nonconstant complex polynomial has
a complex root. In other words, the complex number field C is algebraically
closed.
Lemma 6.5.1
Any odd-degree real polynomial must have a real root.
Proof
Recall that this was Theorem 3.4.1 and was a consequence of the
intermediate value theorem.
Suppose P(x) E lR[x] with deg P(x) = n = 2k + 1 and suppose the
leading coefficient an > 0 (the proof is almost identical if an < 0). Then
P(x) = anx n + (lower terms)
and n is odd. Then,
(1) limx--+ooP(x) = limx--+ooanxn = 00 since an > O.
(2) limx--+_ooP(x) = limx--+_ooanx n = -00 since an > 0 and n is odd.
From (l), P( x) gets arbitrarily large positively, so there exists an Xl with
P(XI) > O. Similarly, from (2) there exists an X2 with P(X2) < o.
92 6. Fields and Field Extensions
Proof
Recall that this was Lemma 3.4.1 and was a consequence ofthe quadratic
formula and of the fact that any complex number has a square root.
If p(x) = ax.2 + bx + c, a =f 0, then the roots formally are
-b + ,Jb2 - 4ac -b - ,Jb2 - 4ac
Xl = 2a ,X2 = 2a
Proof
This was Theorem 3.4.2 and depended on the concept of the conjugate
of a complex polynomial (see Section 3.4).
Let p(x) E C[x] and suppose that every nonconstant real polynomial
has at least one complex root. Let H(x) = p(x)P(x). From Lemma 3.4.3,
H(x) E lR[x]. By supposition there exists a Zo E C with H(zo) = o. Then
p(zo)P(zo) = 0, and since C has no zero divisors, either P(zo) = 0 or
P(zo) = O. In the firstcasezo isa ro~ofp(x). In the second case P(zo) = o.
Then from Lemma 3.4.2 P(zo) = P(ZO) = P(ZO) = o. Therefore, Zo is a
~cl~. •
Lemma 6.5.4
Any nonconstant real polynomial has a complex. root.
Proof
LetJtx) = ao + alx + ... + anx n E lR[x] with n ~ 1, an =f o. The proof is
an induction on the degree n ofJtx).
Suppose n = 2m q where q is odd. We do the induction on m. Ifm = 0
then f(x) has odd degree and the theorem is true from Lemma 6.5.1.
Assume then that the theorem is true for all degrees d = 2k q' where
k < m and if is odd. Now assume that the degree of f(x) is n = 2m q.
6.5. The Fundamental Theorem of Algebra - Proof Three 93
Suppose F' is the splitting field for f(x) over lR. in which the roots are
al, ... , an' This exists from our discussion in section 6.3. We show that
at least one of these roots must be in C. (In fact, all are in C but to prove
the lemma we need only show at least one.)
Let h E Z and form the polynomial
H(x) = D(x - (ai + aj + haiaj)).
i<j
This is in F'[x]. In forming H(x) we chose pairs of roots {ai, ajl. so the
number of such pairs is the number of ways of choosing two elements
out of n = zmq elements. This is given by
( zm q)(zm q - 1) = Zm-Iq(Zm _ 1) = zm-Iel
q
Z
with q' odd. Therefore, the degree of H(x) is zm-I q'.
H(x) is a symmetric polynomialin the roots aI, ... , an. Since aI, ... ,an
are the roots of a real polynomial, from Lemma 6.4.3 any polynomial in
the splitting field symmetric in these roots must be a real polynomial.
Therefore, H(x) E lR.[x] with degree zm-l q'. By the inductive hypothesis,
then, H(x) must have a complex root. This implies that there exists a pair
{ai, aj} with
ai + aj + haiaj E Co
Since h was an arbitrary integer, for any integer hI there must exist
such a pair {ai, aj} with
ai + aj + hlaiaj E C.
Now let hI vary over the integers. Since there are only finitely many
such pairs {ai, ajl. it follows that there must be at least two different
integers hI, h z such that
Zl = ai + aj + hlaiaj E C and Zz = ai + aj + hZaiaj E Co
Then Zl - Zz = (hI - hZ)aiaj E C and since hI, h z E Z C C it follows
that aiaj E Co But then hlaiaj E C, from which it follows that ai + aj E Co
Then,
p(x) = (x - ai)(x - aj) = XZ - (ai + aj)x + aiaj E C[x].
However,p(x) is then a degree-two complex polynomial and so from
Lemma 6.5.Z its roots are complex. Therefore, ai, aj E C, and therefore
f(x) has a complex root. •
Lemma 6.6.1
a E <C is an algebraic number ifand only ifthere exists a primitive polynomial
pCx) E Z[x] with pCa) = O.
than the degree of Pa(x) , since d(x) divides p~(x), which contradicts the
irreducibility of Pa(x),
The complex numbers aI, ... , an are called the conjugates of a over Q.
Lemma 6.6.2
If a E C is an algebraic number, then its conjugates aI, ... ,an over Q are
exactly the zeros ofan irreducible integral polynomial
qa(x) = bnxn + + blx + bo E Z[x),
with n :::: I,b n > 0 andgcd (b o, ,bn) = 1. Further, n = degree ofPa(x),
The polynomial qa(x) is called the entire minimal polynomial of a
overQ.
Definition 6.6.1
A complex number a E C is an algebraic integer if there exists a monic
integral polynomial with a as a root. That is, there exists f(x) E Z[x) with
ftx) = x n + bn_Ixn- 1 + ... + b o, bi E Z, n :::: I, andfta) = O.
Lemma 6.6.3
If a E C is an algebraic integer, then all its conjugates aI, ... ,an are also
algebraic integers.
Proof
Let f(x) E Z[x) be a monic polynomial with ft a) = O. Since Pa(x) = Pa;(x),
for i = 1, ... , n we have Pa,(x)lftx) for i = I, ... , n. Hence f(aD = 0 for
i = 1, ... / n. •
Corollary 6.6.1
If a E C is an algebraic integer, then its entire minimal polynomial is monic.
Proof
Letftx) E Z[x] be a monic polynomial withf(a) = O. Then also f(ai) = 0
.
for all the conjugates. Then there is a primitive integral polynomial hex) E
Z[x] and an r E Q with rqa(x)h(x) = f(x). It follows easily that r = ±I
since qa(x) and hex) are primitive and f(x) is monic. Hence qa(x) is also
~~.
Lemma 6.6.4
Suppose a,f3 E ([; are algebraic integers. Then so are a ± 13 and af3.
Proof
Let al = a, ... , an be the conjugates of a and f31 = 13,· .. , 13m the
conjugates of 13. Let
fCx) = n
n m
n(x - (ai
i=l j=l
+ f3j)) = x n+m + dn+m_IXn+m-1 + ... + do.
The coefficients dk are symmetric functions in ai, f3j, and therefore from
the remarks above we have dk E Z. (Notice the similarity to the argument
that we used in Lemma 6.5.4). Therefore, a + 13 is an algebraic integer.
We treat a - 13 and af3 analogously. •
Corollary 6.6.2
The set ofalgebraic integers forms a subring ofC Further, the field ofalgebraic
numbers is the quotient field of the ring ofalgebraic integers.
Theorem 6.6.1
e is a transcendental number, that is transcendental, over Q.
Proof
Let fcx) E lR[x] with the degree of fcx) = m ~ 1. Let Zl E ([;, Zl =I- 0, and
y: [0,1] -+ ([;, y(t) = tZI. Let
(l Z'
) y c'-ZfCz)dz = -fCZI) + ClfCO) +
Z
(l
) y c,-zf'(z)dz.
Lemma 6.6.5
Suppose a E C is an algebraic number and f(x) = anx n + ... + ao,n ~
I,a n i= 0, and all ai E Z ([(x) E Z[xD with f(a) = o. Then ana is an
algebraic integer.
Proof
Theorem 6.6.2
7r is a transcendental number, that is, transcendental over Q.
Proof
Assume that 7r is an algebraic number. Then 8 = i7r is also algebraic. Let
81 = 8,82 , ... ,8d be the conjugates of 8. Suppose
p(x) = qo + qlX + ... + qdxd E Z[x], qd > 0, and gcd(qo, ... ,qd) =1
is the entire minimal polynomial of 8 over Q. Then 81 = 8, 82 , ... , 8d are
the zeros of this polynomial. Let t = qd. Then from Lemma 6.6.5, Wi is an
algebraic integer for all i. From ei1r + 1 = 0 and from 81 = i7r we get that
(1 +e(1
)(1 + e(2 ) ••• (1 + e9d ) = o.
The product on the left side can be written as a sum of Zd terms e"',
where 4> = lO 18l + ... + lOd8d, lOj = 0 or 1. Let n be the number of terms
lO 1 8l + ... + lOd8d that are nonzero. Call these aI, •.. , an. We then have an
equation
(4)q + e(Xj + ... + e(Xn =0
with q = Zd - n > O. Recall that all tai, are algebraic integers and we
consider the polynomial
[(x) = tnPxP-l(x - alt . .. (x - ant
with p a sufficiently large prime integer. We have f(x) E lR[x], since the
ai are algebraic numbers and the elementary symmetric polynomials in
aI, ... , an are rational numbers.
Let l(Zl) be defined as in the proof of Theorem 6.6.1, and now let
that is,
IJI cp - l
1 ::: < c---
(p - I)! - (p - I)!
cp - I
This as before gives a contradiction, since (P-l)! -+ 0 as p -+ 00.
Therefore, IT is transcendental. •
axil ...
I
that differs from zero is in fact positive. The highest piece of a polynomial
flXII ... ,xn ) is denoted by HG(f).
Lemma 6.7.1
For f(XI I • • • ,xn),g(XI, ... ,xn ) E R[XI I ••• ,xn ] we have HG(fg) = HG(f)
HG(g).
100 6. Fields and Field Extensions
Proof
We use an induction on n, the number ofindeterminates. It is clearly true
for n = I, and now assume that the statement holds for all polynomials
in k indeterminates with k < nand n 2: 2. Order the polynomials via
exponents on the first indeterminate Xl so that
!tXI, ... , xn) = x~if>r(X2, ... , xn) + x~-lif>r_I(Xz, ... , xn)
+ ... + if>o(xz, ... , xn),
g(XI, ... ,xn) = xt1/ls(X2, ... , xn) + xt- 11/ls-I(XZ, ... , xn)
+ ... + 1/IO(X2, ... , xn).
Then HG(fg) = x~+sHG(if>r1/ls). By the inductive hypothesis HG(if>r1/ls) =
HG(if>r)HG(1/Is). Hence
HG(fg) = x~+sHG(if>r)HG(1/Is) = (x~HG(if>r))(xtHG(1/Is)) = HG(f)HG(g) .•
Sl = Xl + Xz + ... + Xn,
Sn = Xl·· .Xn·
These were found by forming the polynomial p(x, XI, . .. , xn) = (x -
Xl) . .. (X - xn). The ith elementary symmetric polynomial Si in XI, ... ,Xn
is then (-I)i ai , where ai is the coefficient ofxn- i inp(x, Xl, . .. , xn).
In general,
Sk =
il
L
<iz < ... <ik,1::;k::;n
XiJXi z •.• Xik'
Lemma 6.7.2
In the highestpiece ax~J ... x~n,a :j:. 0, ofa symmetric polynomial s(Xl, ... ,Xn)
we have kl 2: kz 2: ... 2: kn·
6.7. The Fundamental Theorem of Symmetric Polynomials 101
Proof
Assume that ki < kj for some i < j. As a symmetric polynomial,
S(Xl, ... ,Xn) a1so must th en contam . th ·
e pIece ax kl ~ k, kn
I ... Xl ... Xi ... Xn ,
· h·IS h·Igh er th an axk, ••• Xik; ... X kj ... X kn, gIvmg
w h IC ·· a contrad ··.
Ichon.
I j n
Proof
From the definition of the elementary symmetric polynomials we have
that
HG(sD = (XIX2 ... xkf, 1 ~ k ~ n, t ::: l.
Theorem 6.7.1
Lets(xI, ... ,Xn) E R[XI, ... ,Xn]be a symmetric polynomial. Thens(xI, ...,xn)
can be uniquely expressed as a polynomial [(SI, ... ,sn) in the elementary
symmetric polynomials SI, ... ,Sn with coefficients from R.
Proof
We prove the existence of the polynomial f by induction on the size of
the highest pieces. If in the highest piece of a symmetric polynomial all
exponents are zero, then it is constant. that is, an element of R and there
is nothing to prove.
Now we assume that each symmetric polynomial with highest piece
smaller than that of s( Xl, ... ,xn ) can be written as a polynomial in the
elementary symmetric polynomials. Let ax~1 ... x~n, a =j:. 0, be the highest
piece of S(XI, ... , xn). Let
kl-kz kn_1-kn kn
(
tXI,···,Xn) =SXI,···,Xn
( ) -as l ... sn-I sn·
Clearly, t(XI, ... , xn) is another symmetric polynomial, and from Lemma
6.7.3 the highest piece of t(XI • ... ,xn) is smaller than that of S(XI, ... ,xn).
Therefore, t(XI, ... , xn) and hence s( Xl, ... ,Xn) = t(XI, ... , xn)+as~,-kz ...
k can b
kn_1-knSnn
sn-I ·
e wntten as a pol · 1 In
ynomla . SI, ... , Sn.
1b prove the uniqueness of this expression assume that S(XI, , xn) =
f(SI, ,Sn) g(SI, ... ,Sn). Then f(SI, ... ,Sn) - g(SI, ,Sn)
h(SI, ,sn) = ¢(Xl, ... , xn) is the zero polynomial in Xl • ... , xn. Hence,
if we write h(SI, ... , sn) as a sum of products of powers of the SI, ... , Sn,
102 6. Fields and Field Extensions
Exercises
6.1. Let p(x) = XZ + x + 1.
(a) Show that p(x) is irreducible over Q.
(b) Let abe a root ofp(x). Letg = 2 + a, h = I - a E Q(a). Findg + h,gh,
and h- I .
(c) Solve the linear equation 2g + h = 5 in the field Q(a).
6.2. Assume that a, 13 are both algebraic over F and suppose irr(a, F) = irr(f3, F).
Verify that the map given in the proof of Lemma 6.1.4 is indeed an F-
isomorphism.
6.3. Let R be a commutative ring and r E R. Let (r) = {rl r; rl E R}. Verify that
(r) is an ideal.
6.4. An ideal I is a prime ideal if I '" R and whenever rl rz E I either rl E I or
rz E I.
(a) Show that mZ is a prime ideal in Z if and only if m is a prime.
(b) For any commutative ring R with identity show that if I is a prime ideal,
then the factor ring RII is an integral domain.
6.5. If F is a field, show that the only ideals in F are the whole field and (0).
6.6. Let I C R be an ideal and r E R, r rf. I. Verify that (r, 1) = {rlr + i l ; rl E
R, i l E I} is an ideal.
6.7. Let FeE c F'. If {al, ... , an} is a basis for E over F and {f3l,"" Pm} is a
basis for F' over E, then the mn products {aif3j} are a basis for F' over F.
Hint: 1b show that {aif3j} is a basis for F' over F we must show that they
(l) span F' over F and (2) that they are independent.
1b show (I): Let g E F'. Then g = elf31 + ... + emf3m with ei E E, since
{f3l,"" 13m} is a basis of F' over E. Now use linear expansion for each ei in
terms of {ai, ... ,an} and collect terms.
1b show (2): Suppose ftalf31 + ..... + fmnanf3m = O. Combine terms
in each f3i first and then use the independence of the {f3;}. Then use the
independence of the {aj} in each coefficient.
6.8. Let K be an algebraic extension of E and E an algebraic extension of F. Then
K is an algebraic extension of F.
6.9. Let F' be a finite extension of F . Let a E F' with n = deg irr(a, F). Then n
divides IF' : n
6.10. Let FeD, where F is a field and D is an integral domain with the same mul-
tiplicative identity. D is then a vector space over F. If D is finite-dimensional
over F then D is a field.
Exercises 103
Definition 7.2.1
If R is an algebraic structure then an automorphism of R is an isomor-
phism a : R -+ R. We let Aut(R) denote the set of automorphisms on
R.
EXAMPLE 7.2.1
Let G be a cyclic group of order n. Suppose g is a generator so that G =
{I, g, gZ, ... , gn-l}. Recall that if (k, n) = I, then gk is also a generator, so
the mapping
a:g-+l
will define an automorphism of Gby making this map a homomorphism.
Further, for any automorphism a the generator g is mapped to gl where
gl is another generator. Therefore,
Aut(G) = {a; a : g -+ l with (n, k) = l.}
It follows that IAut(G)I = <p(n), the number of positive integers less than
n and relatively prime to n. 0
EXAMPLE 7.2.2
Consider the complex numbers C. Let a be an automorphism of C. For
all such automorphisms it follows that a(O) = 0, a(I) = I, a( -1) = -1.
Therefore, a(i z) = (a(i))z = -1 and hence a(i) = ±i.
7.2. Some Results From Finite Group Theory 107
Since I, i form a basis for Cover JR, the images ofl, i will completely de-
termine an automorphism. Hence there are precisely two automorphisms
of C, namely those given by
al : 1 -+ I, i -+ i, the identity automorphism
1 -+ I, i -+ -i.
a2 :
o
EXAMPLE 7.2.3
Consider the finite fieldZp , wherep isa prime. ThenZp = {O, I, ... ,p-l}
with the arithmetic operations done modulo p. Let a be an automorphism.
Since a(l) = I, it follows thata(n) = n for all integer multiples of1. Hence
a(x) = x for all x E Zp. Therefore, the only automorphism on Zp, is the
identity and the automorphism group is trivial. 0
Theorem 7.2.1
For any algebraic structure R the set Aut(R) forms a group under composition
called the automorphism group of R. If 8 is any substructure of R, then
those automorphisms a E Aut(R) that fix 8, that is, a(s) = s for all s E 8
form a subgroup.
EXAMPLE 7.2.4
Let p be a prime and G a cyclic group of order p. We show that Aut(G) is
also cyclic and of order p - 1.
Let Zp be the finite field of order p as in the last example. Its additive
group is cyclic of order p and so isomorphic to G under the isomorphism
1 -+ g, whereg is a generator ofG. Since Zp isa field, its nonzero elements
form a group under multiplication. We will call this group V.
From Example 7.2.1, any automorphism of G is determined by the
image of a generator
a : g -+ l, where (k,p) = 1.
In terms of the isomorphism with the additive group of Zp this is then
given by
a : 1 -+ k, where k =I- O.
This automorphism then behaves just like multiplication by k in Zp, and
so Aut(G) is isomorphic to the multiplicative group V ofZp . We show that
V is cyclic.
Now, IVI = p -I, so if-I = 1 for all y E U. Let m be the maximal order
of any element in V, so m ~ p - 1. Ify E V has order q, then qlm (see the
exercises). Hence, if yq = I, then ym = 1. It follows that ym = 1 for all
y E U. Therefore, each y E V is a root of the polynomial P(x) = x m - 1.
This is a polynomial over a field, so it can have at most m roots. However,
it has at least p - 1 roots so m :::: p - 1. Therefore, m = p - I, and V has
an element of order p - 1 and therefore is cyclic. 0
108 7. Galois Theory
Theorem 7.2.2
(Lagrange's Theorem) Let G be a finite group and H be a subgroup. Then
IGI = IG:HIIHI· In particular, both the order of a subgroup and the index of
a subgroup divide the order of a finite group.
One interesting historical note. Lagrange died about twenty years be-
fore the word "group" was ever used. What Lagrange actually proved was
a result, equivalent to what is now called Lagrange's theorem, on closed
sets of permutations.
EXAMPLE 7.2.5
We show that every finite group of prime order is cyclic.
Let p be a prime and G a finite group of order p. Let x E G, x #- 1,
and let H = < x > the cyclic subgroup generated by x. From Lagrange's
theorem, the order ofH divides the order of G. Hence IHI = 1 or p, since
p is a prime. If IHI = 1, then x = I, contradicting that x#- 1. Therefore,
IHI = p and H = G and G is cyclic.
The above argument also shows that a finite group of prime order can
have no nontrivial proper subgroups. 0
Definition 7.2.2
If G is a group, H eGa subgroup, and g E G, then g-lHg forms a
subgroup, called a conjugate of H. g normalizes H if g-lHg = H. The
set of elements that normalize H is called the normalizer of H in G
denoted by NG(H). H is a normal subgroup denoted by H <l G, ifevery
g E G normalizes H, that is, g-lHg = H for all g E G.
Lemma 7.2.1
The following are equivalent:
(l)H<JG.
(2) The only conjugate of H in G is H.
(3) Each left coset of H is also a right coset.
(4) NG(H) = G.
Lemma 7.2.2
Let He G be a subgroup. Then:
(1) Any conjugate of H is isomorphic to H.
(2) NG(H) is a subgroup of G and H <J NG(H).
(3) IG:NG(H)I is the number ofdistinct conjugates of H in G.
EXAMPLE 7.2.6
In an abelian group G every subgroup H is normal, since g-l Hg =
H(g-lg) = H.
At the other end of the spectrum are simple groups. These are groups
that have no proper nontrivial normal subgroups. For example, a cyclic
group ofprime order is simple, since it has no proper nontrivial subgroups
whatsoever.
Simple groups are really the building blocks for finite groups, and there
is a vast theory of finite simple groups. Much of the work has gone into
the proof ofwhat is called the classification theorem for finite simple
groups. This theorem gives a complete "listing" of all the finite simple
groups. It turns out that any finite simple group is either a member ofone
of a finite number of infinite families ofgroups that are well described or
is one ofa finite number of isolated examples called sporadic groups. The
proof of this theorem, at this point in time, takes over twelve thousand
published pages. 0
EXAMPLE 7.2.7
Suppose IG : HI = 2. We show that H <J G.
Let g e H, then there are two left cosets namely H 1H andgH.
These partition G, so that
G=HUgH
with this being a disjoint union. A similar statement is true for right cosets,
so that
G =HUHg.
Since these are disjoint unions, it follows that gH = Hg, or g-lHg = H.
As a specific example of this, consider the symmetric group on three
symbols 83. In Example 6.4.1 we saw that /831 = 6 and that there is an
element of order 3. By Lagrange's theorem the cyclic subgroup generated
110 7. Galois Theory
by this element has index two and is thus normal. This subgroup is called
A3 - the alternating group on three symbols. In a similar way, for each
n, there is a subgroup of Sn of index two (not cyclic for n > 3 however)
called An the alternating group on n symbols. An consists of the even
permutations (see Exercise 7.18). 0
Lemma 7.2.3
If H<I G, then G/H forms a group called the factor group, or quotient
group, of G modulo H.
Definition 7.2.3
If f : G """* H is a homomorphism, then the kernel of f, denoted by ker
f is the set (g E G; f(g) = I}. The image off, denoted by 1m f is the set
(h E H; f(g) = h for some g E G}.
Theorem 7.2.3
(First Isomorphism Theorem) (1) If fG """* H, is a homomorphism then
ker f <I G, 1m f is a subgroup of H, and
G/kerf ~ Imf
(Note: Iff is onto, then G/ker f ~ H, and H is called a homomorphic
image ofG).
(2) If H <I G, then there exists a homomorphism fG """* G/H such that ker
f = H andImf= G/H.
Definition 7.2.4
If P is a prime, then a p-group is a group G where every element has
order a power ofp. If G is finite, this implies that IGI = pn for some n.
7.2. Some Results From Finite Group Theory 111
Lemma 7.2.4
If G is a finite p-group of order pYl, then G has a subgroup of order pYl-l and
hence of index p.
Corollary 7.2.1
If G is a finite p-group, then for each divisor of IGj there is a subgroup ofG of
that order.
We also need the Sylow theorem which provides a partial converse for
Lagrange's theorem.
Definition 7.2.5
Suppose G is a finite group with IGI = pma , with p a prime and with
(p, a) = 1. Then a p-Sylow subgroup is a subgroup of order pm (a
maximal p-subgroup of G).
Theorem 7.2.4
(Sylow Theorem) Let G be a finite group oforderpma with p a prime and with
(p,a) = 1. Then:
(1) G has a p-Sylow subgroup.
(2) All p-Sylow subgroups in G are conjugate.
(3) Any p-subgroup of G is contained in a p-Sylow subgroup.
(4) The number ofp-Sylow subgroups is congruent to 1 modulo p and divides
IGI and hence divides a.
EXAMPLE 7.2.8
We show that no group of order 12 is simple.
As a consequence of part (2) of Theorem 7.2.4, if there is only one
p-Sylow subgroup in G it must be normal. Now, IGI = 12 = 22 3. We
show that there is either only one 3-Sylow subgroup or only one 2-Sylow
subgroup.
The number of 3-Sylow subgroups is of the form 1 + 3k and divides
4, so there exists either one or four 3-Sylow subgroups. If there is only
one it is normal so suppose that there are four. Each of these has order 3,
and therefore they intersect trivially. It follows that these four subgroups
cover eight elements in G, not including the identity.
Each 2-Sylow subgroup has four elements. There is only trivial inter-
section between the elements in the 2-Sylow subgroups and those in the
112 7. Galois Theory
3-Sylow subgroups. If there were more than one 2-Sylow subgroup there
would be more than three additonal elements outside the eight within
the 3-Sylow subgroups. This is impossible since G contains only twelve
elements. Therefore, there must be only one 2-Sylow subgroup and hence
it must be normal. 0
Definition 7.3.1
K is a normal extension of a ground field F if K is a splitting field over
F.
Several facts about normal extensions are crucial for us. These are
given in the next theorem.
Theorem 7.3.1
Suppose K is a normal extension of F and suppose FeE eKe F, where
F is an algebraic closure of F. Then:
(1) Any automorphism ofF leaving F fixed maps K onto itselfand is thus an
automorphism of K leaving F fixed. Thus any isomorphism of K within
F leaving F fixed is actually an automorphism.
(2) Every irreducible polynomial in F[x] having a root in K splits in K.
(3) K is a normal extension of E.
Definition 7.3.2
If a is a root off(x) then a has multiplicity m ::: 1 iff(x) = (x - a)mg(x),
where g(a) =f O. If m = I, then a is a simple root otherwise it is a
multiple root.
Now, suppose K is a finite extension of F and a E K. Then a is sepa-
rable over F if a is a simple root of irr(a, F). K is a separable extension
if every a E K is separable over F.
Definition 7.3.3
A field F has characteristic n if n is the least positive integer such that
(n)(l) = 0 in F. We denote this by char F = n. Ifno such n exists, we say
that F has characteristic zero denoted by char F = O.
EXAMPLE 7.3.1
char Q = char IR = char C = 0, and thus any extension of these has
characteristic zero.
On the other hand, char Zp = p, and thus any extension of Zp also has
characteristic p. 0
Lemma 7.3.1
The characteristic ofa field is zero or a prime.
Proof
Suppose char F = n =1= O. Ifn is composite then n = mk, with m < nand
k < n. Then (n)(l) = (mk)(l) = ((m)(I))((k)(I)) = O. A field has no zero
divisors, so either (m)(l) = 0 or (k)(I) = 0 contradicting the minimality
ofn. •
Lemma 7.3.2
Ifchar F = 0, then F contains a subfield isomorphic to Q. Ifchar F = p, then
F contains a subfield isomorphic to Zp. In particular, a field of characteristic
zero must be infinite.
Proof
We show the prime case; the zero characteristic case is similar. Sup-
pose char F = p. Since F is a field, it has an identity 1. Let E =
{(n)(I); n E Z}. Since char F = p, E = {O, I, (2)(1), ... , (p - I)(I)}. It
is then straightforward to check that the map (k)(l) ~ k from E to Zp is
an isomorphism. •
Theorem 7.3.2
Any extension ofa field ofcharacteristic zero must be a separable extension.
114 7. Galois Theory
111eorenn 7.3.3
If K is a finite separable extension of F, then the number of automorphisms
of K fixing F is finite and equal to the degree IK:FI.
111eorenn 7.3.4
(Primitive Element Theorem) If K is a finite separable extension ofF, then K
is a simple extension. That is, K = F(a) for some a E K.
Proof
Since K is a finite extension, K = F(aI, ... ,an) for some elements
aI, ... ,an E K. By induction, it is enough to show that if K = F(a, fJ),
then K = F(y) for some y E K.
Let n = IK : Fl. From Theorem 7.3.3 there are then n automorphisms
aI, ... , an of K fixing F. Form the polynomial
p(x) = n(ai(a) + xai(fJ) - aj(a) - xaj(fJ)).
i#i
This is not the zero polynomial, so there exists aCE K with p(c) =j:. O.
Then the elements ai(a + cfJ), i = 1, ... , n, are distinct. It follows that
F(a + cfJ) has degree at least n over F. ButF(a + cfJ) C F(a, fJ), andF(a, fJ)
has degree n over F. Therefore, F(a, fJ) = F(a + cfJ).
If K = F(a), a is called a prinnitive elennent of Kover F. •
Definition 7.3.4
A Galois extension of F is a finite separable normal extension, that is, a
finite separable splitting field over F.
Definition 7.4.1
Let K be a Galois extension of F. Then the group of automorphisms of K
that fix F is called the Galois group of Kover F, denoted by Gal(KlF).
If H is a subgroup of Gal(KlF), we let K H denote the elements of K fixed
byH.
Lemma 7.4.1
IGal(KIF) I = IK:FI.
Lemma 7.4.2
Suppose FeE c K with K Galois over F. Then:
(1) K is Galois over E and Gal(KIE) is a subgroup of Gal(KIF).
(2) IfH isa subgroup ofGal(KIF), thenE = K H isanintennediatefieldand
Gal(KIE) = H.
116 7. Galois Theory
Proof
Part (1) is clear. For part (2) we must show three things: that K H is a field,
that Fe K H , and that Gal(K/KH ) = H.
Since H C Gal(K/F), every element of F is fixed by each element of
H. Therefore, Fe K H and hence K H is not empty. 1b show that it forms
a field we must show closure under the field operations.
Suppose k1, k 2 E K H and a E H. Then a(k1) = k1, a(k2 ) = k 2 and so
a(k1 ± k 2 ) = a(k1) ± a(k2 ) = k1 ± k 2 . Therefore, k1 ± k 2 E K H • Similarly,
a(klk~l) = a(kI)(a(k2))±1 = klk~l, for k 2 =I O. Hence klk~l E K H for
k2 =I 0 and therefore, K H is an intermediate field.
Finally, if E = K H , then Gal(K/E) consists of those automorphisms of
K that fix E. But by definition E consists of those elements of K fixed by
elements of H. Therefore, Gal(K/E) = H. •
Coro1lary 7.4.1
The map .:H -+ K H from subgroups H ofGal(K/F) to intermediate fields is
a bijection.
Lemma 7.4.3
Suppose FeE c K with K Galois over F and suppose E = K H . Then E is
Galois over F if and only if H <I Gal(K/F). In this case
Gal(E/F) ~ G/H ~ Gal(K/F)/Gal(K/E).
The proofs of all these lemmas can be done directly but can also be
done somewhat more easily by using the following theorem of Artin.
Theorem 7.4.1
(Artin) Let K be a field and G a finite group of automorphisms of K with
IGI = n. Then K is a finite Galois extension of F = KG and its Galois group
is G.
Proof
A separable polynomial is one with no multiple roots. If a is a root of
a separable polynomial f(x), then irr(a, F) divides f(x), so a is separable
over F.
SupposeF = KG and a E K. LetgI, ... ,gr be a maximal set ofelements
such thatgl(a), ... , gr(a) are all distinct. Ifg E G, then (ggl(a), ... , ggr(a)}
7.4. AutomorphisJD8 and the Galois Group 117
The results we have just outlined are the main results in Galois theory.
In the next section we summarize them and give examples. For now,
we examine the Galois group of a polynomial. We suppose here and in
the remainder of the section that F has characteristic zero, so that all
extensions are separable.
Definition 7.4.2
LetfCx) be an irreducible polynomial over F and letKbe the splitting field
for I(x). Then K is Galois over F, and the corresponding Galois group is
called the Galois group of the polynomial.
Lemma 7.4.4
If K is Galois over F with IK:FI = n, then Gal(K/F) is a subgroup of the
symmetric group Sn.
118 7. Galois Theory
map aI, , a r onto conjugates will lead to automorphisms and will give
the elements of the Galois group. We illustrate this with an example.
EXAMPLE 7.4.1
Consider the splitting field of x4 + lover Q. In C this factors as (x z +
~)(xz - i), so this polynomial has four roots in C namely
1 +i _ r: 1- i
-Ii - v~, Wz= - - =WI
-Ii '
-1 +i -1 - i
= iWl, = W3·
-Ii
Notice that WI is a primitive eighth root of unity, and wi = W3, w~ =
W4, wr = Wz, w~ = l.
Therefore, WI is a primitive element of Kover Q, irr( WI, Q) = x4 + I,
and K = Q(wI). It follows that IK : QI = 4. Let us examine the Galois
group.
Since WI is a primitive root an automorphism (1 is completely deter-
mined by its action on WI. There are exactly four elements in the Galois
group and four possible conjugates of WI. Mapping WI onto each will
determine the four different automorphisms.
Suppose (11 (WI) = WI. Then (11 fixes wi, w~, wr, and thus (11 fixes K and
is therefore the identity.
Suppose (1Z(Wl) = Wz = wr.
Then (1z(wz) = (1z(wr) = wt
9
= WI. Simi-
larly, (1Z(W3) = W4, (1Z(W4) = W3. Therefore, the automorphism (1z is given
by the permutation
The same type ofanalysis gives that the two remaining automorphisms
are given by the permutations
It is easy to see that this group is then abelian and equal to Zz x Zz, the
direct product oftwo finite cyclic groups of order 2. This group is usually
referred to as the Klein 4-group.
7.5. The Fundamental Theorem of Galois Theory 119
Theorem 7.5.1
(Fundamental Theorem of Galois Theory) Let K be a Galois extension of F
with Galois group G = Gal(K/F). For each intermediate field E let 'r(E) be
the subgroup of G fixing E. Then:
(7) The lattice ofsubfields ofK containing F is the inverted lattice ofsubgroups
of Gal(K/F).
120 7. Galois Theory
Parts (1) through (6) have already been discussed. We say a little more
about part (7). By the lattice ofsubfields ofKover F we mean the complete
collection ofintermediate fields and their interrelationships. Similarly for
the lattice of subgroups. One is the inverted lattice of the other, since in
the field case ifE is a subfield, the degree IK : EI is the order of Gal(KIE),
while its index is the degree of IE : Fl. Therefore, the subfields are from
above in the lattice while the subgroups are from below. That is, we have
the following diagram, where Gal(K/K) = {I}, the trivial group.
K ~ Gal (KIK)
U n
E ~ Gal (KIE)
U n
F ~ Gal (KIF)
We then illustrate this with three examples.
EXAMPLE 7.5.1
Consider the splitting fieldK ofx4 + 1 overQ. As we saw in example 7.4.1,
K = Q(i, ./2). Then IK : QI = 4, and the Galois group is Il z X Il z. There
are then four automorphisms in Gal(K/Q), given by:
1 : i --+ i, ./2 --+ ./2,
a : i --+ i, ./2 --+ -./2,
r : i --+ -i,./2 --+ ./2,
at' : i --+ -i,./2 --+ -./2.
Each of these has order 2, and therefore there are five total subgroups
of G = Gal(K/Q), namely,
{I}, HI = {I, r}, Hz = {I, ar}, H3 = {I, al, G.
We exhibit the five intermediate fields. The fixed field of G, is precisely
Q while the fixed field of {Il is all ofK. Now, consider H3. Since i E K and
Q(l) C K, IK : Q(i) I = 2, so Q(i) will correspond to a subgroup of index
2 and thus order 2. Now, a fixes i but not ./2. Therefore, a fixes Q(i) but
not all of K, so the fixed field of H3 is Q(l)'
Similarly, Q(./2) c K, and then Q(./2) is the fixed field of HI.
Finally, since i,,J2 E K, it follows that i,J2 E K, so Q(i,J2) C K. Since
irr(i./2, Q) = X Z + 2, IQ(i./2) : QI = 2. The automorphism ar fixes i./2;
hence Q(i,J2) is the fixed field of Hz. We illustrate these relationships in
Figures 7.1 and 7.2. 0
7.5. The Fundamental Theorem of Galois Theory 121
Lattice of Subfields
Figure 7.1.
/I~
H. 1f2 1f3
~I/ {l}
Lattice of Subgroups
Figure 7.2.
EXAMPLE 7.5.2
Let K = Q(,J2, ..;3). This is the splitting field of (xl - 2)(xl - 3) over
Q, so K is Galois over Q. Since IQ(,J2) : QI = 2 and..;3 e Q(,J2), we
have IK : QI = 4. The Galois group can then be descnbed by the four
automorphisms
I :..ti ~ ..ti,.J3 ~ .J3,
a : ../2 ~ ..ti,.J3 ~ -.J3,
i : ../2 ~ -../2, .J3 ~ .J3,
ai : ..ti ~ -..ti,.J3 ~ -.J3.
It is easy to see that this group is Z2 x Z2 and is isomorphic to the Galois
group of the previous example. Thus different field extensions of the
same ground field can have isomorphic Galois groups. The corresponding
subgroups and fixed fields are given below.
Ho = {I} =} fixed field is K = Q(..ti, .J3);
HI = {I, a} =} fixed field is Q( ../2);
Theorem 7.6.1
(Fundamental Theorem of Algebra) The complex number field C is alge-
braically closed. That is, any nonconstant complex polynomial has a root
in C.
Proof
If f(x) E C[x), we can form the splitting field K for f(x) over e. This will
be a Galois extension of C and thus a Galois extension of lR, since C is a
finite extension of R The Fundamental Theorem of Algebra asserts that
K must be C itself, and hence the Fundamental Theorem of Algebra is
equivalent to the fact that any nontrivial Galois extension of C must be
C.
Let K be any finite extension of lR with IK : lRl = 2m q, (2, q) = 1. If
m = 0, then K is an odd-degree extension of lR. Since K is separable over
lR, it is a simple extension, and hence K = lR(a), where irr(a,lR) has odd
degree. However, odd-degree real polynomials always have a real root,
and therefore irr(a,lR) is irreducible only if its degree is one. But then
a E lR and K = R Therefore, if K is a nontrivial extension oflR, m > O.
This shows more generally that there are no odd-degree finite extensions
ofR
Suppose that K is a degree 2 extension ofe. Then K = cea) with deg
irr(a, q = 2. But complex quadratic polynomials always have roots in C
so a contradiction. Therefore, C has no degree 2 extensions.
Now, let K be a Galois extension of e. Then K is also Galois over lR.
Suppose IK : lRl = 2m q, (2, q) = 1. From the argument above we must
have m > O. Let G = Gal(K/lR) be the Galois group. Then IGI = 2m q, m >
0, (2, q) = 1. Thus G has a 2-Sylow subgroup of order 2m and index q. This
would correspond to an intermediate field E with IK : EI = 2m and IE :
lRl = q. However, then E is an odd-degree finite extension ofR It follows
that q = 1 and E = lR. Therefore, IK : lRl = 2m and IGI = 2m .
124 7. Galois Theory
Theorem 7.6.2
Let K be an ordered field in which all positive elements have squareroots.
Suppose further that each odd-degree polynomial in K[x] has a root in K.
Then K(i) is algebraically closed, where i = P is a root of the irreducible
polynomial x2 + 1 E K[x],
Definition 7.7.1
G is a solvable group if it has a finite series of subgroups
with Gi+I <I Gi and G/Gi+I abelian. Such a series is called a normal series
for G, and the terms G/Gi+I are called the factors of the series. The
definition can then be put concisely as: A group G is solvable if it has a
normal series with abelian factors.
Definition 7.7.2
K is an extension of F by radicals ifthere exist elements aI, ... ,a y E K
and integers nI, ... , n y such that K = F(aI, ... , a y ), with a~1 E F, and
a~i E F(aI, ... ,ai-I), fori = 2, ... , r.Apolynomialf(x) E F[x] is solvable
by radicals over F if the splitting field K of f( x) over F is an extension
by radicals.
The key result tying these two concepts - solvability of groups and
solvability by radicals - is the following.
Theorem 7.7.1
Suppose K is a Galois extension ofF with char F = O. Then ifK is an extension
of F by radicals, Gal(K/F) is a solvable group.
Theorem 7.7.2
For any n ::: 5 the symmetric group Sn is not a solvable group.
126 7. Galois Theory
Theorem 7.7.3
Let Yl, ... ,Yn be independent transcendental elements over Q. Then the poly-
nomialf(x) = n~=1 (x-y;) is not solvable by radicalsoverF = Q(SI, ... ,sn),
where SI, ... ,Sn are the elementary symmetric polynomials in Yl, ... ,Yn.
Definition 7.7.3
Suppose we are given a line segment of unit length. An a E lR is con-
structible if we can construct a line segment of length lal in a finite
number of steps from the unit segment using a ruler and compass.
Theorem 7.7.4
The set C ofall constructible numbers forms a subfield ofR Further, Q c c.
Proof
Let C be the set of all constructible numbers. Since the given unit length
segment is constructible, we have 1 E C. Therefore, C =1= 13, and thus
7.7. Some Additional Applications of Galois Theory 127
to show that it is a field we must show that it is closed under the field
operations.
Suppose a, f3 are constructible. We must show then that a ± f3, af3, and
a/f3 for f3 I- 0 are constructible. If a, f3 > 0, construct a line segment of
length lal. At one end ofthis line segment extend it by a segment oflength
1f31. This will construct a segment oflength a + f3. Similarly, if a > f3, lay
offa segment oflength 1f31 at the beginning ofa segment oflength lal. The
remaining piece will be a - f3. By considering cases we can do this in the
same manner if either a or f3 or both are negative. These constructions
are pictured in Figure 7.3. Therefore, a ± f3 are constructible.
------+---_. ...----------
a
a+b
b
b
a
a-b
In Figure 7.4 we show how to construct af3. Let the line segment OA
have length lal. Consider a line L through 0 not coincident with OA. Let
OB have length 1f31 as in the diagram. Let P be on ray OB so that OP has
length 1. Draw AP and then find Q on ray OA such that BQ is parallel to
AP. From similar triangles we then have
o A Q
lal
Hence a/ f3 is constructible.
o Q A
lal
Theorem 7.7.5
If Y is constructible with y IE Q, then there exists a finite number of elements
aI, ... ,a y E IR with a y = y such that for i = 1, ... ,r,Q(aI' ... ,aj) is a
7.7. Some Additional Applications of Galois Theory 129
EXAMPLE 7.7.1
It is impossible to double the cube. This means that it is impossible,
given a cube ofgiven side length, to construct using a ruler and compass,
a side of a cube having double the volume of the original cube.
Let the given side length be I, so that the original volume is also 1.
Th double this we would have to construct a side oflength ZI/3. However
IQ(ZI/3) : QI = 3 since irr(Z1/3, Q) = x 3 - 2. This is not a power of Z so
21/3 is not constructible. 0
EXAMPLE 7.7.2
It is impossible to trisect an angle. This means that it is impossible in
general to trisect a given angle using only a ruler and compass.
An angle 0 is constructible if and only if a segment of length I cos 01 is
constructible. Since cos(1l'/3) = 1/2, 1l'/3 is constructible. We show that it
cannot be trisected by ruler and compass.
The following trigonometric identity can be proved
cos(30) = 4 cos3 (0) - 3 cos(O).
Let a = cos(Jr/9). From the above identity we have 4a 3 - 3a - ~ = o.
The polynomial 4x3 - 3x - ~ is irreducible over Q so irr(a, Q) = x 3 -
~x - ~. It follows that IQ(a) : QI = 3, and hence a is not constructible.
Therefore, the corresponding angle 1l'/9 is not constructible. Therefore,
1l'/3 is constructible, but it cannot be trisected. 0
EXAMPLE 7.7.3
It is impossible to square the circle. That is it is impossible in general,
given a circle, to construct using ruler and compass a square having area
equal to that of the given circle.
Suppose the given circle has radius 1. It is then constructible and would
have an area of1l'. A corresponding square would then have to have a side
of length -/ii. But 1l' is transcendental, so -/ii is not constructible. 0
The other great construction problem solved by Galois theory was the
construction of regular n-gons. A regular n-gon will be constructible for
n :::: 3 ifand only ifthe angle 21l'/n is constructible, which is the case ifand
only ifcos 21l'/n is constructible. The algebraic study ofthe constructibility
130 7. Galois Theory
ofregular n-gons was initiated by Gauss in the early part ofthe nineteenth
century.
Theorem 7.8.1
If F is a field extension ofJR with IF:JRI > I, then IF:JRI = 2 and F ;: C.
Ifwe extend somewhat the definition ofa field we can obtain a further
classification of extensions of the reals. A division ring is an algebraic
structure with the same properties as a field but in which multiplication
is not necessarily commutative. That is, it is a ring with an identity where
every nonzero element has a multiplicative inverse. Thus every field is a
division ring but as we will see below there are noncommutative division
rings. A noncommutative division ring is called a skew field. IfF is a field
and FeD where D is a division ring, then D is still a vector space over
F. Further there is a multiplication in D so that each nonzero element
has a multiplicative inverse. The identity in F must be an identity for the
division ring. In this case D is called a division algebra over F. Here the
elements of F will commute with all elements in D.
We now give a method to construct a class of skew fields. Suppose F is
a field in which no sum of squares can be zero. That is, if xi + ... + x~ =
o with Xl, ... ,Xn E F then Xi = 0 for i = 1, ... , n. A field satisfying
this is called a totally real field. Consider now a vector space H F of
dimension 4 over F with basis I, i,j, k. We identify 1 with the identity
in F and then build a multiplication on H F by defining the products of
the basis elements. Let i 2 = P = k2 = -1 and ijk = -1. This will
completely define, using associativity, all products ofbasis elements. For
example, from ijk = -1 we have ij = _k- l = k since k 2 = -1. Then
ik = i(i)) = (ii)j = -j. It is easy to show that ij = -ji, so that this product
is noncommutative.
A general element of HF has the form fo + Ii i + fL.j + f3k. Multiplica-
tion of elements like this is done by algebraic manipulation using the
defined products of basis elements. We thus get a product on H F . It is a
straightforward computation to prove the following theorem.
7.8. Algebraic Extensions of JR and Concluding Remarks 131
Theorem 7.8.2
For a totally real field F, H F forms a division algebra of degree 4 over F. HF
is called the quaternion algebra over F.
The only difficulty in the proofis to show the existence ofinverses. This
is done just as in the complex numbers and we leave it to the exercises.
The quaternions H are the quaternion algebra over the reals. That is,
H = HR. This algebra consists ofall elements ro+rli+rzj+r3kwith n E JR.
IdentifYing JR with the first component we get that JR C H. Therefore, H
is a finite skew field extension of JR. The following theorem says that this
is the only one.
Theorem 7.8.3
Let D be a finite-dimensional division algebra over JR of degree greater than 1.
If D is a field, then ID:JRI = 2 and D ~ C. If D is a skew field, then ID:JRI = 4
andD ~ H.
Proof
If D is a field the result is just the reformulation of the Fundamental
Theorem of Algebra so we consider the case where D is a skew field. We
outline the proof below and leave the details to the exercises.
(1) If D is a skew field extension ofJR, it must have degree at least 4. As
in the field case every element is a root of a polynomial with coefficients
in JR. If the degree is 3, there is a root in JR, and if the degree is 2, the
resulting extension must be commutative and thus isomorphic to C.
(2) If D has degree four over JR, then it has a basis I, el, ez, e3. We can
show that the nonrea1 basis elements must satisfy quadratic polynomials
over JR and thus can be chosen to behave like the imaginary unit i. Thus
ei = e~ = e~ = -1. By the noncommutativity of D we can obtain that
eleZ = e3 and thus D ~ H.
(3) If D is a division algebra over JR of degree n + I, then as above
there is a basis I, el, ... , en with e; = -I, i = 1, ... , n. We can then
show that for each pair i,j with i i= j we have eiej + ejei E lR. From this
we can then show that we can replace ez bye; with el e; + e'Zel = O.(lf
el ez + eZel = 2c E JR set e; = (ez + ceI)/.J1=C2.) We can then show that
this leads to a contradiction if n > 3. (We may choose e3 = eleZ. Then
let aij = eiej + ejei, and so al2 = O. Then -2e4 = a14el + aZ4eZ + a34e3,
violating the independence of the baSiS). •
Exercises
7.1. Prove Theorem 7.2.1. If R is any algebraic structure, then Aut(R) forms a
group. Further, if S is a substructure, then Aut(S) is a subgroup of Aut(R).
7.2. Let G be a cyclic group of order 6. Determine the structure of Aut(G).
7.3. If G is a finite group and H is a subgroup, show that NG(H) is a subgroup
and IG : NG(H) I is the number of distinct conjugates of H in G.
7.4. If G is abelian and H is a subgroup, show that GIH is also abelian.
7.5. Suppose p, q are distinct primes with p > q. If G is a group of order pq, then
G has a normal p-Sylow subgroup.
7.6. Let F be a finite field. Show that F has p" elements for some prime p. (Hint:
Since IFI < 00 char F ::j: 0 so its prime subfield is 7lp . F is then a vector
space over 7lp ).
7.7. (a) Let a = i + ./2. What is irr(a, iQ) and liQ(a) : iQl?
(b) Let a = ./3 + .../7. What is irr(a, iQ) and liQ(a) : iQl?
7.8. Show that Q( ./3 + .../7) = Q(.J3, ./7). (Hint: Compute the degrees).
7.9. Suppose IE : FI = nand p(x) E F[x] is irreducible over F. Then if deg p(x)
and n are relatively prime then p(x) has no roots in E.
7.10. Suppose IE : RI < 00, then E = R or E = C.
7.11. Determine the Galois group and the lattice of subfields for the splitting field
of x3 - 6 over Q.
7.12. Let G be a finite group. Show that G = Gal(KIF) for some field F and Galois
extension K.
7.13. If G is a group and gl, gz E G, then the commutator of gI, gz, denoted by
[gl,gz] iSglgzg11g2"1. The commutator subgroup ofG, denoted by G' is the
subgroup generated by all commutators.
(a) Prove that G' is a normal subgroup of G.
(b) Prove that GIG' is abelian.
(c) Show that if GI H is abelian then G' C H.
7.14. If G is solvable and H is a subgroup of G, then H is also solvable. If H is
normal in G, then GIH is solvable.
7.15. Let a = ro + rli + rzi + r3k be a real quaternion, that is, a E H. Define its
conjugate a = ro - rl i + rzi + r3k. Show that aa is real and then use this to
define an inverse for each nonzero quaternion.
7.16. Fill in the details that for a totally real field F, H F forms a division algebra.
Exercises 133
1
make this more precise. If we integrate, then we get
it
1 dz 1 (2mr ire
21l"i y z - Zo = 2m 10 reit dt = n.
The number n is called the winding number of the curve y aound zoo
More generally, if y is any closed continuously differentiable curve in C
and Zo E C - image y, then it can be shown that
n(y,zo) = ~
2m
1~
yZ-ZO
is an integer, called the winding number of y around ZOo
134
8.1. Winding Number and Proof Five 135
RopeD
Fellow-traveler's path
g(z)
Traveler's path
j(z)
Proof
Suppose f{z) = anzn + ... + ao with an =I 0 and n ::: 1. In searching for a
root we can without loss of generality assume that an = 1, so that
f{z) = zn + an_Iz n- 1 + ... + ao.
If ao = 0, then z = 0 is a root so we may assume that ao =I O.
Recall that f{z) is then a continuous function C """* C . Further,
zn
lim - -1
z-+oo f{z) - I
-
Circle Simple closed curve
-
Torus
Figure 8.2.
Modem topology goes far beyond this rubber sheet geometric ap-
proach; tearing, cutting, and pasting are all permitted as long as they
are done in a continuous manner. However, the ideas of rubber sheet
geometry persist.
In this geometric approach the emphasis is on the spaces themselves.
This approach has led to the development ofhomology and homotopy the-
ory (discussed in the next chapter) and the topological theory ofmanifolds
(spaces that look "locally" like Euclidean space).
The second path topology followed generalizes analysis. Here the pri-
mary emphasis is on continuous functions and their properties. The
138 8. Thpology and Thpological Spaces
Definition 8.3.1
Let PI = (Xl, ... ,xn), Pz = (YI, ... ,Yn) E IR n. Then the distance from PI
to Pz, denoted by d(P I , Pz), is
n
d(P l , Pz) = L (Xi - Yi)Z.
i=1
Lemma 8.3.1
Let d:lR n x IR n --+ IR be the distance function on IR n defined as above. Then:
(1) d(PI,Pz) ::: 0 for all Pl,Pz E IR n , and d(PI,Pz) = 0 ifand only ifPI = Pz.
(2) d(PI,Pz) = d(Pz,PI) for all PI,Pz E IR n .
(3) d(PI,Pz) ::: d(P I ,P3)+d(P3,Pz)forallPI,Pz,P3 E IRn (triangle inequality).
Definition 8.3.2
Let f : IR n --+ IR m . Then f is continuous at Po E IR n if for any E > 0
there exists a 8 > 0 such that d(P, Po) < 8 implies that d(f(P),f(Po)) < E.
The function f is continuous on a subset U C IR n if it is continuous at
each point of U.
In all ofthe above discussion, the only necessity for defining continuity
was the ability to measure distance. We now abstract this to consider
general spaces where distance can be measured.
Definition 8.3.3
Let M be a nonempty set. A metric , or distance function on M is a
function d : M x M --+ IR satisfying
(1) d(x, y) ::: 0 for all X, Y E M, and d(x, y) = 0 if and only if X = y.
(2) d(x, y) = dey, x) for all pairs x, Y EM.
(3) d(x,y)::: d(x,z)+d(z,y) for all triplesx,y,z E M (This is the triangle
inequality).
140 8. Thpology and Thpological Spaces
EXAMPLE 8.3.1
(1) For any n ::: 1, JRn, n-dimensional real space, fonns a metric space
with the metric defined as in Definition 8.3.1.JRn with this metric is called
n-dimensional Euclidean space.
(2) Let Cora, b) ,be the set of all continuous functions on the closed
interval [a, b). If ty.x), g(x) E COra, b), define
b )112
d(f(x), g(x)) = ( llftx) - g(x)1 2dx
This then defines a metric on COra, b) (see the exercises), and so COra, b)
equipped with this metric becomes a metric space.
(3) Let COra, b) be the set of all continuous functions on the closed
interval [a, b). Now, ifftx),g(x) E COra, b], define
d(f(x),g(x)) = max (a,b)lftx) - g(x)l·
This also defines a metric on Cora, b), and so again Cora, b) becomes a
(different) metric space with this metric.
These last two examples show that a set may have more than one
metric defined on it. In particular, any set M can be made into a metric
space by defining del', y) = 0 ifx = Y and del', y) = 1 if x =1= y. However,
in most cases this is not a very interesting metric. 0
The second two metric spaces in Example 8.3.1 are specific cases of
a general type of space arising from certain types of real vector spaces.
These play an important role in analysis so we briefly introduce them.
Definition 8.3.4
Let V be a vector space over the real numbers JR. Then an inner product
on V is a function <, >: V x V -+ JR satisfying
(1) < x, x >::: 0 for all x E V, and < x, x > = 0 if and only if x = 0 (the
zero vector).
(2) < x, Y > = < Y, x > for all pairs of vectors x, Y E V.
(3) < x+y,z >=< X,Z > + < Y,z > foralltriplesofvectorsx,Y,z E V.
(4) < ax, Y > = a < x, Y > for x, Y E V and a E R
An inner product space is a vector space V with an inner product
defined on it.
A norm on V is a function I I : V -+ JR satisfying
(1) Il'l ::: 0 for all x E V, and Il'l = 0 if and only if x = O.
(2) laxl = lallxl for all x E V, a E R
(3) Ix + YI ::: Il'l + lyl for all pairs of vectors x, Y E V.
8.3. Continuity and Metric Spaces 141
EXAMPLE 8.3.2
IRn is an inner product space, where if PI = (Xl, ... , X n), Pz = (YI, ... ,Yn),
then
n
< PI, Pz > = L x;y;.
i=1
Lemma 8.3.2
Any normed linear space V is a metric space, where d(x,Y) Ix - YI for
X,Y E V.
Proof
Define d(x, y) = Ix - YI. Then d(x, y) ~ 0 and d(x, y) = 0 if and only if
X= Y follows directly from the norm property (1). Similarly, d(x, y) =
Ix-YI = I-lily-xi = d(y,x). FinallY,d(x,Y) = Ix-YI = Ix-z+z-YI ::::
Ix - zl + IY - zl = d(x, z) + dey, z). •
Lemma 8.3.3
Let V be an inner product space. For any v E V define Ivl = ( < V,V > YI2.
This then defines a norm on V. In particular any inner product space is then
a normed linear space and further then must be a metric space.
Proof
As in the proof of Lemma 8.3.2, this involves using the inner product
properties to get the norm properties.
Ivl = « v,v > )1/Z, so Ivl ~ 0 (since < v, v >~ 0), and Ivl = 0 if and
only if v = o.
lavl = « av, av »1/2 = (a z < V,V »1/2 = lal( < V,V > Y/2 =
lallvl·
1b get the final property we need the following important result,
true in any inner product space, the proof of which is sketched in the
exercises. •
142 8. Thpology and Thpological Spaces
Cauchy-Schwarz Inequality
In any inner product space V, for any u,v E V,
I < u, v > I ::: lullvl·
Now,
lu + viz = < u + v, u + v > = < u, u > +2 < u, v > + < v, v >
::: < u, u > +21 < u, v > 1+ < v, v >
Theorem 8.3.1
Let CO[a,b] be the set of all continuous functions on the closed interval [a,b].
For f(x),g(x) E CO[a,b] define
< f, g > = l b
f(x)g(x)dx.
Then CO[a,b] is an inner product space and therefore a normed linear space
and hence a metric space.
Proof
Addition, subtraction, and real scalar multiplication of continuous func-
tions preserves continuity. Therefore, COla, b] is a real vector space. Th
show that it is an inner product space we must now show the inner
product properties.
(1) < f,f >= J:Cf(x))zdx 2': 0. If < f,f >= 0, then Jab(f(x)idx = 0,
which implies that f(x) == 0, since f(x) is a continuous function.
(2) < f,g >= J: f(x)g(x)dx = J: g(x)f(x)dx =< g,f >.
(3) < af, g > = J: af(x)g(x)dx = a J: f(x)g(x)dx = a < f, g >.
(4) < f + g, h >= J:(f(x) + g(x))h(x)dx
= J: f(x)h(x)dx + J: g(x)h(x)dx =< f, h > + < g, h >. •
Definition 8.3.5
Let (M I , dI), (M zl dz) be metric spaces. A function f : MI -+ Mz is con-
tinuous at Xo E MI if for any E > °there exists a 0 > °
such that
8.3. Continuity and Metric Spaces 143
Definition 8.3.6
Let (M, d) be a metric space. If Xo EM, then an open ball of radius E > 0
centered on xo, denoted by Sf(XO) is the set (x; d(x, xo) < El. A set SCM
is an open set if for all Xo E S there exists an open ball centered on Xo
entirely contained in S. A set C C M is a closed set if its complement C'
is an open set.
Recall that these definitions generalize the concepts ofopen and closed
regions in the complex plane as discussed in Chapter 4.
Theorem 8.3.2
Let (M,d) be a metric space. Then:
(l) Any union of open sets is open.
(2) Any intersection offinitely many open sets is open.
Lemma 8.3.4
A function f:M I ~ M 2 is continuous at Xo E M I iffor each open ball Sf (f(xo))
centered on f(xo) there exists an open ball S8(XO) such that f(S8(XO)) C
Sf(f(XO)). Equivalently, for each open neighborhood S oftexo), the pull-back
f-\S) is an open neighborhood ofxo.
Lemma 8.3.5
A function f:M I ~ M 2 is continuous on M I if for each open set 0 C M2,
f-I(O) is an open set in MI.
In the next section we will take Lemma 8.3.5 as the definition of conti-
nuity. Before continuing, we would like to go over some other ideas that
144 8. Thpology and Thpological Spaces
the reader may be familiar with from analysis. These will not playa role
in our final proof of the Fundamental Theorem of Algebra, but we want
to tie them together with the above discussion.
We defined closed sets as being the complements of open sets. Most
people first encounter closed sets as sets that contain their boundaries.
These ideas are reconciled via the following concepts, which we just state.
Definition 8.3.7
Let (M, d) be a metric space and (x n ) a sequence of points in M. Then
X n -+ Xo if for any E > 0 there exists an N such that for all n ::: N, X n E
Sf (xo). If ScM, then x E M is a limit point of S if X n -+ x for some
sequence in S. The point ~o E S is an interior point if there exists an
E > 0 such that Sf(Xo) c S. The boundary of S consists of all the limit
points of S that are not interior points.
Theorem 8.3.3
Let (M,d) be a metric space. Then C C M is closed if and only if C contains
all its limit points.
Definition 8.3.8
A function [ : M1 -+ M2isuniformlycontinuou80nMl iffor any E > 0
there exists a I) > 0, depending only on E, such that d(j(Xl), f(X2)) < E
whenever d(Xl, X2) < I).
Theorem 8.3.4
Let f:IRn -+ IR m be continuous on M C IRn. If M is closed and bounded
(compact), then f is uniformly continuous on M.
the metrics on each space but then reformulated in terms of open sets.
What we wish to do now is remove the reliance on the metric but still
allow continuity. Th do this we must have a collection of open sets, and
this leads us to the concept of a topological space.
Definition 8.4.1
Let X be a nonempty set. A topology on X is a class ofsubsets T satisfying
(1) The empty set 12} and the whole set X are in T.
(2) The union of any class of subsets in T is in T.
(3) The intersection of any finite class of subsets in T is in T.
Thus a topology is a class of subsets, closed under arbitrary unions
and finite intersections. A subset in T is called an open set, while the
complement of an open set is a closed set.
A topological space is a set X together with a topology on it. The
elements of X are referred to as points, and the topology is called the
class of open sets ofX.
In any metric space the class of opens sets is closed under arbitrary
unions and finite intersections and therefore constitutes a topology.
Lemma 8.4.1
Any metric space is a topological space. In particular, 1R" and more generally
any inner product space, is a topological space.
EXAMPLE 8.4.1
Consider any nonempty set X, and let T be the class of all subsets of X.
Then T is a topology on X, called the discrete topology. 0
EXAMPLE 8.4.2
Let X be any nonempty set and let T = {Ii', X}. Then T is a topology on
X. 0
EXAMPLE 8.4.3
Let X be an infinite set and let T consist of the empty set 12} and every
subset whose complement is finite. Then T forms a topology (see the
exercises), called the cofinite topology on X. 0
Definition 8.4.2
Suppose X, Yare topological spaces and r:
X --+ Y. Then is a con- r
tinuous function, or continuous mapping, ifr-l(O) is an open set in
X whenever a is an open set in Y. Therefore, continuous functions pull
back open sets to open sets.
The function r : X --+ Y is an open mapping if r(V) is an open set in
Y whenever V is an open set in X. Therefore, open mappings preserve
open sets.
Definition 8.4.3
A homeomorphism between two topological spaces X, Y is a one-to-one
continuous function of X onto Y (thus a bijection) that is also an open
mapping. Equivalently, we could define this as a continuous bijection
whose inverse is also continuous.
If there exists a homeomorphism r:
X --+ Y we say that X, Yare
homeomorphic spaces.
Fine/Rosenberger
Figure 8.3
SIS
Figure 8.3.
generalize these ideas to topological spaces, and from this we can then
give sufficient conditions to ensure metrizability.
Definition 8.5.1
(1) A topological space X is a TI-space if each point is a closed set.
(2) A topological space X is a Hausdorff space (or Tz-space) if given
each pair x, y of distinct points in X there exist disjoint open sets Ox, Oy
with x E Ox, Y E Oy.
(3) A topological space X is a completely regular space (or T3-space)
if X is a T)-space with the additional property that given a closed set F
and a point x e F there exists a continuous function f : X -+ [0, 1] such
thatfCx) = 0 andf == Ion F.
(4) A topological space X is a normal space (or T4 -space) if X is a
TI-space with the additional property that given disjoint closed subsets
F I , Fz in X there exist disjoint open sets 0 1 , Oz with F I CO), Fz C Oz.
Theorem 8.5.1
For a topological space, normality implies complete regularity implies
Hausdorffimplies T 1 .
Definition 8.5.2
If X is a topological space, an open base or open basis for X is a class B
of open sets such that any open set is a union of sets from B. A second-
countable space is a topological space with a countable open base.
Definition 8.5.3
A topological space X, or more generally a subset of X, is compact if
every open cover has a finite subcover.
Theorem 8.5.4
Any compact Hausdorff space is normal.
Exercises 149
Theorem 8.5.5
Let X be a compact space and f:X ~ Y a continuous function. Then the
image of X is compact in Y.
Since subsets of IR n are compact if and only if they are closed and
bounded, the above result implies that any continuous real-valued func-
tion on a compact set must be bounded. In particular, iff(z) is a continuous
complex function then the values If(z)1 on any closed ball about the origin
must be bounded.
The final property we look at is connectedness. Intuitively, a con-
nected topological space is one that is in one piece. This property is
closely tied to the intermediate value theorem, which has played such
a large role in our proofs of the Fundamental Theorem of Algebra.
Definition 8.5.4
A topological space X, or more generally a subset Y of X, is connected if
whenever X = 0 1 UO2 with 0 1 , O2 open and 0 1 n O2 = 0, then one of 0 1
or 02 must be empty: that is, X is connected if it cannot be decomposed as
the union of two nonempty disjoint open sets. A decomposition into two
nonempty disjoint open sets, when possible, is called a disconnection.
Theorem 8.5.6
A subset Y C IR is connected ifand only if Y is an interval.
Theorem 8.5.7
If X is connected and f:X -+ Y is continuous, then feX) is connected.
Corollary 8.5.1
If f:1R ~ IR is continuous, then the image ofan interval is an interval.
Corollary 8.5.2
Suppose f:D ~ IR is continuous, where D is a connected subset oflRn . Then
f assumes any value between any two of its values.
Exercises
8.1. Prove Lemma 8.3.1 - that is, distance in jRn satisfies all the basic metric
properties.
150 8. Thpology and Thpological Spaces
8.2. Let Cora, b) be the set of all continuous functions on the closed interval
[a, b). If [(x), g(x) E COra, b), define
d(f(x), g(x)) = (l b
I[(x) _ g(X)IZdx) I/Z
Prove that this defines a metric on Cora, b) (Example 8.3.1), and so Cora, b)
equipped with this metric becomes a metric space.
8.3. Let Cora, b) be the set of all continuous functions on the closed interval
[a, b). Now, if [(x), g(x) E COra, b) define
d(f(x),g(x)) = max [a.bJlftx) - g(x)l.
Prove that this also defines a metric on Cora, b), and so again COra, b)
becomes a (different) metric space with this metric.
8.4. Show that n-dimensional real space Rn forms an inner product space, where
if PI = (XI, ... ,xn), Pz = (YI, ... ,Yn), then
n
8.5. Prove the Cauchy-Schwarz inequality - that is, in any inner product space
V, for any u, v E V then,
each f E F construct disjoint open sets Ox, Of such that x E Ox, f E Of. F
is then contained in the union of the Of, and then use the compactness.)
8.13. Prove that a compact subspace of a Hausdorff space is closed. (Hint: Use
the previous exercise.)
8.14. Prove that a subset of IR is connected if and only if it is an interval.
8.15. Show that the fact that the image of an interval in IR under a continu-
ous function is again an interval is equivalent to the intermediate value
theorem.
8.16. Prove that the image of a compact set under a continuous map is compact
(Theorem 8.5.5).
Algebraic
Thpology and the
CHAPTER Final Proof
There are two general methods for finding groups associated to topo-
logical spaces - homotopy theory and homology theory. We discuss
homotopy theory first.
If X, Yare topological spaces and f : X -+ Y, g : X -+ Yare contin-
uous maps, then roughly, f is homotopic to g if f can be continuously
deformed into g, a notion that we will make precise in Section 9.3. The
spaces X, Yare homotopic, or homotopically equivalent if there exist
continuous maps f : X -+ Y, g : Y -+ X such that fog is homotopic to the
identity map on Y and go f is homotopic to the identity map on X. Very
roughly, homotopy can be pictured geometrically as follows: X is homo-
topic to Y ifX can be continuously deformed and then shrunk, stretched,
or pinched to Y. If X, Yare homeomorphic they are clearly homotopic.
However, spaces can be homotopic without being homeomorphic. For ex-
ample the annular region pictured in Figure 9.1 is homotopic to the circle
but not homeomorphic to it.
stronger invariant then homology so that two spaces that are homotopi-
cally equivalent will have the same homology theory. We will make this
more precise in Section 9.4.
Definition 9.2.1
Suppose Hand K are groups. Then their direct product, denoted by H x
K, is the set {(h, k); h E H, k E K} under the operation (hI, kl)(h z , k z) =
(hlh z , klk z )' It is straightforward to verify that H x K forms a group.
Lemma 9.2.1
Suppose G = H x K. Then:
(1) HI = H X (I},KI = {I} x K are nonnal subgroups ofG isomorphic to
H,K respectively. Further, G = HIK I and HI n K I = {I}.
(2) H x K is finite if and only ifboth Hand K are finite.
(3) H x K is abelian if and only if both Hand K are abelian.
From the above lemma, part (1), we have that Hand K can be con-
sidered as normal subgroups of H x K. We can invert the process and
start with subgroups of a given group G. If they satisfy the properties of
Lemma 9.2.1, part (1), then G is the direct product of these subgroups.
Lemma 9.2.2
Suppose G is a group and H ,K are nonnal subgroups of G satisfying
(1) G = HK, that is, G is generated by all products {hk;h E H,k E K},
(2)HnK = {l}.
Then G ~ H x K. In this case we say that G is the direct product of its
subgroups H ,K.
We now turn our attention to abelian groups. Notice that from Lemma
9.2.1 direct products of abelian groups are themselves abelian. Recall that
9.2. Some Further Group Theory - Abelian Groups 155
Definition 9.2.2
A free abelian group F of rank n is a direct product of n copies of Z.
That is,
F = Z x ... x Z = Zn.
If F is free abelian and gl, ... ,gn are elements of F such that
Lemma 9.2.3
Let F be free abelian of rank n. Then:
(1) The number ofelements in any basis is unique and equal to n.
(2) If gl, ... ,gn is a basis, then each f E F has a unique representation
Theorem 9.2.1
Let gl, ... ,gn be a set ofelements in an abelian group G. Then G is free abelian
with basis gl, ... ,gn if and only if each mapping gl ~ H, ... ,gn ~ H into
an abelian group H can be extended to a unique homomorphism of G to H.
Lemma 9.2.4
If G is abelian, all elements offinite order form a subgroup.
Proof
Let tG = {g E G; g of finite order}. Now, 1 E tG, so tG is nonempty.
Suppose gl,gz E tG. Then there exist nl, nz with g~1 = g;2 = 1. Then
(glgZ)n,n 2 = (g~1)n2(g;2)"1 = I, since G is abelian. Hence, glgz has finite
order and so glgz E tG. Similarly, (gII)n, = (g~I)-1 = I, so gIl E tG.
Therefore, tG is a subgroup. •
Lemma 9.2.5
If G is a finitely generated torsion abelian group then G is finite.
Proof
We will show this for two generators. The general case then follows easily
by induction.
Suppose gl, gz generate G. Since G is a torsion group, both gl, gz have
finite order, say nl, nz respectively. Now, ifg E G, theng = g~1 g~1 ... g~kg~k ,
since gl, gz generate G. Since G is abelian, we can put all the occurrences
ofgl together and all the occurrences ofgz together so that
g = g;g~, for some integers t, s.
Since this holds for all g E G, the number of elements of G is bounded by
the product of the number of choices for t and the number of choices for
s. Since gl has order nl andgz has order nz, this product is nl nz. Therefore,
IGI ~ nl nz and hence G is finite. •
a negative solution. We refer the reader to the survey article by Gupta [G]
for a discussion of this.
Armed with these ideas we now state the fundamental theorem for
finitely generated abelian groups. We outline its proof in a series of
lemmas, however, we leave out most details. A very understandable com-
plete discussion is in the book of Rotman [R]. After the outline of the
proof, we will give an example of the use of this theorem.
Theorem 9.2.2
(Fundamental Theorem for Finitely Generated Abelian Groups) Let G be a
finitely generated abelian group. Then G has a unique (up to ordering of
factors) decomposition as a direct product ofa free abelian group offinite rank
and finite cyclic groups ofprime power order.
The rank ofthe free abelian factor of G is an invariant of G called its Betti
number.
Lemma 9.2.7
If G is a finitely generated abelian group, then G is a direct product of cyclic
groups.
Lemma 9.2.8
If G is a torsion-free finitely generated abelian group, then G is free abelian of
finite rank.
Lemma 9.2.9
If G is a finitely generated abelian group, then G = tG x (free abelian).
Lemma 9.2.10
If G is a finite abelian group, then G is a direct product of subgroups ofprime
power order.
Lemma 9.2.11
If G is a finite abelian p-group, then G is a direct product of cyclic p-groups.
Further, this decomposition is unique up to ordering of the factors.
EXAMPLE 9.2.1
We classify all abelian groups with Betti number 3 and torsion group of
order 180.
Suppose G is such an abelian group. Then
G ~ tG x (free abelian)
= tG x Z3,
since the Betti number is 3. Since ItGI = 180, we must then classify all
finite abelian groups of order 180.
The prime factorization of 180 is 22325. Therefore, the possible group
factorizations into cyclic groups of prime power order are:
(1) Z22 x Z32 X Zs.
(2) Z2 x Z2 X Z3 2 X Zs.
(3) Z22 x Z3 X Z3 X Zs.
(4) Z2 X Z2 X Z3 X Z3 X Zs.
Putting the two parts together, we see that there are precisely four
possible abelian groups with the desired characteristics, namely:
(1) Z2 2 x Z3 2 X Zs X Z3.
(2) Z2 x Z2 X Z32 X Zs X Z3.
(3) Z22 x Z3 X Z3 X Zs X Z3.
(4) Z2 x Z2 X Z3 X Z3 X Zs X Z3. 0
Definition 9.2.3
Let G be a group. If g, h E G, then their commutator, denoted [g, h)
is the product ghg-1h- 1. Clearly g and h commute if and only if their
commutator is trivial.
The commutator subgroup ofG, denoted by [G, G) or G', is the sub-
group of G generated by all the commutators from G. Again, clearly G is
an abelian group if and only if its commutator subgroup is trivial.
Theorem 9.2.3
Let G be a group. Then G' is a normal subgroup of G, and the quotient group
G/ G' is abelian. Further, if H is a normal subgroup of G with G/H abelian,
then G' C H.
Hence, G/G' is the largest abelian quotient group of G. We call G/G' the
abelianization of G, and denote this by Gab.
9.3. Homotopy and the Fundamental Group 159
Proof
By definition G/ is a subgroup. We must show that it is normal. Let x E G
and consider
x-I[g, h]x = x-Ighg-Ih-Ix
ghH = hgH,
since GIH is abelian. But then ghg-Ih-IH = H, or [g, h]H = H. This
implies that [g, h] E H for all commutators, and hence G/ C H.
Definition 9.3.1
Given topological spaces X, Y and continuous maps f : X -+ Y, g : X -+
Y, then f is homotopic to g if there exists a continuous map H : X x I -+
Y such that H(x, 0) = fcx) for all x E X and H(x, 1) = g(x) for all x E X.
The mapping H is called a homotopy. In this case g is also homotopic to
f via the homotopy HI (x, t) = H(x, 1 - t), so we say in general thatf and
g are homotopic.
160 9. Algebraic Thpology and the Final Proof
-------------------------
g(x) - H(x, I)
Definition 9.3.2
'IWo topological spaces X, Yare homotopically equivalent, or homo-
topic, if there exist continuous maps f : X -+ Y and g : Y -+ X such
that the composition gf is homotopic to the identity map on X and fg is
homotopic to the identity map on Y.
9.3. Homotopy and the Fundamental Group 161
1
af3(x) = f3(2x - I), if - :::: t :::: 1.
2
We picture some of this in Figure 9.3.
if a, 13, yare all loops based at xo, then (af3)y is homotopic to a(f3y). We
summarize all these statements.
Lemma 9.3.1
Suppose a,f3,y are loops based at Xo. Then:
(1) If a is homotopic to al and 13 is homotopic to 131, then af3 is homotopic
to alf3l.
(2) (af3)y is homotopic to a(f3y).
If a is a loop at xo, let [a] denote its homotopy class. Hence, [a] consists
of all loops based at Xo homotopic to a. Define [a][f3] = [af3]; that is, the
product of two homotopy classes of loops is the homotopy class of the
product ofclass representatives. From Lemma 9.3.1 we can conclude that
this is a well-defined operation on the set of homotopy classes. Further,
from the second part of that lemma it follows that this is an associative
operation on the set of homotopy classes. Let nl(X, xo) denote the set
of homotopy classes of loops based at Xo. We now have a well-defined
associative operation on nl(X, xo).
Let e denote the identity loop at xo, that is, e(t) = Xo for all t E [0, 1].
Then eel = ae = a for any other loop based at xo, and hence [e][a] =
[a][e] = [a], and therefore, [e] is an identity on nl(X, Xo). Finally, if a is a
loop at xo, define a- l (t) = a(1 - t); that is, a-I is the loop traversing a in
the opposite direction. We can show that aa- l and a-la are homotopic to
e, and hence [aa-l] = [a-la] = [e]. Therefore, each element of nl(X, xo)
has an inverse. We thus have on nl(X, xo) a well-defined operation that is
associative, has an identity, and under which there is an inverse for each
element. It follows that nl(X, xo) forms a group.
Theorem 9.3.1
The set n1(X ,Xo) ofhomotopy classes of loops based at Xo E X forms a group
under the operation [a][f3] = [af3]. It is called the fundamental group of X
based at Xo.
The above definition depends on the base point Xo. We can in many
cases remove this restriction. Suppose Xl E X and suppose that there is
a path p in X from Xo to Xl, that is, a continuous function p : [0, 1] -7 X
withp(O) = xo,p(1) = Xl. Then nl(X, xo) is isomorphic to nl(X, Xl) with
the isomorphism being given by [a] -7 [p-lap]. This is pictured in Figure
9.4.
Theorem 9.3.2
If XO,Xl are two points in the space X that are connected by a path p, then
nl(X,xo) ~ nl(X,Xl).
9.3. Homowpy and the Fundamental Group 163
Corollary 9.3.1
If X is path-connected, then ITl(X,xo) ~ ITl(X,Xl) for any pair of points
xoA E X. It follows that in this case there is really only one fundamental
group associated with X. We call this the fundamental group of X and
denote it by IT 1 (X).
EXAMPLE 9.3.1
(1) The unit disk in 1R2 , or more generally any solid sphere in IR n , is
contractible and therefore simply connected.
(2) For n > 1 the n-sphere Sn c IRn + l is simply connected but not
contractible. Recall that for n ::: 0 the n-sphere is Sn = {(Xl, ... ,Xn+l) E
IRn +l .• X 21 + X22 + ... + X n
2 = lI}·
+ 0
EXAMPLE 9.3.2
Consider the circle Sl C 1R2 . Fix a base point on this circle and an orien-
tation for paths. Any loop that does not go all the way around the circle
is homotopic to a point. If a loop goes around a circle more than once but
not twice, it is homotopic to a loop that goes around exactly once. Further,
loops that go around exactly once are all homotopic. In general, if a loop
is homotopic to a loop that goes around the circle exactly n times, we
say that this loop has winding number n. All loops of the same winding
number are homotopic.
With these ideas it follows that the homotopy classes of loops on the
circle are precisely those loops with winding numbers n = 0, I, 2, ... and
their inverses. The inverse of a loop with winding number n also winds
n times, but in the opposite orientation. We say this has winding number
164 9. Algebraic Thpology and the Final Proof
o
EXAMPLE 9.3.3
Consider a torus T as in Figure 9.5.
General loops are homotopic to loops that wind around the inside circle
M and then the outside circle o. Such loops thus have a winding number
around M and around o. Further, loops around M and around 0 commute.
It follows that 0 1 (7) is free abelian of rank two, that is,
This result can also be obtained from the following. The torus T can
be considered as a product space of two circles T = Sl X Sl. It can be
proved that if X, Yare path-connected spaces, then
Therefore,
o
Finally, an example to show that not every fundamental group is
abelian.
EXAMPLE 9.3.4
Consider a figure eight curve in]R2 as in Figure 9.6.
9.3. Homotopy and the Fundamental Group 165
Such a loop is called a word in a, fJ, and the corresponding group is called
a free group of rank 2. This is nonabelian and not to be confused with a
free abelian group of rank 2. We won't discuss these groups further here
but refer to [Ro] for details.
More generally, a planar figure consisting of n circles touching at a sin-
gle point is called a bouquet of n circles and the resulting fundamental
group is a free group of rank n. 0
Theorem 9.3.3
If X, Yare path-connected spaces and h:X -+ Y is a continuous map, then
h*:n 1 (X) -+ n 1 (y) by h*[a] = rhea)] is a homomorphism. Ifh is a home-
omorphism, then h* is an isomorphism. Further, if X, Yare homotopic then
n1(X) is also isomorphic to n 1 (y).
EXAMPLE 9.3.5
Consider an annular region A as in Figure 9.1. Since A is homotopic to a
circle, we have
o
166 9. Algebraic lbpology and the Final Proof
Definition 9.3.3
Let B, X be path-connected spaces. Then B is a covering space of X if
there exists a map p : B ~ X, called a covering map, such that for each
x E X there exists an open neighborhood U of x such that each connected
component ofp-I(U) is homeomorphic to U under p.
Theorem 9.3.4
(1) Let B be a path-connected covering space ofa path-connected space X with
covering map p. Then:
(i) p* : n I (B) ~ n I(X) is an injection so that n I (B) can be considered
as a subgroup of n I (X).
(ii) If x E X and p-I(X) = fbi} C B, then p*(nl(B,bi)) forms a
complete set of conjugate subgroups in nI (X).
(2) Given a complete set {Hi} ofconjugate subgroups in n I (X), there exists
a path-connected covering space B of X such that n I (B) ~ Hi.
q+1
ai E JR, 0 S ai S 1, 1 SiS q +1 and L ai = I}.
i=1
Definition 9.4.1
Suppose POPI ... Pq is an oriented q-simplex. Then for i = 0, ... , q its ith
face is the oriented (q - I)-simplex given by
(-I)iPOpl .. . Pi·· .Pq .
Definition 9.4.2
If POPI ... Pq is an oriented q-simplex with q 2: 1 then its boundary 8q is
q
8q(POPI ... Pq ) = L(-I)iPOPI .. . Pi ... Pq •
i=1
Lemma 9.4.1
For any q-simplex we have 8q- I 8q = O. Hence by additivity 8q- I 8q , applied to
any formal sum of q-simplexes always gives O.
9.4. Homology Theory and niangulations 169
Proof
We show this for a 2-simplex. The computation for a 3-simplex is left to
the exercises while the general case follows in the same manner.
Let POPI Pz be an oriented 2-simplex. Then
alaz(poPIPz) = al(PIPz - PoPz + POPI )
= al(PIPz) - al(Popz) + al(poPI )
= Pz - PI - Pz + Po + PI - Po
= o.
•
Many geometric figures in IR n ,
in particular polyhedra, are built up
in a very nice way from simplexes. These figures are called simplicial
complexes, which we formally define below. First for a q-simplex we
define its subsimplexes to be the set ofall faces, all faces offaces, all faces
of faces of faces, etc. without regard to orientation. Thus, for example, the
set of subsimplexes of POPIPZP3 is
PI PZP3, POPZP3, POPI P3, PoPIPZ,
PoPI , PoPz, POP3, PIPZ, PI P3, PZP3, Po, PI, Pz·
Definition 9.4.3
A simplicial complex in jRn is a subset satisfying the following
conditions:
(1) It is a union of oriented simplexes. Notice that in IRn the maximum
dimension of a possible simplex is n - 1.
(2) Each point belongs to only finitely many simplexes.
(3) The intersection of two different (up to orientation) simplexes is
either empty or one of the simplexes or a subsimplex of both.
(A) (8)
Simplicial Complex Not a Simplicial Complex
Definition 9.4.3'
Let X be a topological space. Then a simplicial complex in X is a subset
Y satisfying:
(I) It is a union of oriented simplexes. (In distinction to IR n there is no
maximum dimension of a possible simplex.)
(2) Each point belongs to only finitely many simplexes.
(3) The intersection of two different (up to orientation) simplexes is
either empty or one of the simplexes or a subsimplex of both.
EXAMPLE 9.4.1
Consider 81 the unit circle in IR z . Then a triangulation as a I-dimensional
simplicial complex is pictured in Figure 9.8. The O-simplexes are Po, PI, Pz,
while the I-simplexes are PoP}, PIPZ, PzPo.
0
- 0
p
0
P
M
M
p p
o o
P ----"---t....---"'-----.P
M
given by
on(mID~ + m2D~ + ... + mtD~) = mlon(D~) + ... + mton(D~).
From Lemma 9.4.1 we have that On-IOn: Cn(X) -+ Cn- 2(X) is always
the trivial homomorphism.
The kernel of On consists of all n-chains whose boundary is zero. We
call these the n-dimensional cycles. The kernel is then a subgroup of
Cn(X), called the n-dimensional cycle group, denoted by Zn(X).
The image of Cn +l (X) in Cn(X) under On+l is called the n-dimensional
boundary group, denoted by Bn(X). Thus, an n-chain is in Bn(X) if it
is the boundary of an (n + I)-chain. From the fact that OnOn+1 = 0 we
have that Bn(X) must be a subgroup of Zn(X), that is, each n-dimensional
boundary is an n-dimensional cycle.
Since all these groups are abelian, all subgroups are normal, and hence
Bn(X) is a normal subgroup of Zn(X). Therefore, we can form the quo-
tient group Zn(X)/Bn(X). This quotient group is called the n-dimensional
homology group, denoted by Hn(X). We summarize all this.
Definition 9.4.4
If X is a simplicial complex with the above condition, then:
(1) Cn(X) is the n-dimensional chain group, which is the free abelian
group with basis the n-simplexes in X.
(2) On : Cn(X) -+ Cn-I(X) is the boundary operator, which is
the homomorphism formed by extending the boundary operator on
simplexes.
(3) Zn(X) is the n-dimensional cycle group, which is the kernel of
the boundary homomorphism On : Cn(X) -+ Cn-I(X).
(4) Bn(X) is the n-dimensional boundary group, which is the image
of Cn+l (X) in Cn(X) under the boundary homomorphism On+l'
(5) Hn(X) is the n-dimensional homology group, which is
Zn (X)/B n(X).
'IWo cycles in Cn(X) that differ by a boundary - that is, lie in the same
coset of Bn(X), are said to be homologous.
These definitions appear to depend on the particular triangulation of
the space X. That is, if X had a different simplicial complex then per-
haps the homology groups might be different. The following important
invariance theorem says that this cannot be true and is really where the
strength of the combinatorial topological approach lies.
9.5. Some Homology Computations 173
Theorem 9.4.1
Suppose X is a topological space that has a simplicial decomposition. Then
the homology groups determined from this decomposition are isomorphic to
those determined by any other simplicial decomposition of x.
Theorem 9.4.2
Suppose X, Yare topological spaces with simplicial decompositions and h:X -+
Y is a continuous map. Then h induces a homomorphism h*:Hn(X) -+ Hn(Y)
for all n. Ifh is a homeomorphism, then h* is an isomorphism. Further, ifX, Y
are homotopic then the homology groups are also isomorphic.
Theorem 9.5.1
If X is a contractible space, then Ho(X) = Z and Hn(X) = aifn ::: 1.
174 9. Algebraic 'Ibpology and the Final Proof
Theorem 9.5.2
If X is a simplicial complex with n connected components, then
Ho(X) = free abelian of rank n ~ Zn.
Finally, the following result can be proved. This not only allows a
computation of HI when n I is known but gives a relationship between
homology and homotopy. Recall that if G is a group, its abelianization is
Gab = GIG' (see Section 9.2). IfG is abelian, then Gab = G.
Theorem 9.5.3
If X is a path-connected simplicial complex then
HI(X) ~ nI(X)ab.
IfnI(X) is abelian then HI (X) ~ nI(X).
EXAMPLE 9.5.1
We compute the homology of the circle SI. A triangulation was given in
Figure 9.8. Clearly, the homology of SI is the same as the homology of a
triangle, the boundary of a 2-simplex.
Since SI is connected it has one connected component and we have
HO(SI) = Z. Further, since SI is I-dimensional, Hn(SI) = 0 for n > 1.
We saw in Section 9.3 that nI(SI) = Z which is abelian. Therefore,
HI(SI) ~ nI(SI) = Z. We could also obtain this result directly from the
triangulation.
CI(SI) is free abelian with basis POPI , PIP2, P2PO' A I-chain is then of
the form
mIPOPI + m2PIP2 + m3P2PO for integers mI, m2, m3.
The only way the vertex PI will be eliminated in forming the boundary is
ifmI = -m2. Similarly, the only way P2 will be eliminated is ifm3 = -m2.
9.5. Some Homology Computations 175
m(PoPI - PI P2 + P2Po), m E Z.
Thus, any cycle must be a multiple ofthe single I-chain POPI - PI P2 + P2PO.
It follows that ZI(SI) ~ Z. Now, BI(SI) = 0, since there are no 2-chains,
so HI(SI) = ZI(SI)/BI(SI) = Z. 0
EXAMPLE 9.5.2
Consider the 2-sphere S2. Just as Sl could be triangulated to look like a
triangle, S2 can be triangulated to look like the surface of a tetrahedron,
as pictured in Figure 9.11.
Hence in summary,
Ho(SZ); = z;
Hz(SZ) = z;
Hn(SZ) = 0 ifn =1= 0,2.
o
EXAMPLE 9.5.3
Consider a torus T. A triangulation was pictured in Figure 9.10. T is con-
nected, so Ho(T) = Z. The group nI(T) was free abelian of rank 2, so
HI(T) ~ nI(T) = Z x Z.
A 2-chain is a sum of the seventeen triangles (2-simplexes) pictured
in Figure 9.10. As in the argument for the circle and in the argument for
the sphere, the only way the common edge of triangles numbered 1,2
will cancel in taking the boundary is if the coefficients are of the same
magnitude but of different signs. If we let TI, ... , T17 , denote these 2-
simplexes then this argument leads to the fact that a 2-cyde in T is a
multiple ofTI - Tz + T3 - ••. + TI7 . It follows that Zz(T) = Z, and since
Bz(T) = 0, the torus being 2-dimensional, we have Hz(T) = Z.
Finally, Hn(T) = 0 ifn > 2. 0
EXAMPLE 9.5.4
Consider X to be a figure eight - bouquet of two circles as seen in Figure
9.6. Ifwe triangulate each circle as in Example 9.5.1 we get a triangulation
of this figure as a I-dimensional simplicial complex.
Since this figure is connected Ho(X) = Z. The group nI(X) is a free
group of rank 2 (see Section 9.3). The abelianization of such a group is
free abelian of rank 2. Therefore, HI(X) = nI(X)ab = Z x Z. Finally,
Hn(X) = 0 if n > 1. 0
arguments used in Examples 9.5.1 and 9.5.2, the following can be proved.
We refer to [Be] for a detailed proof.
Theorem 9.6.1
Let Sn be an n-sphere with n ::: 1. Then Ho(sn) = Z, and for q ::: I,
Hn(Sn) = Z,
Hq(Sn) = 0 ifn #- q.
Now, suppose sn, En are two n-spheres, and consider a continuous map
f : sn ~ En. This induces a homomorphism [* : Hn(sn) ~ Hn(E n).
From Theorem 9.6.1, Hn(sn) is infinite cyclic. Suppose a is a generator of
Hn(sn). Similarly, Hn(E n) is infinite cyclic and suppose fJ is a generator.
Then since [* is a homomorphism, f*(a) is an integer multiple of fJ. That
is,
[*(a) = mfJ for some integer m.
The number m is called the Brouwer degree, or degree, of the map f.
We abbreviate this by deg(f). Intuitively, deg(f) is the algebraic number of
times that the image fCsn) winds around En.
The following three results on degree are the critical ones for the proof
of the Fundamental Theorem of Algebra. The proofs are lengthy and we
leave them out. They can be found in [H-Y].
Lemma 9.6.1
The degree ofa continuous map is independent ofthe simplicial decompositions
of sn and 1::".
This lemma is needed because even though Hn(sn) and Hn(E n) are
both infinite cyclic, the action off on the cycle a generating H n(sn) may
be different for different simplicial decompositions. The lemma says that
in mapping to another sphere, up to homology this does not happen.
Lemma 9.6.2
(1) The degree of a continuous map f of an n-sphere Sn into an n-sphere En
depends only on the homotopy class off. In particular, two homotopic maps
fsn ~ En ,g:sn ~ En have the same degree.
(2) If f:sn ~ En,g:sn ~ En are two continuous maps with deg(f) =
deg(g) , then f and g are homotopic.
Theorem 9.6.2
If f:8 n -+ I;n and deg(f) =f:. 0, then each point of I;n is in the image off( sn).
This last theorem will be the key result in our final proof.
Before closing this section, we state a famous result due to Brouwer -
the Brouwer fixed point theorem - that also is proved using Brouwer
degree. Iff: X -+ X, then a fixed point for f is a point Xo E X such that
f(xo) = Xo·
Theorem 9.6.3
(Brouwer Fixed Point Theorem) Let D n be the n-ball, that is, the interior and
boundary of the unit sphere 8n - 1 C IR n . Suppose f :Dn -+ D n is a continuous
map. Then f has at least one fixed point.
N(O,O,I)
p(w)
explicitly derived (see [AD, but all we need is that p is a continuous bi-
jection between C and S2 - N. If we identify the north pole N with 00,
then p is a bijection of all of S2 with C U 00. The map p can be shown to
be conformal, and thus S2 provides a conformal model of the extended
complex plane C U 00. When we look at S2 in this manner, it is called the
Riemann Sphere.
Theorem 9.7.1
Stereographic projection p provides a conformal bijection between S2 and C U
00.
Lemma 9.7.1
P(z) is homotopic to the map fez) = zn.
Proof
Let H(z, t) = zn + (l - t)(ao + alz + ... + an_lz n- 1) for z E C, t E [0,1]
and H(oo, t) = 00 for t E [0,1]. Clearly, H(z, t) is continuous on C x I --+
C. Now, limz--+ooH(z, t) = 00 = H(oo, t) for all t E [0, I], so H(z, t) is
continuous from (C U 00) x I --+ C U 00, and thus from S2 x I --+ S2. Since
H(z, 0) = P(z) and H(z, 1) = zn, this defines a homotopy. •
Lemma 9.7.2
The Brouwer degree off(z) = zn is n. Thus for complex polynomials, Brouwer
degree and polynomial degree coincide.
Proof
Consider the two triangulations of S2 (considered as C U (0) pictured in
Figure 9.13.
Under fez) = zn the n-shaded pieces in figure (A) are mapped onto the
one-shaded piece in figure (B) in an orientation-preserving fashion. It is
clear then that f will map a cycle in the triangulation (A) to n times the
cycle in triangulation (B). Therefore, the degree of zn is n. •
Theorem 9.7.2
(Fundamental Theorem of Algebra) Suppose P(z) is a nonconstant complex
polynomial. Then P(z) has a complex root.
180 9. Algebraic 'Ihpology and the Final Proof
j(z) = t'
~
(A) (B)
Proof
Suppose P(z) = ao + alZ + ... + anzn. Since P(z) is assumed nonconstant,
n > 0 and an i= O. Therefore, as before we can assume that an = 1. From
Lemma 9.7.1, p(z) is homotopic to f(z) = zn, and therefore, from Lemma
9.7.2 and Lemma 9.6.2, P(z) has Brouwer degree n i= O. From Theorem
9.6.2 each point of82 is then in the image of p(z). In particular there exists
at least one point Zo E 82 with P(zo) = O. Since P(oo) = 00, it follows that
Zo E C. •
Exercises
9.1. Prove that ifHand K are groups, then their Cartesian product H x K is also
a group under (h, k)(h 1 , k1) = (hh 1 , kkI).
9.2. Prove Lemma 9.2.1.
9.3. Prove Lemma 9.2.2.
Exercises 181
9.4. Prove that if p, q are relatively prime integers, then 'Ilp x 'Ilq is cyclic. This
shows that the direct product of cyclic groups can be cyclic.
9.5. Show that 'Il z x 2: z is not cyclic. This shows that the direct product of cyclic
groups may not be cyclic.
9.6. Prove that every finitely generated abelian group is a direct product ofcyclic
groups. (Hint: Show it for two generators.)
9.7. Prove that a finite abelian group is a direct product of abelian p-groups.
(Hint: In G consider all elements of order a power of p for p a prime dividing
the order of G. Show that this is a subgroup and G is the direct product of
all the subgroups of this form for all primes dividing the order of G.)
9.8. Give a complete classification of all finite abelian groups of the following
orders. (i) 25 (ii) 45 (iii) 37 (iv) 1284
9.9. Show that if X, Y are homeomorphic, then they are homotopic.
9.10. Show that all contractible spaces are homotopic.
9.11. Describe a general element of the fundamental group of a bouquet of n
circles.
9.12. Suppose nl(X) = 'Il z x Zz. Up to homotopy, how many possible covering
spaces of X can there be?
9.13. Determine the homology ofa bouquet ofn circles.
9.14. If POP\PZP3 is a 3-simplex, verify that OZ03(POP1PZP3) = O.
9.15. Compute the homology of two tangent 2-spheres.
AVersion of
Gauss's
APPENDIX Original Proof
Th see this, let S be a real constant greater than all the magnitudes
A, B, ... L. Then take R = 1 + hs. If r > R, then r > 1 + hs, and so
I
1 - :/J:2.
r-I
> O. Since r > R > 1'
we have
r
that 1 + .1
r2
+ ... + .1
r"
< _1_.
r-I
It follows then that
" ~ 1 1 1
r (1 - v 2S(; + r 2 + ... + r")) > 0
hence
r" - hS(r"-I + ... + 1) > 0
and so
r" - h(Arn - I + Br n - 2 + ... + L) > O.
Second, the circumference ofa circle of radius r > R will consist of 2n
arcs, inside of which T takes alternately positive and negative values as
in Figure A.I.
T>O
n
(-I/+ IT = -r + (_I)k+l Arn- I cos((n - 1)eJ>' + a)
"fi
+ ... + (-I)k+ILcosA.
Replacing (-ll cos((n - 1)eJ> + a), ... , (-I)k cos A, and (-ll+ 1 cos((n -
1)eJ>' + a), ... , (-ll+ 1 cos Aby -1 we get the following inequalities:
n
k
( -1) T> -r - AR n - 1 - ... - L ,
-"fi
The right sides of both of these inequalities are positive by the choice of
R, thus proving the original statement.
Now, since T varies continuously with eJ>, it follows that it will be zero
at least 2n times on the circle. Suppose there are 2n zeros at the points
00, OJ, ... ,02n-l, which have arguments between 0,30,50,70, ... ,(8n -
3)0, (8n - 1)0 respectively. Let ~ = tan(eJ>I2), and then
1 - S2. 2s
cosq, = --2,smq,
l+s
= --2'
1+~
).
Now let z = r( ~~f~ + i 1~12 Then if we consider the original polynomial
f(z), its real part T becomes
T = P2n(~) .
(1 + ~2)n
Here P2n(~) is a real polynomial of degree no higher than 2n. Since this
polynomial has 2n zeros, it must be of degree exactly 2n, and there can
be no other roots. Therefore, 00, OJ, ... ,02n-1 are the only points on this
circle where T, vanishes and hence the positive and negative values ofT
alternate on this circle, as pictured in Figure A.l.
Thirdly, the values of the imaginary part U are positive at 00,02, ...,
O2n-2 and negative at OJ, OJ, ... , O2n-l' The argument q, of Ok lies be-
tween (4k + 1)1l'/4n and (4k + 3)1l'/4n. It follows that (-ll sin(neJ» ~
1/"fi. Multiplying U by (_I)k and replacing (-llsin((n - 1)eJ> +
a), ... , (-I)k sin Aby -1 we have the inequality
(-I)k U ~ rn (-I/ sin(neJ» - ARn - 1 - .•. - L
and hence also the inequality
n
k r
(-1) U ~ "fi - AR n - 1 - ... - L.
App. A. A Version of Gauss's Original Proof 185
The right-hand side is positive by the choice of R and therefore the sign
ofUatQkis(-ll·
Now choose a circle r of radius r > R. Its circumference is divided
into 2n arcs by points 00, ... ,02n-1 between which the sign of T alter-
nates. As we expand the radius of the circle r the arcs OoOJ; OJ 02; etc.
sweep out 2n regions extending to infinity within which T has alternately
positive and negative values and these regions are separated by curves
on which T = O. Borrowing from Gauss and Uspensky's treatment, call
the regions outside of r where T > 0 seas and the regions where T < 0
lands. The curves where T = 0 are then seashores. The n seas and n
lands extend themselves into the interior of r across the various arcs.
Starting from OJ, move along the seashore so that the land is always on
the right heading inward. We must eventually exit r, and when we cross
it again the land must still be on the right. If the circumference is fol-
lowed counterclockwise, lands and seas alternate. It follows then that we
must exit r heading outward at an even point O2k, that is at 02, or Q4,
etc. Thus there is a continuous path y leading from OJ to some point O2k.
On path y, T = O. At OJ, U < 0, whereas at point O2k, U > O. Since U
is a continuous function along the path y, at some point on y it takes the
value o. At this point T, U are both zero, which gives the existence of a
root.
187
188 Appendix B: Cauchy's Theorem Revisited
where
y(t) = YI(t) ift E [ab bI] and
y(t) = yz(t + az - bl ) ift E [b l , bz - az + bI].
Analogously, we can define the sum YI + yz + ... + Ym of finitely many
curves in U. This addition of curves is associative.
A curve Y : [a, b) ~ C, a ~ b, is called piecewise continuously dif-
ferentiable if there are finitely many continuously differentiable curves
YI, ... ,Ym such that Y is the sum Y = YI + + Ym, that is, there are
points aI, az, ... ,am+l with a = al < az < < am+l = b such that
Yi is a curve with domain I[ai, ai+d, 1 ~ i ~ m, and is continuously dif-
ferentiable. For example, each polygonal curve is piecewise continuously
differentiable.
Now let U be an open region in C and Y : [a, b) ~ U a curve that is
the sum Y = YI + ... + Ym of finitely many continuously differentiable
curves YI, ... , Ym' Suppose f : U ~ C is a continuous function. Then we
define the line integral
1 Y
f(z)dz = t1
i=l Y,
f(z)dz.
i f(z)dz ~ ML(y).
For the remainder of Appendices Band C we assume always that a
curve is piecewise continuously differentiable as defined above.
Definition 8.1
A subset M C C is starlike if there is a point Zl E M such that for each
z E M, the line segment ZIZ is contained in M. Zl is called a center of M.
Clearly, starlike sets are path-connected.
U C C is a star domain if U is open and starlike.
EXAMPLE B.1
The following sets are all star domains.
(a) C or IR with center O.
(b) Any open circular region.
Appendix B: Cauchy's Theorem Revisited 189
Definition B.2
Let U C C be open and { : U ~ C be continuous. An analytic function
F : U ~ C is a primitive function for { in U if F' = f.
where YCt l ), YCto) are the endpoints of the curve y. Hence, if a continuous
function has a primitive function in U, it satisfies Cauchy's Theorem.
Goursat's proof consists essentially in showing that any analytic function
in a star domain has a primitive function. We first must relate integrability
to the existence of primitive functions.
Definition B.3
Let U C C be open. Then { : U ~ C is called integrable in U if { is
continuous in U and if{ has a primitive function in U.
Theorem B.I
(Criteria {or Integrability) Let U be a domain andf:U ~ C continuous in U.
Then the following are equivalent:
(1) { is integrable in U.
(2) For each closed curve y in U we have fy{(z)dz = O.
I{ (2) is fulfilled, fix a point Zl E U and {or each Z E U choose a curve y
from ZI to z. Define
F(z) = i f(z)dz, z E U.
Proof
(1) clearly implies (2), so we must only show that (2) implies (1). Further,
it is enough to show that F(z) as defined in the statement is a primitive
function off(z). Thus, we must show that F'(z) = f(z) for all z E U.
Since fyf(z)dz = 0 for each closed curve y in U, the function F(z) is
well-defined (see Theorem 5.2.2). The proofis then the same as the proof
of Theorem 5.2.2.
190 Appendix B: Cauchy's Theorem Revisited
,
F(z)
l'
= 6z-+0
1m
F(z + ~z) - F(z)
~z
= 6z-+0
lim - 1 (
~z
l z 6z
+ ftw)dw - l z
ftw)dw)
l
Zo Zo
z 6z
= 6z-+0
lim - 1
~z z
+ ftw)dw.
l
Z +6Z
Z ftw)dw = (ftz) + E)~Z,
where E -+ 0 as ~z -+ O.
Therefore,
l
Z+t.Z
lim
t.z-+O Z
ftw)dw = ftz),
and hence F'(z) = ftz). •
We now show that in a star domain V the integrability of ftz) follows
from the fact that fa6 ftz)dz = 0 for the boundary of any triangle ~ in V.
Theorem B.2
(7Hangle Criterion for Integrability) Let V be a star domain with center Zo
and let f: V -+ C be continuous. If
( ftz)dz = 0
Ja6
for the boundary a~ of each triangle ~ c V with Zo as a vertex, then fez) is
integrable in U.
Further, the function
F(z) = ~ ftz)dz
Jzoz
is a primitive function for fez) in V, and in particular, fyf(z)dz = 0 for any
closed curve y in V.
Proof
F(z) is well-defined, since ZlZ C V for all z E V. Let C E V be fixed and
choose z E V such that the triangle ~ with vertices Zl, Z, c is contained
in V. We picture this in Figure B.!. Since fa6 ft z)dz = 0, we get that
Then as in the proof of Theorem 5.2.2 and Theorem B.l we get that F is
a complex differentiable function with F'(c) = fCc) for c E U. •
We can now give the key criterion in Goursat's proof of the extended
Cauchy theorem.
Theorem 8.3
(Integral Lemma of Goursat) Let U C C be open and f: U ~ C be analytic.
Let I::. cUbe a triangle. Then
r f(z)dz =
Jot!>
O.
Proof
First we mention two elementary geometrical facts. Let I::. be any triangle
and let L stand for length.
(1) MaxW,ZEt!>lw - zl ~ L(al::.).
(2) L(al::.') = ~ L(al::.) for each of the four congruent subtriangles 1::.'
formed by drawing the line segments connecting the midpoints of the
sides of 1::.. These subtriangles are pictured in Figure B.2. •
Now define v(l::.) = fiJt!> f(z)dz. We draw the line segments connecting
the midpoints of the sides of I::. to get four congruent subtriangles I::. v ,
v = 1,2,3,4 as in Figure B.2. We then have
4 r 4
v(l::.) = ~ JiJt!> f(z)dz = ~ v(l::. v ),
since the integrals over the connecting lines occur twice with different
signs and thus cancel. We picture the situation in Figure B.3.
192 Appendix B: cauchy's Theorem Revisited
From the four integrals we choose one with maximal absolute value
and let !:J.. 1 be the triangle for this chosen integral. Then
Iv(!:J..) I S 4Iv(!:J.. 1)1.
Now we apply the same division and selection procedure to obtain a
second subtriangle !:J.. 2 with
IV(!:J..) I S 4Iv(!:J..1)1 s 42 Iv(!:J.. 2)1.
If we continue in this manner we get a descending sequence
!:J..l ~ !:J..2 ~ ... ~ !:J.." ~ ...
of bounded closed triangles with
Iv(!:J..) I S 4" Iv(!:J..") I, n = 1,2, ...
From the second elementary geometrical remark made at the start of
the proofwe have that
From the nested rectangles property in ]R2 we have that the intersection
contains exactly one point c E !:J... Since f(z) is analytic there is a
n~=I!:J.."
continuous function g : U -+ C with
f(z) = f(c) + f'(c)(z - c) + (z - c)g(z), z E U with g(c) = o.
Appendix B: Cauchy's Theorem Revisited 193
Theorem B.4
(Cauchy-Goursat Theorem for Star Domains) Let U C C be a star domain
with center Zo and let f:U ~ C be analytic. Then fez) is integrable in U, and
the function
i fCz)dz =0
for any closed curve y in U.
Appendix B: Cauchy's Theorem Revisited
il,z
-dz =
zIt
iT -dt + 1
0
9 il
ire dt = log r + iB,
-'1
reI
the main branch of the complex logarithm function in C- .
Using the Cauchy-Goursat theorem, Cauchy's integral formula and the
Liouville theorem follow as in Chapter 5. From this we obtained our sec-
ond proofofthe Fundamental Theorem ofAlgebra. In Appendix C we use
some other results in complex analysis to give three additional analytic
proofs.
Three Additional
Complex Analytic
Proofs of the
Fundamental
Theorem of
Algebra
APPENDIX
Lemma C.l
(Mean Value Theorem and Inequality for Complex Integrals) Let U C C be
open, f:U ~ C be analytic, and let B = Br(zo) be a circular disk of radius
r > 0 centered on Zo and contained in U. Then using the parametrization
yet) = Zo + re it ,0 ~ t ~ 2n, for CJB we get
and
LemmaC.2
(Growth Lemma) Let p(z) = ao + alz + ... + anz n be a complex polynomial
of degree n ::: 1 (so an ¥= 0). Then there exists an R ::: 1 such that for all
195
196 Additional Complex Analytic Proofs
Proof
Letr(z) = laol + latllzl + ... + lan_lllzl n- l . Then
lanllzl n - r(z) ~ Ip(z) I ~ lanllzl n + r(z)
by the triangle inequality. If Izi :::: I, then we have r(z) ~ klz\n-l, where
k = L~:d lail, because Izl i ~ Iz\n-l for i < nand Izi :::: 1. Therefore we
get the stated result with
R = max {I, 2klan l- 1 }. •
We can now give our three new proofs. Again, we want to prove that
any nonconstant complex polynomial has a complex root.
Proof Seven
Letp(z) = ao +alz + ... +anz n be a complex polynomial ofdegree n :::: 1
and assume thatp(z) has no zero in C. As in Chapter 3 (see 3.4.1) let
p(z) = ao + alZ + ... + anzn.
Thenp(Z) =
p(z) for all z E ((;, and hence g(z) =
p(z)p(z) has degree 2n
and no zero in C. Further, g(x) = Ip(x)1 2 > 0 for x E JR.
Since gtZ) is analytic in ((;, we get from Cauchy's Theorem that for all
1-
r > 0
0-
-
j-r
r
-ax-2
Ip(x)1
+
y,
dz
g(z) ,
(C.1)
-r o r
with M = 21anl-2 for Izi ::: r, r sufficiently large. Then for such r we get
that
1
dz I
I - I < max 1 - Irrr < rrMr-(2n-1)
y, g(z) - y, g(z) - ,
that is,
11m
·
HOO
1y,
- dz
g(z)
=0,
lim
Y.... OO
jY
_Y
ax
Ip(xW
= o.
But this is impossible, since Ip(x)1 2 > O.
•
Proof Eight
Let p(z) = ao + a1z + ... + anz n be a complex polynomial of degree n ::: I
and assume thatp(z) has no zero in C. Then Iftz) I = ptZ) is analytic in «;,
and hence by the mean value inequality we have
ftO) ~ maxaB,lftz)1
for each circular disk By(O), r > 0, centeredonO. By the growth lemma we
have limz.... oo Iftz) I = 0, and hence f(O) = 0 contradictingf(O) = p(Or 1 :f:.
o. •
Our final proofdepends on the maximum minimum modulus prin-
ciple, which we mentioned in section 5.5. This is a consequence again
of the Cauchy integral formula. Here we repeat it and two important
corollaries and then give proof nine.
Theorem C.I
(Maximum Modulus Principle) Suppose fez) is nonconstant and analytic in
a domain U. Then every neighborhood in U ofZo E U contains points z with
If(z)1 > If(zo) I.
Corollary C.I
(Maximum Principle) Suppose fez) is analytic in a domain U and assume
that there is a point Zo E U that is a local maximum for fez), that is, there is
a circular disk Bf(zo) C U with If(z) I ~ IfCzo)1 for all z E Bf(zo)' Then fez)
is constant in U.
Corollary C.2
(Minimum Principle) Suppose fez) is analytic in a domain U and assume
that there is a point Zo E U that is a local minimum for fez), that is, there is a
198 Additional Complex Analytic Proofs
circular disk B€(zo) C V with If(zo) I ::: If(z)1 for all z E B€(zo). Then either
fez) = 0 or fez) is constant in V.
We need one other fact for the ninth proof. Since a polynomial over a
field can have only finitely many zeros (see Chapter 3) it follows that a
nonconstant complex polynomial p(z) can be nowhere locally constant.
That is there cannot be an open region vee withp(z) constant on V,
for if p(z) = c for all z E V, then the polynomial g(z) = p(z) - c has
infinitely many zeros.
Proof Nine
Let p(z) = ao + alz + '" + anz n be a complex polynomial of degree
n ~ 1. For each r > 0 let V r = Br(O) the closed ball of radius r centered
on O. Each V r is a compact set and since p(z) is continuous, \P(z) I has a
minimum at some point Zr E Vr. Since \P(z) I -+ 00 as Izi -+ 00, we can
choose an R sufficiently large such that this minimum point ZR is in the
interior of VR. 1b see this choose an s > 0 so that Ip(z) I > laol = Ip(O) I
for Izi > s. Choose R > s. Then on the boundary of VR we have Ip(z) I >
Ip(O) I, and hence the minimum on the compact set VR must be in the
interior.
Therefore, we have a ZR E Interior V r with Ip(ZR) I ::: \P(z) I for all z E
VR. Since ZR is in the interior, there exists an E > 0 such that VI = B€(ZR),
the closed ball of radius E centered on ZR, is contained in VR. However,
p(z), being everywhere analytic, is analytic on VI, and further, Ip(zr) I :::
\P(z)1 for all z E VI. From the minimum principle either P(ZR) = 0 or p(z)
is locally constant on VI. As mentioned above, a nonconstant polynomial
can be nowhere locally constant, and therefore P(ZR) = O. •
Two More
Thpological Proofs
of the Fundamental
Theorem of
APPENDIX Algebra
Theorem D.I
(Rouche's Theorem) Let f,g;[O,l] -+- C be loops such that !g(t) I < !{(t)! {or
all t. Then { and { + g have the same degree.
199
200 Additional Thpological Proofs
Proof
The conditions insure that both f and f + g map on C - [0]. Consider
R(t, u) = f(t) + ug(t). Then R(t, u) is a homotopy in C from R(t, 0) = f(t)
to R(t, 1) = f(t) + get). The conditions on f(t), get) further guarantee that
there is no choice of (t, u) with R(t, u) = O. Hence R(t, u) is a homotopy
in C - {O}. Therefore, f andf + g are homotopic and hence have the same
degree. •
ProofThn
Let p(z) = ao + alz + ... + anzn be a complex polynomial with an =j:. O.
For each r ::: 0 let Pr : 81 ~ C by Pr(z) = p(rz). Then for any r the map
R(z, u) = p(zru) is a homotopy frompr to Po in C. If the polynomialp(z)
were never zero, this would be a homotopy in C - {O} and hence pr and
Po would have the same degree. Now, Po is a constant so it has degree O.
We will show that there is an R such that PR has degree n. It will follow
that n = 0, and therefore p(z~ is a constant.
Suppose R > max(1, L~:o ~). Then for all z with Izi = R we have
n-I n-I n-I
I Lakzkl : : L lakllzl k ::: (L lakl)lzln-1 < lanllzl n.
k=O k=O k=O
It follows then from Rouche's theorem that PR has the same degree as the
function on 81 sending z to anR nzn, which has degree n. Since PR and Po
have the same degree, it follows that n = 0 and therefore, ifp(z) has no
roots, p(z) is a constant. •
The final proof is similar to the above but without recourse to Rouche's
theorem.
Proof Eleven
As before, let p(z) = ao + alz + ... + anzn be a complex polynomial with
an =j:. O. Without loss of generality we can assume that an = L For each
r ::: 0 let
Ip(r) I p(re )
2Jrit
Ifp(z) has no roots, then each ofthe two fractions in (0.1) is well-defined
and has absolute value 1. Therefore, Pr(t) e 81 for each r ::: 0 and each
t e [0, 1]. Now, Poet) is the constant loop in 81 . The first fraction gives an
initial point to the loop Pr(t), while the second fraction projects the loop
p(re 2Jrit ) in C - {O} onto 8 1 •
Now, letR(t, u) = Pru(t). This gives a homotopy in 81 frompr(t) to poet),
and hence for each r, both pr and Po have degree O. We show that there
is a value R where PR(t) has degree n.
Additional 'Ibpological Proofs 201
Let 8 = max(1, laII. ... , lan-Ii) and let R = (n + 1)8. Letftt) = Re 2rrit
be the circle ofradius R centered on 0 and let q(z) = zn. Then q(f(t)) =
Rne21rnit is a loop with winding number n on a circle of radius R around
the origin. Consider the distance from q(j(t)) to p(j(t)) for any t. We have
Iq(j(t)) - p(j(t))1 = IJU)" - + an_I!(t)"-I + ... + ao)1
(j(t)"
::: lan-I!(t)"-II + + laol
::: lan-Illf(t)"-II + + laol
::: lan_IiR n- I + ... + laol
R nI
< --R - + ... + -R- n n n
< - - R < R.
- n+1 n+1 - n+1
This implies that for each t, p(j(t)) lies in a disk ofradius R n centered on
q(j(t)). In particular, then, there is a line segment fromp(j(t)) to q(j(t)) that
does not go through the origin. This is in essence a version of the fellow-
traveler property mentioned in Chapter 8. This says that the function
K(t, u) = uq(j(t)) + (1 - u)p(j(t)) is never zero for u E [0, 1], t E [0, 1]. Let
IK(O, u)1 K(t, u)
H( t u) - ---'-"- _.....:......---:;....
, - K(O, u) IK(t, u)1 .
By a straightforward computation we have that H(t, 0) = PR(t) , H(t,l)
is the loop ret) = e27rint, which has winding number n, and H(O, u) = 1
for all u. Therefore, H(t, u) is a homotopy in 81 from H(t, 0) = PR(t) to
H(t, 1) = ret). It follows that PR(t) has degree n. Since it also has degree
0, we must have that n = 0 andp(z) is constant. •
Bibliography
and References
1 Quoted References
[A] L. Ahlfors, Complex Analysis, McGraw-Hill, 1966.
[G] N. Gupta, The Burnside Problem, The American Mathematical
Monthly, 1991.
[H-Y] J.G. Hocking and G.S. Young, Thpology, Addison-Wesley, 1961.
[R] R Remmert, Fundamentalsatz der Algebra, zahlen, by H.D. Ebbing-
haus, H. Hermes, F. Hirzebruch, M. Koecher, K. Mainzer, J. Neu-
kirch, A. Prestel, R Remmert, Springer-Verlag, 1983.
[Ra] J.Rotman, Theory of Groups, w.e. Brown - 3rd Edition, 1988.
[U] J.V. Uspensky, Theory of Equations, McGraw-Hill ,1948.
202
2. Suggestions for Other Reading 203
205
206 Index