Beruflich Dokumente
Kultur Dokumente
= H0
3
is a group under quaternion multiplication, and that the norm N is a homomorphism
N : H
>0
from H
to the group R
>0
of positive real numbers under multiplication, whose kernel
ker N = q H
: q q = 1 = t + xi + yj + zk H : t
2
+ x
2
+ y
2
+ z
2
= 1
may be identied with the three-dimensional sphere S
3
R
4
.
1.2 The Lie group S
3
From now on we write
S
3
= q H
: q q = 1.
Thus S
3
is a group under quaternion multiplication, tting into the exact sequence
1 S
3
H
N
R
>0
1.
The group S
3
contains the quaternion group
Q
8
= 1, i, j, k
of order eight as a subgroup, so S
3
is nonabelian. and in fact the center of S
3
has just two elements:
Z(S
3
) = 1,
since this is already the full center of Q
8
. The aim for the rest of this section is to nd the noncentral
conjugacy classes in S
3
.
The subgroup
T = t + xi : t
2
+ x
2
= 1 = e
i
: R
is an abelian subgroup of S
3
, isomorphic to S
1
, the circle group. One checks that
T = C
S
3(i)
is the centralizer of i in S
3
. Let N(T) be the normalizer of T in S
3
.
Lemma 1.1 We have N(T) = T Tj. Thus N(T) consists of two circles, which are cosets of T.
Proof: The elements of order four in T are just i. Hence if q N(T) we have either qiq
1
= i or
qiq
1
= i. The former means that q T. Assume that qiq
1
= i. We note that jij
1
= i as
well, so qj
1
C
S
3i = T, which means q Tj.
We note that Q
8
< N(T) and that
jsj
1
= s = s
1
for all s T.
4
Also Tj = tj + xk : t
2
+ x
2
= 1 lies on the equatorial two sphere
C
0
:= xi + yj + zk : x
2
+ y
2
+ z
2
= 1 S
3
.
The meaning of the subscript 0 is as follows.
As we have dened the norm of a quaternion q to be N(q) = q q, so we dene the trace of q to be
(q) =
1
2
(q + q).
Note that : H R because (q) = (q). In fact we have
(t + xi + yj + zk) = t.
Lemma 1.2 For q S
3
and all p H we have (qpq
1
) = (p).
Proof: Since q S
3
we have q
1
= q. We compute
(qpq
1
) = (qp q) =
1
2
(qp q + qp q) =
1
2
(qp q +
q p q) =
1
2
(qp q + q p q) = q(p) q.
Since (p) R it commutes with q, so we have
(qpq
1
) = (p)q q = (p),
again because q S
3
.
By Lemma 1.2, the restriction of to S
3
is a function
: S
3
[1, 1]
whose level sets
C
t
= p S
3
: (p) = t
are preserved under conjugation by S
3
. For t = 0, the level set is the equatorial two-sphere C
0
men-
tioned above. We have
C
0
= S
3
H
0
,
where
H
0
= Ri Rj Rk = p H : (p) = 0.
more generally, for xed t [1, 1], the level set
C
t
= t +xi + yj + zk : x
2
+ y
2
+ z
2
= 1 t
2
t[1,1]
C
t
. (1)
1
Of course C
1
= 1 and C
1
= 1 are spheres of zero radius.
5
Proposition 1.3 For each t [1, 1], the latitude sphere C
t
is a single conjugacy class in S
3
. Hence
(1) is the partition of S
3
into conjugacy classes.
Proof: It must be shown that S
3
acts transitively on each latitude sphere C
t
. We rst prove this for C
0
.
Following Euler, we write
e
i
= cos + i sin , e
j
= cos + j sin , e
k
= cos + k sin .
Thus we have three subgroups T
i
, T
j
, T
k
< S
3
, all isomorphic to S
1
, given by
T
i
= e
i
: R, T
j
= e
j
: R, T
k
= e
k
: R.
Everything we have said about T
i
(previously called T) holds for the other subgroups. Their normaliz-
ers are
N(T
i
) = T
i
T
i
j, N(T
j
) = T
j
T
j
k, N(T
k
) = T
k
i.
The nontrivial cosets T
i
j, T
j
k, T
k
i are three orthogonal great circles on the two-sphere C
0
. Conjuga-
tion by j, k, i on T
i
T
j
, T
k
is inversion, meaning that
je
i
j
1
= je
i
, ke
j
k
1
= e
j
, ie
k
i
1
j = e
k
.
It follows that T
i
conjugates the coset T
i
j to itself, and likewise for T
j
with T
j
k, and T
k
with T
k
i.
Explicitly, we have
e
i
e
i
j e
i
= e
i(+2)
j, e
j
e
j
k e
j
= e
j(+2)
k, e
k
e
k
j e
k
= e
k(+2)
i.
Now take a point p C
0
and write it in spherical coordiates:
p = sin cos i + sin sin j + cos k.
If we view k as the north pole of C
0
then conjugation by e
j/2
sends k down to a point p
on the same
latitude as p, and then conjugation by e
k/2
sends p
1 t
2
)p.
Then f
t
is bijective, with inverse
f
1
t
(q) =
q t
1 t
2
,
and for all q S
3
we have f
t
(qpq
1
) = f
t
(p). Now the transitivity on C
t
follows from the transitivity
on C
0
, completing the proof.
We can write each t [1, 1] as t = cos for [0, ], and we have the
6
Corollary 1.4 For 0 < < , the conjugacy class C
cos
meets each of T
i
, T
j
, T
k
in two mutually
inverse points. Namely,
C
cos
T
i
= e
i
, e
i
, C
cos
T
j
= e
j
, e
j
, C
cos
T
k
= e
k
, e
k
.
Proof: The sets on the right hand side of each asserted equality consist of the points in T
i
, T
j
, T
k
whose trace is cos .
We now understand conjugacy classes of points in S
3
. The next step is conjugacy of circles. More pre-
cisely, by circle we mean a subgroup S < S
3
such that S S
1
via a continuous group isomorphism.
Lemma 1.5 For R, the subgroup e
i
of S
1
generated by e
i
is nite if 2Q and is dense in
S
1
if / 2Q.
Proof: The group A = e
i
is nite if and only if e
in
= 1 for some n Z, which is equivalent to
having 2Q. So if / 2Q, the subgroup A is innite. We prove that A is in fact dense in S
1
, as
follows.
Let > 0 and subdivide S
1
into equal arcs, starting at 1, of length at most . In the innite set A there
exist distinct points e
in
and e
im
, with m ,= n, lying the same arc. Since A = e
im
A, it contains the
point e
i(nm)
lying in an arc having 1 as an endpoint. The subgroup generated by e
i(nm)
is contained
in A and meets every arc. Hence A is dense in S
1
.
We revert to the notation T = T
i
= e
i
: R.
Proposition 1.6 Every circle in S
3
is conjugate to T.
Proof: Let S be a circle in S
3
. This means S is a subgroup of S
3
and we have a continous group
isomorphism f : S
1
S. Let s = f(e
i
), where R2Q. Then s is dense in S, by Lemma 1.5
and the continuity of f. By Cor. 1.4, there exists q S
3
such that qsq
1
T. The conjugate element
qsq
1
also has innite order, hence the subgroup qsq
1
is dense in T. Letting X denote the closure
of a subset X S
3
, we have
qSq
1
= qsq
1
= qsq
1
= T.
3, while e
i/3
=
1
2
+ i
3,
we see that
1
2
(1 + i + j + k) has order six while
1
2
(1 + i + j + k) has order three.
Elements q =
1
2
(1 i j k) S
3
, with all possible combinations of signs, will conjugate i, j, k
amongst one another, up to sign. That is, such elements q normalize Q
8
and act on Q
8
via outer
automorphisms. These elements, along with Q
8
itself, comprise the normalizer of Q
8
. Thus, N(Q
8
)
consists of the 24 quaternions
N(Q
8
) = 1, i, j, k
_
1
2
(1 i j k)
_
,
and Q
8
is the Sylow 2-subgroup of N(Q
8
). The group N(Q
8
) is also known as the binary tetrahedral
group for reasons that will become clear. One can show that
N(Q
8
) SL
2
(Z/3Z),
the group of 2 2 matrices over Z/3Z with determinant = 1.
1.3 The exponential map for S
3
The exponential map gives a canonical parametrization of compact Lie groups.
The circle group S
1
is parametrized by exponentiating the purely imaginary complex numbers iR.
Thus,
S
1
= e
z
: z iR, where e
z
=
n=0
z
n
n!
.
We can write each purely imaginary complex number z uniquely as z = i, where = [z[ 0, and
we have Eulers formula
e
z
= e
i
= cos i sin ,
8
as one computes by expanding the exponential series. This parameterization of S
1
sends the open
segment
(, )i = i : 0 < = [0, ) (S
1
iR)
bijectively onto S
1
1 and both values i are sent to 1 S
1
. Thus, the map z e
z
glues the
ends of the closed segment [, ]i together, forming a circle S
1
.
Likewise, we can parametrize the 3-sphere S
3
H, by exponentiating the pure quaternions: H
0
=
Ri +R
j
+R
k
. Thus, we dene
exp : H
0
S
3
by exp(v) =
n=0
v
n
n!
.
Let us compute this sum in closed form. As we did with S
1
, we can write v = v
0
, where v
0
S
3
H
0
,
and = [v[. Recall that S
3
H
0
= C
0
is the conjugacy class of elements of order four; these are the
elements of H that behave like i, in that they square to 1. It follows that exp(v) can be computed in
the same way as e
i
. For v
2
=
2
, so for all k 0 we have v
2k
= (1)
k
2k
and v
2k+1
= (1)
k
2k
v.
It follows that
exp(v) =
k=0
(1)
k
2k
(2k)!
+ v
k=0
(1)
k
2k
(2k + 1)!
= cos + v
sin
= cos + v
0
sin . (2)
as with Eulers formula. This is consistent with our earlier denitions of e
i
, e
j
, e
k
; these were values
of exp on the three lines iR, jR, kR in H
0
.
From (2) we observe that exp maps the sphere C
0
H
0
of radius to the conjugacy-class C
cos
.
Proposition 1.8 The map exp : H
0
S
3
has the following properties:
1. exp(H
0
) = S
3
;
2. exp maps the open ball v H
0
: [v[ < = [0, ) C
0
bijectively onto S
3
1;
3. exp(C
0
) = 1 S
3
;
4. We have exp(qvq
1
) = q exp(v)q
1
for all q S
3
and v H
0
.
Proof: Item 1 is implied by items 2 and 3. If q C
cos
then q = cos + q
0
, where q
0
H
0
has
squared-length [q
0
[
2
= 1 cos
2
= sin
2
. The vector v
0
= (sin )
1
q
0
lies in C
0
and exp(v
0
) = q.
Therefore exp(C
0
) = C
cos
.
Suppose v, v
H
0
have exp(v) = exp(v
). Write v = v
0
, v
0
, with ,
[0, ) and
v
0
, v
0
C
0
. Since cos is injective on [0, ), it follows from (2) that =
and v
0
sin = v
0
sin . If
= 0 then v = v
0
, hence v = v
. If v
0
is any point
in C
0
, then exp(v
0
) = cos + v
0
sin = 1. This completes the proof of item 2.
Item 3 follows from the continuity of the map v qvq
1
.
9
2 Rotations of three-dimensional space
2.1 The orthogonal and special orthogonal groups
Let V = R
n
with the inner product
u, v =
n
i=1
u
i
v
i
,
where u
i
, v
i
are the coefcients of u, v with respect to the standard orthonormal basis e
i
of R
n
. This
inner product is positive-denite, meaning that u, u > 0 for all nonzero vectors u R
n
. The length
u is given by
[u[ = u, u
1/2
.
The orthogonal group of V is the subgroup O
n
GL
n
(R) preserving the the lengths of vectors:
O
n
= g GL
n
(R) : [gu[ = [u[ for all u R
n
.
It is useful to recognize when a matrix g belongs to O
n
without having to check the condition [gu[ = [u[
for every vector u R
n
.
Proposition 2.1 On For a matrix g GL
n
(R), the following are equivalent.
1. g O
n
.
2. We have gu, gv = u, v for all u, v R
n
.
3. The columns of g form an orthonormal basis of R
n
.
4. The product of g with its transpose is the identity matrix: g
t
g = I.
Proof: The equivalence of items 1 and 2 results from the formula
u, v =
1
2
_
[u + v[
2
[u[
2
[v[
2
_
.
Applying item 2 to the orthonormal basis e
i
, we get item 3. Conversely, item 3 implies item 2 by
expanding u, v in terms of the basis e
i
. The entry in row i column j of g
t
g is the inner product of
columns i and j of g, whence the equivalence of items 3 and 4.
The condition g
t
g = I implies that det(g) = 1 for all g O
n
. The special orthogonal group is
the subgroup of determinant = 1:
SO
n
= g O
n
: det(g) = 1.
We give O
n
and SO
n
the topology inherited from the Euclidean space M
n
(R) = R
n
2
of n n real
matrices.
10
Proposition 2.2 The subsets SO
n
, O
n
M
n
(R) are compact and SO
n
is connected, while O
n
has two
connected components.
Proof: For 1 i j n, dene functions f
ij
: M
n
(R) R by
f
ij
(g) = g
i
, g
j
ij
,
where g
i
, g
j
are the i
th
and j
th
columns of g M
n
(R), and
ij
= 1 or 0 according as i = j or i ,= j.
Then O
n
is the set of common zeros of all the functions f
ij
, and SO
n
is the subset of O
n
on which the
additional function det 1 is zero. All of these are polynomial, hence continous functions on M
n
(R),
so O
n
and SO
n
are closed. Since the columns of any g O
n
are orthonormal vectors, each entry of g
belongs to [1, 1], hence O
n
is a bounded subset of M
n
(R). Since O
n
and SO
n
are closed and bounded
subsets of M
n
(R), it follows from the Heine-Borel theorem that O
n
and SO
n
are compact.
To prove that SO
n
is connected, we show that every element lies in a connected subgroup. We will use
induction on n. Since SO
1
= 1 and SO
2
= S
1
is a circle, we may assume n 3 and that SO
m
is
connected for m < n.
Let g SO
n
, and let G = g be the closure in SO
n
of the subgroup generated by g. As SO
n
is compact, the group G is also compact. Let C
be an eigenvalue of g. If = 1 then a
corresponding eigenvector v lies in R
n
. Scaling so that [v[ = 1, and choosing an orthonormal basis of
the orthogonal complement of the line Rv, we obtain a matrix h O
n
such that
hgh
1
_
1 0
0 SO
n1
_
,
which is connected, by the induction hypothesis. The conjugate by h of this subgroup is also connected,
so we have found a connected subgroup of SO
n
containing g.
Assume now that g has no eigenvalue equal to 1. Let v = (v
1
, . . . , v
n
) C
n
be an eigenvector of g,
with eigenvalue C
= [u[
1
u and v
= [v[
1
v are an orthonormal basis of a
two-dimensional plane U R
n
.
Hence there exists h O
n
whose rst two columns are u
, v
. We then have
hgh
1
_
SO
2
0
0 SO
n2
_
,
which is connected, by the induction hypothesis. This completes the proof that SO
n
is connected.
Finally, O
n
consists of two cosets of SO
n
, each of which is connected component.
2.2 SO
3
and quaternions
Let us regard R
3
as the space of the pure quaternions:
H
0
= Ri Rj Rk = v H : (v) = 0.
The dot product may be expressed quaternionically as as
u, v =
1
2
(u v + v u). (3)
For q S
3
, let R
q
: H
0
H
0
be the linear map given by
R
q
(v) = qvq
1
.
A familiar calculation using (3) shows that
R
q
(u), R
q
(v) = u, v,
for all u, v H
0
and q S
3
. Therefore R
q
O
3
and we have a continuous homomorphism R : S
3
O
3
, sending q R
q
. Since S
3
is connected, the image of R is connected, and therefore lies in SO
3
,
by Prop. 2.2. Thus, we have a homomorphism
R : S
3
SO
3
, given by q R
q
,
where R
q
(v) = qvq
1
. To see this homomorphism explicitly, let q = a + bi + cj + dk S
3
and
calculate
qiq
1
= (a
2
+ b
2
c
2
d
2
)i + 2(bc + ad)j + 2(bd ac)k
qjq
1
= 2(bc ac)i + (a
2
b
2
+ c
2
d
2
)j + 2(cd + ab)k
qkq
1
= 2(bd + ac)i + 2(cd ab)j + (a
2
b
2
c
2
+ d
2
)k,
12
so the matrix of R
q
with respect to the basis i, j, k is
R
q
=
_
_
a
2
+ b
2
c
2
d
2
2(bc ac) 2(bd + ac)
2(bc + ad) a
2
b
2
+ c
2
d
2
2(cd ab)
2(bd ac) 2(cd + ab) a
2
b
2
c
2
+ d
2
_
_
. (4)
Proposition 2.3 The homomorphism R : S
3
SO
3
is surjective with ker R = 1 equal to the
center of S
3
. In particular, every matrix in SO
3
is of the form (4) for some (a, b, c, d) R
4
with
a
2
+ b
2
+ c
2
+ d
2
= 1.
Proof: We will use the following basic fact about group actions. Suppose a group G acts on a set X,
that H is another group, and that we have a homomorphism f : H G. Assume that the subgroup
f(H) G acts transitively on X and that there exists x X such that f(H) contains the stabilizer
G
x
= g G : g x = x. Then f(H) = G. For if g G, there is h H such that f(h) x = g x,
by the transitivity assumption. Then g
1
f(h) G
x
, so g
1
f(h) = f(k) for some k H, by the
assumption that f(H) G
x
. Thus we have g = f(hk
1
) f(H).
We apply this to the homomorphism R : S
3
SO
3
, where SO
3
acts on the sphere C
0
= S
2
. We have
proved that R(S
3
) acts transitively on C
0
, and that
R(T
i
) =
_
1 0
0 SO
2
_
is the stabilizer of i in SO
3
. It follows that R is surjective.
The kernel of R consists of those q S
3
commuting with every vector in H
0
. Since every quaternion
commutes with R 1 which is the center of H, it follows that ker R is the intersection of S
3
with R 1.
This is the unit sphere in R, that is, ker R = 1.
Remark 1: One can describe Rmore geometrically as follows. If q S
3
, there is a unit vector u H
0
and [, ] such that
q = cos + usin .
Since q commutes with u, it follows that R
q
(u) = u so R
q
is a rotation about the axis through u. To
nd the angle of rotation, we note that for u, v H
0
the quaternionic product uv is given by
uv = u v u v H,
where and are the cross and dot product on R
3
. Note that u v H
0
and u v R. If u v = 0
this reduces to uv = u v, and we compute that
R
q
(v) = (cos + usin )v(cos usin ) = cos(2) + sin(2)(u v).
This shows that R
q
is rotation about u by 2 seen counterclockwise as u points towards you.
Remark 2: The quaternionic interpretation gives an explicit formula for the product of two rotations
in SO
n
. Let S, T SO
3
be rotations by 2, 2 about unit vectors u, v H
0
. Then S = R
p
, T = R
q
,
where
p = cos + usin , q = cos + v sin .
13
Then ST = R
pq
, and we compute
pq = (cos cos ) (sin sin )u v + (sin cos )u + (cos sin )v + (sin sin )u v.
Therefore ST is rotation by angle about the axis through the vector w, where
cos = (cos cos ) (sin sin )u v,
and
w = (sin cos )u + (cos sin )v + (sin sin )u v.
Remark 3: The image under R of the binary tetrahedral group N(Q
8
) is the symmetry group of a
regular tetrahedron, and is isomorphic to the alternating group A
4
. Thus, we have an exact sequence
1 1 N(Q
8
) A
4
1.
This sequence is non-split: there is no subgroup of N(Q
8
) isomorphic to A
4
. In particular, N(Q
8
) and
S
4
are non-isomorphic groups of order 24. Note that the latter ts into another exact sequence (which
is now split)
1 A
4
S
4
1 1.
Remark 4: The cosets of 1 in S
3
are pairs of antipodal points. Each pair determines a line in R
4
,
so the set of antipodal pairs is the real projective space RP
3
. Thus, Prop. 2.3 shows that SO
3
= RP
3
,
as topological spaces. You can also regard RP
3
as the quotient of a solid ball in R
3
by identifying
antipodal points on the boundary. Indeed, every element of SO
3
is rotation about some axis by some
angle [, ]. The axis determines a line segment in the ball B
of radius in R
3
and determines
a point on the axis. Each (, ) gives a unique rotation about this axis, but the two values = ,
corresponding to antipodal points on the boundary of B
: V so
3
dened as follows. For each v V , R
v
so
3
is the skew symmetric matrix acting on V by
R
v
(u) =
d
d
R
exp(v)
(u)[
=0
.
Computing this explicitly using power series, we nd the explicit formula
R
v
(u) = vu uv.
To see R
v
as a matrix, let v = (x, y, z) and compute R
v
(i), R
v
(j), R
v
(k) to nd that
R
v
= 2
_
_
0 z y
z 0 x
y x 0
_
_
= A
2v
.
Finally, one checks that
R exp = exp R
.
That is, the following diagram is commutative.
V
R
so
3
exp
_
exp
S
3
R
SO
3
It follows that exp : so
3
SO
3
is surjective.
Thus, the Lie groups S
3
and SO
3
are parametrized by R
3
via the exponential maps, just as S
1
is
parameterized by R via the usual exponential map. Moreover, the group homomorphism R : S
3
SO
3
lifts to a linear map R
x
: R
n
R
m
such that
lim
h0
1
[h[
[f(x + h) f(x) f
x
(h)] = 0.
If v R
n
has length [v[ = 1 (with respect to the standard Euclidean metric, as we have been using)
then we can take h = tv as t 0 and the formula for f
x
becomes
f
x
(v) = lim
t0
f(x + tv) f(x)
t
.
If v = e
i
is one of the standard basis vectors then f
x
(e
i
) =
f
x
i
(x).
Now the function x f
x
is a function f
: U Hom(R
n
, R
m
) = R
nm
, called the derivative of f.
We say that f : U R
m
is smooth or C
if each of f, f
, f
, U
, M
) where U R
n
and M
: U
is a homeomorphism. Thus,
a
l parametrizes
the open subset M
, U
, M
A
M
. We say the
atlas has dimension n if U
is an open subset of R
n
, with the same n, for every A.
Given two local charts (
, U
, M
) and (
, U
, M
:
1
(M
)
M
R
n
dened on the open set U
=
1
(M
) R
n
. An atlas (
, U
, M
) : A is smooth if
the transition functions
1
x
2
i
= 1. Take the two antipodal points
z
= S
n
z
, and let U
= R
n
. For = , dene
: R
n
S
n
by
(u
1
, . . . , u
n
) =
1
[u[
2
(([u[
2
1), 2u
1
, . . . , 2u
n
)
16
Then
1
: M
R
n
is given by
(x
0
, . . . , x
n
) =
1
1 x
0
(x
1
, . . . , x
n
),
and the transition functions are given by
(u) =
u
[u[
.
This is a smooth function on
1
(M
) = R
n
(0, . . . , 0). Hence the collection (
, R
n
, M
) :
= is a smooth atlas. fs
Let M and N be topological spaces with atlases (
, U
, M
) : A and (
, V
, N
) : B
respectively. A function f : M N is smooth if each composition
:
1
(M
f
1
(N
)) R
m
is smooth. For example, the identity M M is smooth, and the composition of smooth functions is
smooth.
A given topological space M may admit more than one atlas. Two atlases (
, U
, M
) : A
and (
, V
, M
:
1
(M
) R
n
is smooth.
Denition 3.1 An smooth manifold is a Hausdorff
2
topological space M with an equivalence class of
smooth atlases. We say M is n-dimensional if these atlases have dimension n.
The term smooth manifold is interchangeable with differentiable manifold. An equivalence class
of smooth atlases is often called a smooth structure or a differentiable structure. Having a single
smooth atlas on on a Hausdorff topological space M gives a smooth structure on M, via the equivalance
class of the given atlas.
Two manifolds M, N are diffeomorphic if there is a smooth map f : M N having a smooth inverse
f
1
: N M. It is possible for two manifolds to be homeomorphic but not diffeomorphic. That is,
the topological space M may admit more than one For example, the 7-sphere S
7
admits exactly 28
smooth structures.
3
Lemma 3.2 If M and N are smooth manifolds with atlases (
, U
, M
) : A and (
, V
, N
) :
B, then then there is a canonical smooth structure on MN, given by the atlas (
, U
, M
) : A, B.
Proof: Left to the reader!
2
Recall M is Hausdorff if given distinct points x, y M there exist disjoint open subsets U, V M such that x U
and y V .
3
See Milnor, On manifolds homeomorphic to the 7-sphere Annals of Math. 1956, and Brieskorn Beispiele zur Differen-
tialtopologie von Singularit aten Invent. Math. 1966
17
3.2 Manifolds dened by equations
Let f = (f
1
, . . . , f
m
) : R
n
R
m
be a smooth function, where n m, and write n = k + m. Let
M = f
1
(0) = p R
n
: f(p) = 0. The Implicit Function Theorem gives a sufcient condition for
M to be a smooth manifold, in terms of the derivative f
of f. Recall that f
is the m n matrix of
functions
f
=
_
f
i
x
j
_
which at each point p R
n
gives a linear map
f
p
=
_
f
i
x
j
(p)
_
: R
n
R
m
.
Theorem 3.3 (Implicit Function Theorem) Let f : R
n
R
m
be a smooth function, where n m,
and let M = f
1
(0). Suppose f
p
(R
n
) = R
m
. Then there exist open sets U R
k
and V R
m
, and a
unique smooth function g : U V such that p U V and (U V ) M = (u, g(u)) : u U.
Proof: See [Rudins Principles of Mathematical Analysis].
Corollary 3.4 With notation as in Thm. 3.3 assume that f
p
has rank m for all p M. Then M has
the structure of a smooth manifold.
Proof: Our assumption means that for each p M we can choose m columns whose determinant is
nonzero. Fix p M and order the variables so that these columns are the last m columns of f
and
choose U, V, g as in Thm. 3.3. This gives a chart
p
: U
p
M
p
, where U
p
= U, M
p
= (U V ) M,
given by
p
(u) = (u, g(u)). Doing this for each p M gives an atlas (, U
p
, M
p
) : p M. One
checks that for p, q M and u
p
(M
p
M
q
) we have
1
q
p
(u) = u. Hence the atlas is smooth.
Sn
sgn()a
1(1)
a
n(n)
.
Let A
j
be the matrix obtained by deleting row 1 and column j fromA. If det(A) = 1 then det(A
j
) ,= 0
for some j. And
f
a
ij
=
(1)=j
sgn()a
1(1)
a
n(n)
= det(A
j
) ,= 0,
so the matrix f
i
g
ip
g
iq
,
where
pq
= 1 if p = q and is zero otherwise. One checks that f
g
is surjective if g is invertible, in
particular if g O
n
. Hence O
n
is indeed a manifold. Again, the group operations are polynomial, so
O
n
is a Lie group.
Since O
n
= f
1
(0) and each column of a matrix in O
n
is a unit vector, it follows that O
n
is a closed
and bounded subset of M
n
(R), hence is compact. The determinant maps O
n
onto 1, so O
n
is
disconnected. We will see that it has exactly two connected components.
19
Example 4: The Special Orthogonal Group SO
n
is the subgroup of GL
n
(R) preserving orientation,
volume and length. That is,
SO
n
= g SL
n
(R) : [gv[ = [v[ v R
n
= g SL
n
(R) :
t
gg = I.
A matrix lies in SO
n
exactly when its columns form an orthonormal basis having the same orientation
as the standard orthonormal basis e
1
, . . . , e
n
. We will see that SO
n
is connected; it is the component
of O
n
containing the identity matrix.
Example 5: The group R
3
is an abelian Lie group under the operation of addition. But the same
manifold R
3
has another Lie group structure. The Heisenberg group
H
3
:=
_
_
_
_
_
1 x z
0 1 y
0 0 1
_
_
: (x, y, z) R
3
_
_
_
is diffeomorphic to R
3
but is not isomorphic to R
3
as a Lie group. Indeed, H
3
is non-abelian.
4 The Lie algebra of a Lie group
Let G be a Lie group. Imagine moving along paths in G. Each smooth path : R G has a position
(t) and velocity vector
(t) as a vector in R
n
. But G may embed in many Euclidean spaces in many ways, and
we want a home for
(t)). The rst projection : TU U shows the position, and the ber
1
(u) = R
n
is the vector space of all possible velocities at u. Note that TU is a union
TU =
pU
p R
n
.
of bers of . These bers are called tangent spaces. Each tangent space is canonically identied with
the ambient vector space R
n
containing U as an open subset.
20
Let M be a n-dimensional smooth manifold with atlas
, U
, M
: M
M be the inclusion map. The tangent bundle of M is a new manifold TM equipped with
a projection map
TM
M
whose bers
1
(p) are n-dimensional vector spaces which vary smoothly. More precisely, TM is the
set of equivalence classes
TM =
A
(M
R
n
)/ ,
where (x, v) (y, w) if
(x) =
(y) and w = (
1
x
(v).
Thus, points in TM are equivalence classes [x, v]
R
n
, with the understand-
ing that if x M
we have
[x, v]
= [x, (
1
x
(v)]
.
Each M
R
n
= TM
injects into TM. Via the quotient topology, TM has an open cover
TM =
_
A
TM
: U
R
n
TM
by
(u, v) = [
(u), v)]
.
One checks that
(TM
TM
) =
1
(U
) R
n
,
and that
:
1
(U
) R
n
1
(U
) R
n
is given by
1
(
1
, TU
, TM
) :
A.
The projection
M
: TM M is given by
M
([x, v]
is the composition
(U
) R
n
proj
1
(U
(U
),
which is smooth, so that
M
: TM M is smooth.
The tangent space to M at x M is the ber
T
x
M :=
1
M
(x).
If x M
then T
x
M = [x, v]
: v R
n
R
n
, but this isomorphism is non-canonical, as it depends
on .
21
In practice, the tangent bundle usually has a more concrete realization than the rather abstract general
denition just given.
Example: Consider again the n-sphere S
n
= x R
n+1
: [x[ = 1. Then we may realize TS
n
as
TS
n
= (x, v) S
n
R
n+1
: x v = 0, (5)
where x v is the dot product on R
n+1
. To see that this realization is indeed a manifold, we use Cor.
3.4 to express the right side of (5) as f
1
(0), where f = (f
1
, f
2
) = ([x[
2
1, x v). We nd that
f
=
_
2x
1
. . . 2x
n
0 . . . 0
v
1
. . . v
n
x
1
. . . x
n
_
has rank two for all x S
n
, so that f
1
(0) is indeed a manifold.
Proposition 4.1 If M and N are smooth manifolds and f : M N is a smooth map, then there is a
unique smooth map f
//
TN
M
f
//
N
.
If L is another manifold and g : N L is another smooth map then
(g f)
= g
. chain rule.
Proof: Let (
, U
, M
) : A and (
, V
, N
. For v R
n
,
dene f
([x, v]
) = [f(x), (
1
(v)]
([x, v]
commutes with the projections. The last assertion follows from the chain
rule on R
n
.
4.2 Vector elds
Let M be a smooth n-dimensional manifold. A vector eld on M is a smooth map X : M TM
such that
M
X is the identity on M. Intuitively, a vector eld is choice of vector X(p) T
p
M for
each p M, such that X(p) varies smoothly in TM as p varies smoothly in M.
Example: Let M = S
3
, viewed inside H = R
4
as before, and let H
0
be the subspace of H orthogonal
to R. Then for any v H
0
, the function X
v
(p) = pv is a vector eld on S
3
.
Let G be a Lie group. For any g G we have a map
L
g
: G G, given by L
g
(x) = gx,
22
called left translation. This map has a derivative L
g
: TG TG, mapping each tangent space T
x
G
to T
gx
G.
A vector eld X on G is called left-invariant if for all g G the following diagram commutes:
TG
L
g
//
TG
G
X
OO
Lg
//
G
X
OO
.
Let g = T
e
G denote the tangent space to G at the identity element e G. For each v g there is a
unique left-invariant vector eld X
v
: G TG, given by
X
v
(g) = L
g
(v).
Conversely, if X : G TG is any left-invariant vector eld then
X(g) = L
g
X(e),
so X = X
v
, where v = X(e). Thus, the left-invariant vector elds are in canonical bijection with
vectors in g.
4.3 The tangent bundle of a Lie group
An n-dimensional manifold M is parallelizable if there is diffeomorphism
f : M R
n
TM
restricting to a linear isomorphism R
n
= p R
n
T
p
M for each p M. Equivalently, M is
parallelizable if there exist n vector elds X
1
, . . . , X
n
which are linearly independent at each point in
M.
For example, S
n
is parallelizable exactly when n = 1, 3, 7. This is easy to see for S
1
and for S
3
the
vector elds X
i
, X
j
, X
k
(see example above) are linearly independent at each point. For S
7
see section
8.3.
More generally, let G be any Lie group, and choose a basis e
1
, . . . , e
n
of g. Since L
g
: g T
g
G is an
isomorphism, the vector elds X
i
(g) = L
g
(e
i
) are linearly independent in T
g
G for each g G. This
proves:
Proposition 4.2 Any Lie group is parallelizable.
4.4 One-parameter-subgroups
Let M be an n-dimensional manifold. A path in M is a smooth map : I M dened on some open
interval in I R. The derivative of is then a map on tangent bundles
: TI TM.
23
Choose I small enough so that (I) M
, U
, M
) on M. Then TI = I R and
(TI) TM
= M
R
n
U
R
n
. Taking 1 R as a basis vector, we have
R
n
.
Given a smooth vector eld X : M TM we seek a path in M such that
(t, 1) = X((t)),
(0, 1) g = T
e
G.
Lemma 4.5 Let : R G be a one-parameter subgroup, with v :=
g
(v). Then for all t R
we have
(t, 1) = L
(t)
(v) = X
v
((t)).
24
Proof: Regarding R itself as a Lie group, we can extend the vector (0, 1) T
0
R to a left-invariant
vector eld X
(
0, 1) : R TR, given by X
(0,1)
(t) = L
t
(0, 1), where L
t
: R R is the translation
map L
t
(x) = t + x. For all x R we have L
t
(x, 1) = (t + x, 1).
Since is a homomorphism, the diagram
R
Lt
//
G
L
(t)
R
//
G
is commutative. Taking derivatives, we get the commutative diagram
TR
L
//
TG
L
(t)
TR
//
TG
.
We get
(t, 1) =
t
(0, 1) = L
(t)
(v) = X
v
((t)),
proving the lemma.
Thus, a one-parameter subgroup is a global integral of the left-invariant vector eld determined by
(0, 1) g.
Theorem 4.6 For each v g there is a unique one-parameter subgroup
v
: R G such that
(0, 1) = v.
Proof: Suppose and are two one-parameter subgroups such that
0
(t, 1) = X
v
(
0
(t) for [t[ < .
The rst step is to prove that
0
is a local homomorphism. That is,
0
(s + t) =
0
(s)
0
(t), for [s[, [t[ < /2.
Fix s. Since both sides agree at t = 0, it is sufcient to show that both sides solve the equation
f
(t, 1) = X
v
(f(t)).
For the left side, we have
0
(s + t) =
0
L
s
(t) and
[
0
L
s
]
(t, 1) =
0
L
s
(t, 1) =
0
(t + s, 1) = X
v
(
0
(t + s)).
25
For the right side, let g =
0
(s). Then
[
0
(s)
0
(t)]
(t, 1) = [L
g
0
]
(t, 1)
= L
0
(t, 1)
= L
g
X
v
(
0
(t))
= L
g
L
0
(t)
(v) (by Lemma 4.5)
= L
g
0
(t)
(v)
= X
v
(g
0
(t))
= X
v
(
0
(s)
0
(t)),
Thus
0
is indeed a local homomorphism.
The next step is to extend
0
to a homomorphism : R G. Given t R, choose a positive integer
N such that [t/N[ < /2 and set
(t) =
0
(t/N)
N
.
If also [t/M[ < /2 then
0
(t/M)
M
=
0
(t/MN)
MN
=
0
(t/N)
N
,
since
0
is a local homomorphism. Hence is well-dened. Similarly, is a homomorphism: Given
t, s R, choose N such that [t[/N and [s[/N are both < /2. Then
(s + t) =
0
_
s
N
+
t
N
_
N
=
0
(s/N)
N
0
(t/N)
N
= (s)(t).
Next, is smooth: For
0
is smooth on (, ), so (t) =
0
(t/N) is smooth on (N, N). Finally,
(0, 1) =
0
(0, 1) = X
v
(
0
(0)) = X
v
(e) = v.
This completes the proof the theorem.
For v g, we let
v
: R G be the unique one-parameter subgroup such that
v
(0, 1) = v.
4.5 The exponential map
Let G be a Lie group. The Exponential map for G is the map exp = exp
G
: g G dened by
exp(v) =
v
(1).
We wish to show that exp is a smooth map, where the vector space g is regarded as a smooth manifold.
We must examine how
v
varies with v. The simplest situation is when v is scaled.
Lemma 4.7 For all s R, we have
sv
(t) =
v
(st).
26
Proof: Let M
s
: R R be the map M
s
(t) = st. We have M
s
(0, 1) = (0, s), so
(
v
M
s
)
(0, 1) =
v
M
s
(0, 1) =
v
(0, s) = s
v
(0, 1) = sv.
Since
sv
is the unique one-parameter subgroup of G such that
sv
(0, 1) = sv, we have
v
M
s
=
sv
and the lemma follows.
Next we consider the smoothness of v
v
. This requires an extension of the uniqueness and existence
theorem to families of vector elds.
Theorem 4.8 Let U V R
n
R
m
be an open neighborhood of (0, 0) and let X : U V R
n
be a smooth map. Then there exists > 0, a neighborhood V
0
of 0 in V and a unique smooth map
f : (, ) V
0
U such that for all v V
0
the function f
v
(t) := f(t, v) satises the differential
equation
f
v
(t) = X(f
v
(t), v), for [t[ < .
Proof: See references in Adams, page 8.
Proposition 4.9 Let G be a Lie group and let g = T
e
G. The exponential map exp : g G, dened by
exp(v) =
v
(1), is smooth.
Proof: Recall that G is parallelizable, via the map
X : Gg TG given by X(g, v) = L
g
(v).
The one-parameter subgroup
v
(t) solves the differential equation
v
(t, 1) = X(
v
(t), v).
By Thm. 4.8 the map (t, v)
v
(t) is smooth on some neighborhood (, ) V
0
of (0, 0) in R g.
Then
N
1
v
(t/N)
N
2
=
v
(t)
is smooth on (N, N) NV
0
, so (t, v)
v
(t) is smooth on R g. Taking t = 1 we see that exp
is smooth.
Remark 1: For each v g, the image exp(v) =
v
(1) G is a value of the one parameter subgroup
v
. In fact all values of
v
are obtained from exp. For Lemma 4.7 implies that
v
(t) =
tv
(1) = exp(tv).
Remark 2: Beware that exp is not a group homomorphism if G is non-abelian.
We next show that the exponential map is functorial. Let : G H be a Lie group homomorphism
and let g = T
e
G, h = T
e
H be the respective tangent spaces at the identity elements of G and H. Let
0
: g h
be the derivative of . This is a linear map obtained by restricting
: TG TH to g.
27
Proposition 4.10 The following diagram is commutative:
g
0
h
exp
G
_
exp
H
G
H
Proof: In the proof we write
instead of
0
. For all v g we have
(
v
)
(0, 1) =
v
(0, 1) =
(v),
so
v
=
(v)
by Thm. 4.6. Now
(exp
G
(v)) = (
v
(1)) =
(v)
(1) = exp
H
(
(v)),
as claimed.
This is the most useful result in the theory of Lie groups.
4
Example: Let G = GL
n
(R). Since Gis an open subset of M
n
(R) we canonically identify g = M
n
(R).
Take A g and consider the path : R G given by
(t) = e
tA
= I + tA +
t
2
2
A
2
+ G.
Since
p
=
_
f
i
x
j
(p)
_
is invertible at p. Then there exists a neighborhood U
0
of p mapped diffeomorphically by f onto an
open neighborhood of f(p) in R
n
.
4
This claim is really a challenge to nd a result that is even more useful.
5
See Rudin Principles of Mathematical Analysis.
28
The smooth map exp : g G has derivative exp
G
: Tg TG. Restricting to T
0
g = g, we get a
linear map exp
0
: g g. Let v g, and let h : R g be the map h(t) = tv. Then
v
= exp h, so
exp
0
(v) = exp
0
(h
(0)) =
v
(0) = v,
so exp
0
is the identity map, which is invertible. Prop. 4.11 now follows from the Inverse Function
Theorem.
Example: Again let G = GL
n
(R) with g = M
n
(R). A neighborhood of I in G is of the form I + U,
where U is a neighborhood of 0 in g. The series
log(I + u) = u
1
2
u
2
+
1
3
u
3
converges for u near 0 (e.g., for [u[ < 1 when n = 1) and inverts the exponential map: exp(log(I +
u)) = u for u near 0.
Prop. 4.11 shows that the exponential map parametrizes an open neighborhood of the identity in G. It
says nothing about how large this neighborhood is, but that does not matter, thanks to the following.
Proposition 4.13 A connected Lie group G is generated by any open neighborhood of the identity. In
particular G is generated by exp(g).
Proof: This result relies only on the continuity of the group laws, and not on the smooth structure. Let
U be an open neighborhood of e in a connected Lie group G and let H be the subgroup of G generated
by U. Since the multiplication in G is continuous, gU is an open neighborhood of g, for all g G.
Now
H =
_
hH
hU and GH =
_
gGH
gU
are both open in G, so H is both open and closed and nonempty (we have e H). As G is connected,
it follows that H = G.
From Prop. 4.11 we have that exp(g) contains an open neighborhood of e, which generates G, so
exp(g) generates G.
Remark: We will often nd that exp(g) = G, for example if G is compact. However it is not always
so, even if G is connected. For example if G = SL
2
(R) then the image of exp consists of matrices
exp(A) where tr(A) = 0. The eigenvalues , of A sl
2
(R) satisfy + = tr(A) = 0 and
= det(A) R. Hence = and
2
R. It follows that is either real or purely imaginary, and
the eigenvalues e
of A lie on the positive real axis or the unit circle, respectively. Thus, for example
the matrix
_
2 0
0 1/2
_
SL
2
(R)
is not in the image of exp : sl
2
(R) SL
2
(R).
Corollary 4.14 Let G and H be Lie groups with G connected. Then any Lie group homomorphism
: G H is determined by its derivative
0
: g h.
29
Proof: If and are two Lie group homomorphisms from G H with
0
=
0
then applying
functorality Prop. 4.10, we have
exp
G
= exp
H
0
= exp
H
0
= exp
G
.
From Prop. 4.11 it follows that and agree on a neighborhood of e in G. From Prop. 4.13 it follows
that and agree everywhere.
Corollary 4.15 Let : G H be a Lie group homomorphism and assume H is connected. If
is surjective we have
exp
H
(h) = exp
H
(
(g)) = (exp
G
(g)). Hence the image of generates H, so is surjective.
4.6 The Adjoint representation
A representation of a group G is a homomorphism
: G GL(V ), (6)
where V is a vector space and GL(V ) = Aut(V ) is the group of linear automorphisms of V . If
V = R
n
for some n then a choice of basis of V gives an isomorphism GL(V ) GL
n
(R), so a nite
dimensional representation may be regarded as a homomorphism
: G GL
n
(R).
However in most situations there is no natural choice of basis and it is better to think of a representation
in the form (6).
A Lie group G has a canonical representation
Ad : G GL(g)
called the adjoint representation, dened as follows. Take an element g G and let c
g
: G G be
the conjugation map:
c
g
(x) = gxg
1
.
Since c
g
(e) = e, the derivative Ad(g) := c
g
maps g g. And since c
g
is a homomorphism, functoral-
ity Prop. 4.10 gives a commutative diagram
g
Ad(g)
g
exp
_
exp
G
cg
G
30
Since c
gh
= c
g
c
h
, it follows that Ad(gh) = Ad(g) Ad(h). Thus we have a representation
Ad
G
= Ad : G GL(g), g Ad(g).
For v g, the image Ad(g)v is the initial tangent direction of the path t g exp(t)g
1
in G. Thus,
we have
Ad(g)v =
d
dt
_
g exp tvg
1
_
[
t=0
.
Now Ad itself is a Lie group homomorphism; its derivative at e is the linear map
ad : g End(g), u ad(u) = Ad
(u).
For u, v g, the image ad(u)(v) is the initial tangent direction of the path t Ad(exp(tu))v in g.
Thus, we have
ad(u)v =
d
dt
(Ad(exp(tu))v) [
t=0
.
Functorality relates Ad and ad via the commutative diagram
g
ad(g)
End(g)
exp
G
_
exp
GL(g)
G
Ad
GL(g).
Recall that exp
GL(g)
(A) = e
A
= I + A +
1
2
A
2
+ . Thus, the diagram expresses the equality
Ad(exp(x)) = e
ad(x)
GL(g).
It is convenient to use the following alternative notation for ad. The Lie bracket is the mapping
[ , ] : g g g, given by [u, v] = ad(u)(v). (7)
So the adjoint representation Ad : G GL(g) is given bv
Ad(exp(u))(v) = e
ad(u)
(v) = v + [u, v] +
1
2
[u, [u, v]] + .
The adjoint representation Ad and its derivative ad are functorial.
Proposition 4.16 Let : G H be a Lie group homomorphism. Then for every g G the following
diagram commutes
g
h
Ad
G
(g)
_
Ad
H
((g))
g
h
and for all u, v g we have
[
(u),
(v)] =
([u, v]).
31
Proof: For Ad we compute
Ad
G
(g) =
g
= ( c
g
)
= (c
(g)
)
= c
(g)
= Ad
H
((g))
,
making the diagram commute as claimed. Next we compare two power series in t, namely
(Ad
G
(exp(tu)v) =
(v + t[u, v] + ) =
(v) + t
([u, v]) + ,
which is equal to
Ad
H
((exp(tu))
(v) =
(v) + t[
(u),
(v)] + .
Comparing coefcients in t we obtain [
(u),
(v)] =
([u, v]).
Finally, the Adjoint representation is as faithful as possible.
Proposition 4.17 If G is a connected Lie group then the kernel of Ad : G GL(g) is the center of G.
Proof: If z is in the center of G then c
z
is the trivial automorphism of G so its derivative Ad(z) is the
identity map I. Conversely, for any g G and v g we have
g exp(v)g
1
= exp(Ad(g)v).
If Ad(g) = I then g centralizes exp(g), which generates G, so g is in the center of G.
4.6.1 The product rule for paths
At various points we will need to differentiate a product of two paths in a Lie group. We install the
result now for later use.
Let G be a Lie group and let , : R G be two smooth paths in G. We then have a product path ,
given by ()(t) = (t)(t). Our goal is to express the derivative ()
in terms of ,
, ,
. We will
write
(t) instead of
(t, 1) etc.
For g, h G let L
g
, R
h
: G G be the maps L
g
(x) = gx and R
h
(x) = xh. Associativity of the group
law on G means that the maps L
g
and R
h
commute. The product rule is as follows.
Proposition 4.18 For all t R we have
()
(t) = L
(t)
(t) + R
(t)
(t) G
(t)
Proof: Let : GG G be the product map: (g, h) = gh. We compute
: T
g
GT
h
G T
gh
G
32
as follows. A typical element of T
g
G is of the form L
g
(v), where v g = T
e
G. Then
(L
g
(v), 0) =
d
dt
(g exp(tv), h)[
t=0
=
d
dt
[g exp(tv)h] [
t=0
=
d
dt
[L
g
R
h
(exp(tv))]
t=0
= (L
g
R
h
)
(v)
= R
h
L
g
(v)
since R
h
and L
g
commute. Likewise,
(0, R
h
u) = L
g
R
h
(u).
It follows that for all x T
g
G and y T
h
G we have
(x, y) =
(x, 0) +
(0, y) = R
h
(x) + L
g
(y).
Now = (, ) so the chain rule gives
()
(t) =
(t),
(t)) = L
(t)
(
(t)) + R
(t)
(
(t)),
as claimed.
We use this to express the bracket [ , ] on g as the derivative of a commutator of one-parameter sub-
groups in G. Let [g, h] = ghg
1
h
1
be the commutator in G.
Proposition 4.19 For all u, v g we have
[u, v] =
d
dt
_
d
ds
[exp(tu), exp(tv)]
s=0
_
t=0
.
Proof: Using the product rule for the paths (s) = g(exp(sv)g
1
and (s) = exp(sv)
1
= exp(sv),
we nd that
d
ds
[g, exp(sv)]
s=0
= Ad(g)v v,
and the result follows.
Corollary 4.20 For all u, v g we have
[u, v] = [v, u].
Proof: A similar argument shows that
d
dt
[exp(tu), h]
t=0
= u = Ad(h)u.
Interchanging the order of differentiation gives the result.
33
4.7 The Lie algebra
A Lie algebra is a vector space L over a eld F together with a map
[ , ] : L L L
called the bracket, satisfying the following three properties for all x, y, z, w L and a, b, c, d F:
(bilinearity) [ax + by, cz + dw] = ac[x, z] + ad[x, w] + bc[y, z] + bd[u, w];
(skew-symmetry) [x, y] = [y, x];
(Jacobi-identity) [x, [y, z]] = [[x, y], z] + [y, [x, z]].
A homomorphism of Lie algebras L, M is a linear map : L M preserving the bracket:
([x, y]) = [(x), (y)], for all x, y L.
We let Aut(L) denote the group of automorphisms of L.
Proposition 4.21 Let Gbe a Lie group. Then the tangent space g = T
e
G, with bracket [ , ] : gg g
as dened in (7), is a Lie algebra.
Proof: Since ad : g End(g) is linear, it follows that the bracket is bilinear. Skew-symmetry was
proved in Cor. 4.20. It remains to prove the Jacobi identity. Fix t R and u g and consider the map
: G G given by conjugation by exp(tu). We nd, for any v g:
(v),
(0) =
_
_
0 x
(0) z
(0)
0 0 y
(0)
0 0 0
_
_
.
Hence the Lie algebra is
h
3
=
_
_
_
_
_
0 x z
0 0 y
0 0 0
_
_
: x, y, z R
_
_
_
.
We note that A
3
= 0 for all matrices A h
3
, so the exponential map is given by
exp(A) = I + A +
1
2
A
2
,
with inverse
log(I + A) = A
1
2
A
2
.
35
5 Abelian Lie groups
If a Lie group G is abelian, then the adjoint representation is trivial and the Lie bracket on g is
zero. Nevertheless, abelian Lie groups are important for understanding the structure of non-abelian
Lie groups.
Lemma 5.1 If G is an abelian Lie group then the map exp
G
: g G is a group homomorphism.
Two examples of abelian Lie groups are R and S
1
. The main result says that these two account for all
connected abelian Lie groups.
Proposition 5.2 Let G be a connected abelian Lie group of dimension n. Then there is 0 k n
such that G is isomorphic to (S
1
)
k
R
nk
as Lie groups.
Proof: Let u, v g and dene : R G by
(t) = exp(tu) exp(tv).
Then
(t + s) = exp((t + s)u) exp((t + s)v) = exp(tu) exp(su) exp(tv) exp(sv).
Since G is abelian, this is equal to
exp(tu) exp(tv) exp(su) exp(sv) = (t)(s).
Therefore is a one-parameter subgroup. From the product formula we have
(0) =
u
(0) +
v
(0) =
u + v, so in fact (t) = exp(t(u + v)). Therefore, we have
exp(tu) exp(tv) = exp(tu + tv)
so exp is a group homomorphism, as claimed.
Let L = ker exp. Since exp is injective on a neighborhood of 0 g, it follows that L is a discrete
subgroup of g. We may write L = Ze
1
Ze
k
for some k n, and extend to a basis e
1
, . . . , e
n
of
g. This gives an isomorphism g
R
n
sending L Z
k
0
nk
. Hence
G g/L
R
k
R
nk
Z
k
0
nk
(S
1
)
k
R
nk
.
T
is continuous, so (N)
1
(U
k
) is an open subset of R
n
contained in C
k1
. Then C
k
is chosen to be any
cube contained in (N)
1
(U
k
).
Take x
k
C
k
and let t = (x) T. For all k > 1 we have
t
N(k)
= N
k
(x) N
k
(C
k
) U
k
.
Hence the powers t, t
2
, . . . meet every open set in T.
An element of a torus T whose powers are dense in T is called a topological generator of T.
6 Subgroups of Lie groups
6.1 Closed subgroups
Let M be an n-dimensional manifold and let S be a subset of M. We say that S is a submanifold of
M
6
if there exists a subspace W R
n
and an atlas (
, U
, M
= , or
(W U
) = S M
.
6
There are various denitions of submanifold. The one used here is usually called embedded submanifold.
37
We say that such an atlas is good for S.
Remark 1: If S is a submanifold of M then S is a manifold, with charts (
, W U
, S M
) where
S M
,= .
Remark 2: If G is a Lie group and H G is a submanifold and a subgroup then H is a Lie group.
However, there are subgroups of Lie groups which are Lie groups but not submanifolds. An example
is G = R
2
Z
2
and H = [t
2, t] : t R. Here H R is dense in G.
Theorem 6.1 A closed subgroup of a Lie group is a submanifold.
Proof: Let G be a Lie group with Lie algebra g. Put a metric [x[ on g, say by choosing a basis of g
and declaring it to be orthonormal.
Let H be a subgroup of G which is closed in G. The proof requires three lemmas.
We say a vector v g is known by H if there exists a sequence of nonzero vectors (x
n
) g such
that exp(x
n
) H for all n, x
n
0 and [x
n
[
1
x
n
v. Dene
h := tv : t R and v is known by H.
Lemma 6.2 We have exp(h) H.
Proof: Let tv h and let (x
n
) be a sequence for v as above. Choose integers m
n
such that m
n
=
t/[x
n
[ + r
n
, where 0 r
n
< 1. Then m
n
[x
n
[ t m
n
x
n
= m
n
[x
n
[ [x
n
[
1
x
n
tv. We have
exp(m
n
x
n
) = exp(x
n
)
mn
H for all n, so exp(m
n
x
n
) exp(tv) H since H is closed.
Lemma 6.3 h is a vector subspace of g.
Proof: It sufces to show that h is closed under addition. Let x, y h. Choose neighborhoods
0 U g and e V G such that exp : U V is a diffeomorphism, and denote the inverse by
log : V U. Since multiplication is continuous on G we can choose a smaller neighborhood V
1
V
such that V
1
V
1
V .
Choose > 0 such that exp(tx) and exp(ty) lie in V
1
for all [t[ < . Then exp(tx) exp(ty) V , so we
can dene a function f : (, ) g by
f(t) = log(exp(tx) exp(ty)).
We have f(0) = 0 and exp(f(t)) = exp(tx) exp(ty). Differentiating at t = 0, we get
f
(0) = x + y.
38
Let x
n
= f(1/n), for n > 1/. Then exp(x
n
) = exp(x/n) exp(y/n) H by Lemma 6.2, x
n
f(0) = 0 and
[x
n
[
1
x
n
=
1/n
f(1/n)
1
1/n
f(1/n)
1
[x + y[
(x + y).
Hence the latter is known by H so x + y h.
Choose a vector space complement m to h, so that g = h m. Dene : g G by (x, y) =
exp(x) exp(y) where x h and y m. Since = exp on each summand h and m, it follows that
i
u
i
v
i
.
More explicitly,
U
n
= g GL
n
(C) : g
1
=
t
g.
These are closed conditions, so U
n
is a Lie group. In fact, each column of a matrix in U
n
lies in the
unit sphere S
2n1
C
n
and U
n
is compact. Note that U
1
= S
1
.
The Special Unitary Group is the Lie group SU
n
= U
n
SL
n
(C). For n = 2 we have SU
2
= S
3
.
6.2 Homogeneous spaces
Let G be a Lie group and let H be a closed subgroup, and let : G G/H be the projection of G
onto the set G/H = gH : g G of left cosets of H in G. Declare a subset U G/H to be open iff
1
(U) is open in G. This makes an open mapping, and H being closed is equivalent to G/H being
a Hausdorff topological space.
Choose a neighborhoods U, V of 0 g and e G such that exp maps U diffeomorphically onto V .
Recall that g = h m, where m is a vector space complement to h in g. Dene
: m U G/H, by (u) = exp(u).
From Lemma 6.4 it follows that is a homeomorphism onto an open neighborhood of eH in G/H.
Left-translating as above, we obtain an atlas on G/H making G/H into a smooth manifold such that
the projection : G G/H is smooth and
t
x 0
_
: x R
n
_
.
Identifying m = R
n
, Ad
G
[
H
becomes the natural representation of SO
n
on R
n
. The homogeneous
space M = G/H is the sphere S
n
and the map m S
n
given by A
x
exp(A
x
) e
n+1
is spherical
42
coordinates on S
n
. If we change this example slightly and take H = O
n
xing e
n+1
up to sign, then h
and m are unchanged and G/H is the real projective space P
n
(R).
Example 3: The previous example generalizes as follows. Take G = SO
n
and let H = G(O
k
O
)
be the subgroup preserving a k-dimensional subspace U R
n
(and therefore also preserving the
= n k-dimensional orthogonal complement of U). Then
m =
__
0 x
t
x 0
_
: x = k matrix
_
and (h
1
, h
2
) H acts by x h
1
xh
1
2
. The homogeneous space is the Grassmannian of k-planes in
R
n
. If we take instead H = SO
k
SO
is
connected and complex manifolds have a natural orientation, this example has only one version, unlike
the previous examples.
7 Maximal Tori
Let G be a compact Lie group. A maximal torus in G is a subgroup T G which is a torus, and is
contained in no larger torus.
Lemma 7.1 A conjugate of a maximal torus is a maximal torus.
Proof: Let T be a maximal torus in G and let g G. The conjugation map c
g
: G G is a
diffeomorphism of the manifold G and an automorphism of the group G, so gTg
1
is a torus. If U is
a torus containing gTg
1
then g
1
Ug is a torus containing T, which equals T by maximality of T, so
gTg
1
= U and so gTg
1
is a maximal torus.
Let t denote the Lie algebra of a maximal torus T in G. As G is compact, the adjoint representation
Ad
G
decomposes as
g = t m,
where m is the orthogonal complement of t with respect to some Ad
G
-invariant inner product on g (for
example, the negative of the Killing form). We will study the representation of T on m, which is the
tangent space to G/T at eT.
Lemma 7.2 Every continuous nite dimensional irreducible real representation of a torus T is either
trivial (one-dimensional with trivial T-action) or is two-dimensional, of the form
(exp(x)) =
_
cos 2(x) sin 2(x)
sin 2(x) cos 2(x)
_
for a unique linear functional : t R.
43
Proof: Let : T GL(V ) be a nontrivial continuous irreducible real representation. Let t T be a
topological generator. Since T is connected and compact, the eigenvalues of (t) lie on the unit circle
and at least one has innite order. Since (t) is a real matrix, the latter come in complex-conjugate
pairs e
i
. Let V
is a four-dimensional real vector space whose xed-points under complex conjugation are two-
dimensional. Since V is irreducible, this two-dimensional space is all of V . Again, since T is compact
and connected, (T) SO
2
. The functional is the derivative
: t so
2
= R. Since determines
, this functional is unique.
The linear functionals which can arise in Lemma 7.2 are precisely those which take integer values
on the lattice L = ker[exp : t T]. Thus, the irreducible representations of T are parametrized by the
lattice
L
= t
: (L) Z.
Lemma 7.3 A maximal torus T in G is maximal connected abelian.
Proof: Let A be a connected abelian subgroup of G containing T. Since multiplication is continuous,
the closure
A is also abelian and is still connected. Hence
A is a torus, so T =
A and therefore T = A.
R/
m
where m
is given by
ad(x) = 2(x)
_
0 1
1 0
_
.
Example 1: Take G = U
n
, with Lie algebra
g = u
n
= X M
n
(C) :
t
X = X.
The diagonal matrices in G form a maximal torus T U
n
1
. The Lie algebra of T consists of diagonal
matrices with imaginary entries u =
1(u
1
, . . . , u
n
), so t
has basis x
i
where x
i
(u) = u
i
. For
1 i < j n and z C let X
ij
(z) be the matrix with ij-entry equal to z, ji-entry equal to z, and
all other entries zero. Let
m
ij
= X
ij
(z) : z Z.
Then m
ij
is preserved by Ad
G
(T) with roots x
i
x
j
, x
j
x
i
and the decomposition of su
n
with respect
to Ad
G
(T) is
su
n
= t
i<j
m
ij
and the roots of U
n
are given by
R = x
i
x
j
: 1 i ,= j n.
For SU
n
a maximal torus is the det = 1 subgroup of the above T, and su
n
is the tr = 0 subspace of
u
n
. The subspaces m, m
ij
and the roots are unchanged.
Example 2: Take G = SO
2n
(or O
2n
), with Lie algebra
g = so
2n
= X M
2n
(R) :
t
X = X.
Think of matrices in so
2n
as nn matrices where each entry is a 2 2 matrix. For 1 i < j n and
a 2 2 matrix z, let X
ij
(z) be the matrix with ij-entry equal to z, ji-entry equal to
t
z, and all other
entries zero. For 1 i < j n let
m
x
i
x
j
= X
ij
__
x y
y x
__
: x, y R,
m
x
i
+x
j
= X
ij
__
x y
y x
__
: x, y R.
Then m
x
i
x
j
and m
x
i
+x
j
is preserved by Ad
G
(T) with roots (x
i
x
j
) and (x
i
+ x
j
) respectively
and the decomposition of so
2n
with respect to Ad
G
(T) is
so
2n
= t
i<j
(m
x
i
x
j
m
x
i
+x
j
)
45
and the roots of SO
2n
are given by
R = x
i
x
j
: 1 i ,= j n.
Example 3: Take G = SO
2n+1
(or O
2n+1
), with Lie algebra
g = so
2n+1
= X M
2n+1
(R) :
t
X = X.
Then G contains SO
2n
and a maximal torus T of SO
2n
is also a maximal torus of SO
2n+1
. We have
so
2n+1
= so
2n
R
2n
,
as in Example 2 of section 6.3. The action of T = SO
n
2
on R
2n
is a direct sum of the n irreducible
representations obtained by projecting onto each factor of T. Thus, the roots of SO
2n+1
are given by
R = x
i
x
j
: 1 i ,= j n x
i
: 1 i n.
7.1 The Weyl group
Let G be a compact Lie group with maximal torus T. Let N(T) = n G : nTn
1
= T be the
normalizer of T in G. The Weyl group is the quotient
W = N(T)/T.
Proposition 7.5 The Weyl group W is nite.
Proof: We rst show that N(T) is compact. For all t T, dene maps f
t
,
f
t
: G G by f
t
(g) =
gtg
1
and
f
t
(g) = g
1
tg. Then
N(T) =
_
tT
f
1
t
(T)
_
tT
f
1
t
(T)
_
,
so N(T) is closed in the compact group G hence is compact. For each n N(T) we have a commu-
tative diagram
t
Ad(n)
t
exp
T
_
exp
T
T
cn
T.
It follows that Ad(n)L = L, where L = ker exp
T
. This gives a map Ad : N(T) Aut(L) GL
n
(Z),
where n = dimt. Here GL
n
(Z) is the discrete group of n n matrices whose inverse is also integral.
Since N(T) is compact, it follows that Ad(N(T)) is compact and discrete, hence is nite.
46
The kernel of Ad : N(T) Aut(L) is the centralizer C(T) of T in G. Indeed, if Ad(n) is trivial on L
then Ad(n) is trivial on t, since L spans t. For all x t we then have Ad(n)x = x so nexp(x)n
1
=
exp(Ad(n)x) = exp(x) so n C(T). The other containment is clear.
Let H = N(T)
: H
i
(M) H
i
(M) for all i such that if
g : M M is another smooth map we have (f g)
= g
.
iv) If f : M [0, 1] M is a continuous map such that f
t
:= f(, t) is smooth for each t [0, 1]
then f
0
= f
1
.
7
For more details on cohomology of manifolds see Bott and Tu, Differential forms in algebraic topology.
49
v) If f : M M is a smooth map with only nite many xed-points then
n
i=0
(1)
i
tr(f
, H
i
(M)) =
pM
f(p)=p
sgn det(I f
)
p
,
where the right side is understood to be zero if f has no xed-points.
Here, if f(p) = p then f
: TM TM restricts to a map f
: T
p
M T
p
M and det(I f
)
p
is the
determinant of the linear map I f
on T
p
M and sgn det(I f
)
p
+1, 1, 0 is the sign of this
determinant. The formula v) is the Lefschetz xed-point formula, and the alternating sum
L(f) :=
n
i=0
(1)
i
tr(f
, H
i
(M))
is the Lefschetz number of f. In the situation of item iv) we have L(f
1
) = L(f
0
). It may happen that
f
1
has innitely many xed-points but f
0
has only nitely many, so we can apply v) to f
0
to compute
L(f
1
).
If f : M M is the identity map then f
i=0
(1)
i
dimH
i
(M)
is the Euler characteristic (M) of M, which can be computed by triangulating M. For example, any
polyhedron drawn on the sphere S
2
with v vertices, e edges and f faces has v e + f = 2 by Eulers
theorem. Here (S
2
) = 2 and we also have
dimH
0
(S
2
) + dimH
2
(S
2
) = 2.
We can also compute this using the Lefschetz xed-point formula, even though the identity map has
innitely many xed-points. Let f SO
3
be a nontrivial rotation by angle . Then f has two xed-
points on S
2
, and f
p
) = det
_
1 cos sin
sin 1 cos
_
= 2(1 cos ) > 0.
So
pS
2
f(p)=p
det(I f
)
p
= 1 + 1 = 2.
On the other hand, let f
t
be rotation about the same axis of f by t. Then f
1
= f and f
0
= I, so
f
= f
0
is the identity map on H
(S
2
) and we have
L(f) = (S
2
).
50
We generalize this example to prove Thm. 7.6 as follows. Let G be a compact connected Lie group
with maximal torus T. Each g G acts on G/T by L
g
(xT) = gxT. Since g is connected to the
identity e G by a path in G, we have
L(L
g
) = L(L
e
) = (G/T),
independent of g.
Let t T be a topological generator. We have txT = xT if and only if t xTx
1
, which means
T = xTx
1
, so x N(T). Hence the xed-point set of L
t
in G/T is exactly
W = nT : n N(T).
For each n N(T), the derivative map L
n
gives a commutative diagram
T
eT
(G/T)
L
n
T
nT
(G/T)
L
n
1
tn
_
L
t
T
eT
(G/T)
L
n
T
nT
(G/T),
so that
det(I L
t
)
nT
= det(I L
n
1
tn
)
eT
.
Recall the derivative of the projection : G G/T restricts to an isomorphism
: m
T
eT
(G/T).
For any s T this isomorphism transforms Ad
G
(s) on m to the map L
s
on T
eT
(G/T). Let s = n
1
tn
as above and write s = exp(u) for u t. Then
det(I L
n
1
tn
)
eT
=
R/
det(I Ad
G
(s))
m
=
R/
det
_
1 cos 2(u) sin 2(u)
sin 2(u) 1 cos 2(u)
_
=
R/
2(1 cos 2(u)) > 0,
by Lemma 7.4. It follows that
sgn det(1 L
t
)
p
= +1
for every xed-point p of L
t
in G/T. For any g G we therefore have
L(L
g
) = L(L
t
) = [W[ , = 0
so L
g
has a xed-point in G/T. This completes the proof of Thm. 7.6
51
8 Octonions and G
2
In this section we construct the 14-dimensional compact Lie group of type G
2
, using the non-associative
algebra O of octonions. The quaternions will play an important role, but unlike H, the unit sphere in
O is not a group. Instead, G
2
arises as the automorphism group of O. Thus, G
2
is analogous to
Aut(H) = SO
3
.
8.1 Composition algebras
In this section and the next we follow Conway-Smith
8
on the basic equations in composition algebras.
A quadratic form on a real vector space A is a function N : A R such that
N(rx) = r
2
N(x) for all r R and x A.
The function (x, y) =
1
2
[N(x + y) N(x) N(y)] is symmetric and bilinear.
A quadratic form N is positive denite if N(x) > 0 for all nonzero x A.
Let A be an algebra with unit, not necessarily associative, containing R as a subalgebra. We say A is a
composition algebra if there is a quadratic form N : A R which is multiplicative:
N(xy) = N(x)N(y) for all x, y A. (10)
In these notes we always assume without further mention that N is positive denite and call it the norm
on A. Since N takes positive values on A0, we have N(1) = N(1 1) = N(1)
2
so N(1) = 1 and
likewise N(1) = 1. The associated bilinear form is given as above by
(x, y) =
1
2
[N(x + y) N(x) N(y)].
This form is symmetric by denition and satises (x, x) = N(x) > 0 if x ,= 0. This implies that if
(x, t) = 0 for all t A iff x = 0. Replacing x by x y, we have x = y iff (x, t) = (y, t) for all t A.
We often derive equations in A using ( , ) in this way.
Dene the conjugation x x, by
x = 2(1, x) x. (11)
We now have the following equations:
(xy, xz) = N(x) (y, z) = (yx, zx). (12)
That is, left and right multiplication by x preserve the bilinear form, up to the scalar N(x). Equation
(12) is derived from (10) upon replacing y by y + z.
(xy, zw) + (xw, zy) = 2(x, y)(z, w). (13)
8
On quaternions and octonions, A.K. Peters, Natick MA, 2003
52
This follows from (12) upon replacing z w and x x + z. We only use this when (z, w) = 0,
when we get
(xy, zw) = (xw, zy) if (z, w) = 0. (14)
(y, xz) = (xy, z) = (x, z y). (15)
Thus, the adjoint of right multiplication by x is right multiplication by x, and similarly for left multi-
plication. Equation (15) is proved by setting z = 1 in (13).
x = x. (16)
This is proved by setting y = 1 and z = t in (13).
xy = y x. (17)
Proof: use (15) repeatedly to get ( y x, t) = ( x, yt) = ( x
t, y) = (
t, xy) = (
b.
Let t A be arbitrary.
For i) we have
(a(xd), t)
(15)
= (xd, at)
(14)
= ( ad, xt)
(15)
= (x( ad), t) .
For ii) we have
((xb)c, t)
(15)
= (xb, t c)
(23)
=
_
bx, t c
_
(14)
=
_
b c, tx
_
(17)
=
_
(cb), tx
_
(15)
=
_
(cb)x, t
_
(23)
= (x(cb), t)
For iii) we have
((xb)(xd), t)
(15)
=
_
xb, t(xd)
_
(22)
= (xb, t(xd))
(14)
= (x(xd), tb)
(15)
= (xd, x(tb))
(12)
= (d, tb)
(15)
=
_
d
b, t
_
.
54
Equation (24) is not a denition of a product; it is a consequence of the existence of the norm N on A.
However, we can reverse the process: Starting with a composition algebra H with norm N, let x be
a symbol and let H + xH be the set of pairs a + xb with a, b H and multiplication given by (24).
Extending the norm to H + xH by
N(a + xb) = N(a) + N(b)
makes H and xH orthogonal. Now dene the product on H + xH via (24):
(a + xb)(c + xd)
def
= (ac d
b) + x(cb + ad).
Now H + xH will be a composition algebra exactly when
[N(a) + N(b)][N(c) + N(d)] = N(ac d
b) + N(cb + ad).
Expanding and cancelling terms, this will hold exactly when
(cb, ad) = (ac, d
b),
which by (15) is equivalent to
(a(cb), d) = ((ac)b, d)
for all a, b, c, d H, which is equivalent to H being associative. This proves:
Proposition 8.3 Given a composition algebra (H, N), the double H+xH, with product as above and
norm N(a + xb) = N(a) + N(b), is a composition algebra exactly when H is associative.
Starting with Rand the normN(x) = x
2
, the doubling process produces composition algebras C, H, O,
the complex numbers, quaternions and octonions, respectively. There the process ends, because O is
not associative. So these are the only composition algebras, a theorem rst proved by Hurwitz in 1898.
8.3 Parallelizable spheres
We remarked in section 4.3 that the spheres S
n
are parallelizable exactly for n = 1, 3, 7. The proof that
S
3
is parallelizable works in all cases, using multiplication in composition algebras.
Let A be composition algebra of dimension 2
k
. Since N is positive-denite, the set S = x A :
N(x) = 1 of all unit vectors in A is a sphere of dimension 2
k
1. The tangent bundle to S is the
manifold
TS = (s, v) S A : (s, v) = 0.
Let V be the subspace of A orthogonal to 1. Then V = T
1
S is the tangent space to S at 1. More
generally, for any x S, we have (x, vx) = (x x, v) = (1, v) = 0, so V x = T
x
S is the tangent space
to S at s. Hence the function
X
v
: S S A, given by X
v
(x) = vx
takes values in TS and is a smooth vector-eld on S. Since v X
v
(x) is an isomorphism V
T
x
S,
it follows that S is parallelizable.
55
8.4 Automorphisms of composition algebras
An automorphism of a composition algebra A is an R- linear isomorphism : A A such that
(xy) = (x)(y) for all x, y A. The automorphisms of A form a group Aut(A), under composi-
tion.
Lemma 8.4 If Aut(A) then for all x, y A we have
1. N((x)) = N(x);
2. ((x), (y)) = (x, y);
3. (x) = ( x).
Proof: Each item is a consequence of the one before it, so we need only prove item 1.
Let A
0
be the subspace of A orthogonal to R, and let S = x A
0
: N(x) = 1 be the unit sphere in
A
0
. From (19) we have x S if and only if x
2
= 1. It follows that every Aut(A) preserves S.
Now let x A
0
is an arbitrary nonzero element, and let n = N(x)
1/2
. Then y := (n
1
x S so
(y) S so (x) = ny nS A
0
. Hence preserves A
0
and N((x)) = n
2
= N(x). And since
is an R-linear algebra automorphism we have (r) = r for all r R. Since R and A
0
are orthogonal,
it follows that preserves N on all of A.
It follows that Aut(A) is a subgroup of the orthogonal group O(A, N) = g GL(A) : N(g(x)) =
N(x) for all x A. For each x, y A let f
x,y
: O(A, N) A be the function f
x,y
(g) =
g(x)g(y) g(xy). Then G is the common zeros of the continuous functions f
x,y
, so G is closed in
O(A, N). It follows that Aut(A) is a compact Lie group. All cases are tabulated below.
A Aut(A)
R 1
C C
2
H SO
3
O G
2
Let H be a subalgebra of A of dimension half that of A, and let x A be orthogonal to A with
N(x) = 1. Then we have A = H + xH, an orthogonal sum, and the product in A is derived from the
product in H via the formula (24).
The following result shows how to construct automorphisms of A.
Proposition 8.5 Suppose we are given a subalgebra J A, an algebra isomorphism : H J,
and an element y A orthogonal to J. Then extends uniquely to an automorphism of A such that
(x) = y. Conversely, any automorphism of A arises in this way.
56
Proof: We dene by
(a + xb) = (a) + y(b).
This is the only possible extension of to A. It is clearly R-linear and is bijective, since is bijective
and the sums H + xH = J + yJ are both direct. We must show that preserves the product in A.
The key point is that we can apply Prop. 8.2 to both (H, x) and (J, y). Thus, we get
((a + xb) (c + xd))
(24)
=
_
(ac d
b) + x(cb + ad)
_
= (ac d
b) + y(cb + ad)
= (a)(c) (d)(
b) + y ((c)(b) + ( a)(d))
(8.4)
= (a)(c) (d)(b) + y
_
(c)(b) + (a)(d)
_
(24)
= ((a) + y(b)) ((c) + y(d))
= (a + xb) (c + xd).
Conversely, given any automorphism of A, let J = (H), y = (x). Then N(y) = N(x) = 1 and
(y, J) = (x, H) = 0, by Lemma autoconj. Then arises as in the rst part of the Proposition from its
restriction to H.
8.5 The Octonions O
The octonions are the double of the quaternions. Thus we have
O = H+H,
where H is the quaternion algebra with basis 1, i, j, k and is orthogonal to H with
2
= 1. Thus,
O has R-basis
1, i, j, k, , i, j, k.
Since the quaternions are associative the octonions are a composition algebra, with norm
N(x
0
+ x
1
i + x
2
j + x
3
k + x
4
+ x
5
i + x
6
j + x
7
k =
7
i=0
x
2
i
.
For all q H we have q = q by (23), so from (24) the multiplication in O is given by
(p + q)(r + s) = (pr sq) + (q r + sp).
In particular, we have
j(i) = k, i(k) = j, k(i) = j,
and orthogonal elements u, v anticommute: uv = vu.
Let V be the orthogonal complement of 1 in O and let S be the six-dimensional sphere in V :
S = x V : N(x) = 1.
We next show that any pair of orthogonal vectors in S generates a quaternion subalgebra of O. More
precisely, we have:
57
Lemma 8.6 Let u, v S be orthogonal vectors. Then the subspace J O spanned by 1, u, v, uv
is closed under the product in O and the linear map : H J sending 1 1, i u, j v, k uv
is an algebra isomorphism.
Proof: Since N(u) = N(v) = 1 and (u, 1) = (v, 1) = 0, we have u
2
= v
2
= 1 by (19). Likewise
N(uv) = N(u)N(v) = 1 and (1, uv) = ( u, v) = (u, v) = 0, so (uv)
2
= 1.
From (15) we have (uv, u) = (v, uu) = (v, 1) = 0 and likewise (uv, v) = 0. Hence the vectors
1, u, v, uv are orthonormal and uv anticommutes with u and v by (20). From the alternative law (21)
we have v(uv) = v(vu) = (v
2
)u = u and (uv)u = (vu)u = v(u
2
) = v. This shows that
u, v, uv satisfy the same relations as i, j, k so the lemma is proved.
The automorphism group G := Aut(O) is a compact subgroup of O(V ) O
7
. Every element of G
may be constructed as follows. Choose orthogonal vectors u, v S. By Lemma 8.6 these generate a
quaternion subalgebra J, spanned by 1, u, v, uv. Then choose x S orthogonal to u, v, uv. Then by
Prop. 8.5 there is a unique automorphism G such that (i) = u, (j) = v and () = x. Thus, we
may identify G with the manifold of triples
T = (u, v, x) S S S : (u, v) = (x, u) = (x, v) = (x, uv) = 0.
Let : T S be the projection (u, v, x) = u, and let S
u
be the ve-dimensional sphere in S
orthogonal to u. Then
1
(u) may be identied with the set of pairs
T
u
= (v, x) S
u
S
u
: (x, v) = (x, uv) = 0,
and we have dimT = dimS + dimT
u
. Let
u
: T
u
S
u
be the projection
u
(v, x) = v. Then
1
u
(v) is the three-dimensional sphere S
u,v,uv
in S
u
orthogonal to v and uv, and we have dimT
u
=
dimS
u
+ S
u,v,uv
. Therefore
dimG = dimT = dimS + dimS
u
+ dimS
u,v,uv
= 6 + 5 + 3 = 14.
8.6 The SU
3
in Aut(O)
9
Recall that automorphisms preserve the norm, so G = Aut(O) acts on the 6-sphere S = x V :
N(x) = 1.
Proposition 8.7 G acts transitively on S.
Proof: Let x S. Choose u S orthogonal to x and then choose v S orthogonal to each of x, u, ux.
Then (x, uv) = (ux, v) = 0, so x is orthogonal to each of u, v, uv hence there is an automorphism
G sending to x.
Let G
= SU(W).
Since dimG
SU(W).
Let G
. Since U(W), there exist eigenvectors u, v, w for in W which are orthonormal with
respect to the hermitian form , . Since uv is a unit vector orthogonal to both u and v, we may take
w = uv. In fact, if (u) = e
u and (v) = e
u)(e
v)
= (cu + su)(c
v + s
v)
= cc
uv + ss
(u)(v) + cs
u(v) + sc
(u)v
= cc
uv ss
uv cs
(uv) sc
(uv)
= [cos( + ) sin( + )]uv
= e
(+)
uv.
Thus, the eigenvalues of are e
, e
and e
(+)
, so det = 1 and SU(W).
Therefore the homogeneous space G
2
/ SU
3
is equivariantly diffeomorphic to the 6-sphere S = S
6
and
we have
Corollary 8.9 Every automorphism of O xes a point on S and acts on the orthogonal 5-sphere via
an element of SU
3
.
8.7 The maximal torus in Aut(O)
The above proof (with i, j, k instead of u, v, uv) shows that we have a torus T
consisting of
automorphisms which x and send
i e
i, j e
j, k e
k,
where + + = 0. This torus T
is a maximal torus in G
.
59
In fact, T is a maximal torus in G as well. For suppose U is a maximal torus of G containing T. Then
U preserves the xed-point set of T in O, which is the complex plane RR. But U xes R, hence U
must preserve the orthogonal complement R of R in this plane. Since U is connected and preserves
length, it follows that U acts trivially on R, so U G
, so U = T.
The normalizer N(T) must also preserve the xed-point set of T in O, and hence must preserve .
Now N(T) is not connected, so it can change the sign of . For example, the element
2
Aut(O)
acting by +I on H and I on H lies in N(T) and acts on T by
2
t
1
2
= t
1
. The subgroup of N(T)
xing lies in G
, and has index two in N(T). Since the Weyl group of SU(3) is S
3
, it follows that the
Weyl group W = N(T)/T is S
3
1 D
6
.
In general, the extension
1 T N(T) W 1
is not split (eg. in S
3
). In this case however, there is a copy of S
3
inside SU(3) projecting isomorphi-
cally onto the Weyl group of SU(3). This subgroup is generated by the involutions
=
_
_
0 1 0
1 0 0
0 0 1
_
_
,
=
_
_
1 0 0
0 0 1
0 1 0
_
_
.
Hence the group generated by
2
, ,
is isomorphic to D
6
and projects isomorphically onto W.
8.8 The SO
4
in Aut(O)
Let G
H
= G : (H) = H be the stabilizer of the quaternion subalgebra H spanned by
1, i, j, k.
Proposition 8.10 G
H
= SO
4
.
Proof: Since G
H
preserves H, it also preserves the orthogonal complement H. If G
H
acts trivially
on H then G
H
xes so (u) = (u) = u so acts trivially on H as well, so = 1. Thus G
H
acts
faithfully on H (though not on H) and we have an injective homomorphism G
H
O(H) O
4
. To
show that G
H
SO
4
it remains to show G
H
is connected of dimension six.
By restriction we get another map r : G
H
Aut(H) = SO
3
, which is seen to be surjective by taking
H = J = H in Prop. 8.5. Set K = ker r = G : (u) = u u H. An automorphism K
is completely determined by its effect on , and
() = u
,
for some u
) = 1, so u
S
3
H. If K is
another automorphism then since xes u
we have
u
= () = (u
) = u
() = u
(u
) = (u
),
60
so u
is an injective homomorphism K S
3
, which is also surjective by another application of
Prop. 8.5. Thus, G
H
ts into an exact sequence
1 S
3
G
H
SO
3
1.
In particular G
H
is connected and dimG
H
= 6. Hence G
H
SO
4
as claimed.
Remark 1: The proof shows that the intersection
G
G
H
= SO
3
SU
3
.
is the subgroup of SU
3
consisting of real matrices.
Remark 2: From Prop. 8.5, it follows that G
2
acts transitively on quaternion subalgebas of O. Hence
G
2
/ SO
4
is the eight-dimensional manifold of quaternion subalgebras of O and every automorphism
of O stabilizes a quaternion subalgebra.
8.9 The Lie algebra of Aut(O)
The automorphism group G of any nite dimensional R-algebra A is a closed subgroup of GL(A).
Hence the Lie algebra g of G is the Lie subalgebra of End(A) given by
g = D End(A) : e
tD
G for all t R.
Expanding and comparing both sides of the identity (for all a, b A)
e
tD
(ab) = [e
tD
a][e
tD
b]
we nd that D g if and only
D(ab) = D(a)b + aD(b).
Such an operator is called a derivation of A, and
g = Der(A)
is the Lie algebra of derivations on A with the Lie bracket [D, D
] = DD
D inherited from
End(A).
If A is a composition algebra then Aut(A) O(A), the orthogonal group of the norm N : A R,
so Der(A) so(A) is represented by skew-symmetric matrices with respect to an orthonormal basis
of A. Since D(1) = D(1 1) = 2D(1), it follows that D(1) = 0 for any derivation. Hence we in fact
have A so(A
0
).
Returning to the octonions, we see that the Lie group G = Aut(O) has Lie algebra g = Der(O) so
7
.
We will use this representation to nd the roots R and the decomposition
g = t
R/
m
.
61
As a subalgebra of g, the Lie algebra of G
is
D g : D() = 0.
These derivations are linear with respect to the subalgebra C
given by
D(i) = ai, D(j) = bj, D(k) = ck, D() = 0, (25)
and we have
g
= t m
bc
m
ac
m
ab
,
where
m
bc
=
_
_
_
_
_
0 0 0
0 0 z
0 z 0
_
_
: z C
_
_
_
, m
ac
=
_
_
_
_
_
0 0 z
0 0 0
z 0 0
_
_
: z C
_
_
_
, m
ab
=
_
_
_
_
_
0 z 0
z 0 0
0 0 0
_
_
: z C
_
_
_
.
The remaining root spaces consist of derivations which are not C
_
_
_
0 0 0 0 0 0 2y
0 0 0 0 0 0 2x
0 0 0 0 x y 0
0 0 0 0 y x 0
0 0 x y 0 0 0
0 0 y x 0 0 0
2y 2x 0 0 0 0 0
_
_
: x, y R
_
_
m
b
=
_
_
_
_
0 0 0 0 x y 0
0 0 0 0 y x 0
0 0 0 0 0 0 2y
0 0 0 0 0 0 2x
x y 0 0 0 0 0
y x 0 0 0 0 0
0 0 2y 2x 0 0 0
_
_
: x, y R
_
_
m
c
=
_
_
_
_
0 0 x y 0 0 0
0 0 y x 0 0 0
x y 0 0 0 0 0
y x 0 0 0 0 0
0 0 0 0 0 0 2y
0 0 0 0 0 0 2x
0 0 0 0 2y 2x 0
_
_
: x, y R
_
_
62
We therefore have 12 roots
R = a b, b c, a c, a, b, c.
These are linear functionals on
t = (a, b, c) : a + b + c = 0.
To see R more explicitly, identify t with its dual space t
. For example, if = , we get the Cartan subalgebra t in (25). It follows that the
Lie algebra g = Der(O) is a direct sum
g = t
i
t
j
t
k
t
i
t
j
t
k
t