In this section, we introduce the notion of commutator subgroup or derived subgroup of a group. This subgroup is important because it is the smallest normal subgroup such that the quotient group of the original group by this subgroup is abelian. Also, we study other special subgroup of a group called the maximum subgroup.

10.1 Commutators and Derived Subgroups

Let G be a group, and let \(a,b\in G\). The commutator of a and b is \([a,b]=a^{-1}b^{-1}ab\).

“Commutator” is a good word to use: because [ab] is a kind of measure as to how near a to b come to commuting, [ab] being the identity element of G if and only if \(ab=ba\). For any \(a,b,c\in G\), we denote \([a,b,c]=\big [[a,b],c]\big ]\) and \(b^a=a^{-1}ba\).

Theorem 10.1

Let G be a group. For every \(a,b,c\in G\), the following hold:

  1. (1)

    \([ab,c]=[a,c]^b[b,c]\);

  2. (2)

    \([a,bc]=[a,c][a,b]^c\);

  3. (3)

    \([a,b^{-1},c]^b[b,c^{-1},a]^c[c,a^{-1},b]^a=1\).

Proof

(1) We can write

$$ \begin{array}{ll} [ab,c] &{} = (ab)^{-1}c^{-1}abc=b^{-1}a^{-1}c^{-1}abc \\ &{}=b^{-1}(a^{-1}c^{-1}ac)b(b^{-1}c^{-1}bc)=[a,c]^b[b,c]. \end{array} $$

(2) We have

$$ \begin{array}{ll} [a,bc] &{} a^{-1}(bc)^{-1}abc=a^{-1}c^{-1}b^{-1}abc \\ &{} =(a^{-1}c^{-1}ac)c^{-1}(a^{-1}b^{-1}ab)c=[a,c][a,b]^c. \end{array} $$

(3) We can write

$$ \begin{array}{ll} [a,b^{-1},c]^b &{} = b^{-1} \big [ [a,b^{-1}],c\big ]b=b^{-1}[a^{-1}bab^{-1},c]b \\ &{} =b^{-1}(ba^{-1}b^{-1}ac^{-1}a^{-1}bab^{-1}c)b= (a^{-1}b^{-1}ac^{-1}a^{-1})(bab^{-1}cb) \\ &{} =(aca^{-1}ba)^{-1}(bab^{-1}cb). \end{array} $$

Hence, we have

$$\begin{aligned}{}[a,b^{-1},c]^b =(aca^{-1}ba)^{-1}(bab^{-1}cb). \end{aligned}$$
(10.1)

Similarly, we obtain

$$\begin{aligned}{}[b,c^{-1},a]^c =(bab^{-1}cb)^{-1}(cbc^{-1}ac) \end{aligned}$$
(10.2)

and

$$\begin{aligned}{}[c,a^{-1},b]^a =(cbc^{-1}ac)^{-1}(aca^{-1}ba). \end{aligned}$$
(10.3)

Now, by (10.1)–(10.3), the results follows. \(\blacksquare \)

It is perfectly natural to look at the set of all commutators in a group G. This subset may not form a subgroup of G, so we move to the next best thing.

Definition 10.2

Let G be a group. The subgroup generated by the set \(\{[a,b] \ | \ a,b\in G\}\) is called the commutator subgroup or derived subgroup of G. It is denoted by \(G^\prime \) or \(G^{(1)}\) or [GG].

A group G is called perfect if \(G=G^\prime \).

Theorem 10.3

If G is a group, then \(G^\prime \unlhd G\) and \(G/G^\prime \) is abelian.

Proof

For every \(a,b,g\in G\), we have

$$ \begin{array}{ll} [a^g,b^g] &{} =[g^{-1}ag, g^{-1}bg]=(g^{-1}ag)^{-1}(g^{-1}bg)^{-1}(g^{-1}ag)(g^{-1}bg)\\ &{} =(g^{-1}a^{-1}g)(g^{-1}b^{-1}g)(g^{-1}ag)(g^{-1}bg)\\ &{} = g^{-1}(a^{-1}b^{-1}ab)g=g^{-1}[a,b]g=[a,b]^g. \end{array} $$

So, we obtain \([a,b]^g=[a^g,b^g]\). Now let \(x\in G^\prime \) be an arbitrary element. Then, we can write \(x=\prod \limits _{i=1}^n[a_i,b_i]\), where \(a_i,b_i\in G\) for all \(1\le i\le n\). Therefore, we have

$$ g^{-1}xg=\Big (\prod _{i=1}^n[a_i,b_i] \Big )^g=\prod _{i=1}^n[a_i,b_i]^g=\prod _{i=1}^n[a_i^g,b_i^g]\in G^\prime . $$

This shows that \(G^\prime \unlhd G\).

Now, let \(aG^\prime \) and \(bG^\prime \) be arbitrary elements of \(G/G^\prime \). Since \(a^{-1}b^{-1}ab\in G^{\prime }\), it follows that \(abG^\prime =\big (a^{-1}b^{-1}\big )^{-1}G^\prime \), or equivalently \(aG^\prime bG^\prime =bG^\prime aG^\prime \), and so \(G/G^\prime \) is abelian. \(\blacksquare \)

Theorem 10.4

Let G be a group and N be a normal subgroup of G. Then, G/N is abelian if and only if \(G^\prime \le N\).

Proof

Suppose that G/N is abelian. Then, for every \(a,b\in G\), we have \(aNbN=bNaN\). This implies that \(abN=baN\), or equivalently \(a^{-1}b^{-1}ab\in N\). Since N contains every commutator [ab], it follows that \(G^\prime \le N\).

Conversely, let \(G^\prime \le N\), and assume that aN and bN are arbitrary elements of G/N. Since \(a^{-1}b^{-1}ab\in G^\prime \), it follows that \(a^{-1}b^{-1}ab\in N\). Hence, we have \(abN=baN\). This means that \(aNbN=bNaN\), and so we conclude that G/N is abelian. \(\blacksquare \)

Definition 10.5

A group G is called metabelian if there exists a normal subgroup N of G such that both N and G/N are abelian.

Theorem 10.6

A group G is metabelian if and only if \(G^{\prime \prime }=\{e\}\).

Proof

Let G be a metabelian group; we show that \(G^{\prime \prime }=\{e\}\). Since G is metabelian, it has a normal abelian subgroup N, and G/N is abelian. Thus, by Theorem 10.4, \(G^\prime \le N\). Since N is abelian, it follows that \(G^\prime \) is abelian, and so \(G^{\prime \prime }=\{e\}\).

Conversely, suppose that \(G^{\prime \prime }=\{e\}\). We show that G is metabelian. It is clear that if G is abelian, then G is metabelian. So, we have only to consider the case where G is not abelian, and hence \(G^\prime \not = \{e\}\). Thus, assume that G is not abelian. By Theorem 10.3, we know that \(G^\prime \) is a normal subgroup of G. Let \(a,b\in G^\prime \). We know that \(aba^{-1}b^{-1}=e\) since \(G^{\prime \prime }=\{e\}\) and so we have that \(ab=ba\). Thus, arbitrary elements \(a,b\in G^\prime \) commute, and so \(G^\prime \not =\{e\}\) is a normal abelian subgroup of G. Again, by Theorem 10.3, \(G/G^\prime \) is abelian, and so G is metabelian. \(\blacksquare \)

Theorem 10.7

Let G be a group such that \(G^\prime \) is a subset of the center of G. Then, \([a,bc]=[a,b][a,c]\), for all \(a,b,c\in G\).

Proof

We can write

$$ \begin{array}{ll} [a,bc] &{} =a^{-1}(bc)^{-1}abc= a^{-1}c^{-1}(ac)(c^{-1}a^{-1})b^{-1}abc\\ &{} = (a^{-1}c^{-1}ac)c^{-1}(a^{-1}b^{-1}ab)c=[a,c]c^{-1}[a,b]c. \end{array} $$

Since \([a,b]\in G^\prime \) and \(G^\prime \subseteq Z(G)\), it follows that \([a,b]\in Z(G)\). So, we conclude that \([a,b]c=c[a,b]\). Consequently, we get \([a,bc]=[a,c][a,b]=[a,b][a,c]\). This completes the proof. \(\blacksquare \)

More generally, for subgroups H and K of a group G, we denote [HK] the subgroup of G generated by the set \(\{[h,k]\ | \ h\in H \ \mathrm{and} \ k\in K\}\).

Theorem 10.8

If G is a group and HK are subgroups of G, then

  1. (1)

    \([H,K]=[K,H]\);

  2. (2)

    \([H,K]\unlhd H\vee K\).

Proof

(1) Let [hk] be an arbitrary commutator in [HK]. Then, we have

$$ \begin{array}{l} [h,k]=h^{-1}k^{-1}hk=\big (k^{-1}h^{-1}kh \big )^{-1}=[k,h]^{-1}. \end{array} $$

Since \([K,H]\le G\) and \([k,h]\in [K,H]\), it follows that \([k,h]^{-1}\in [K,H]\). This implies that \([h,k]\in [K,H]\). So, we have \([H,K]\subseteq [K,H]\). In a similar way, we obtain \([K,H]\subseteq [H,K]\). Therefore, we conclude that \([H,K]= [K,H]\).

(2) Since \(H\vee K\) is the subgroup generated by \(H\cup K\), it follows that \([h,k]\in H\vee K\), for all \(h\in H\) and \(k\in K\). This shows that [HK] is a subgroup of \(H\vee K\). Now, let [hk] be an arbitrary commutator in [HK] and \(x\in H\). Then, by Theorem 10.1 (1), we have \([hx,k]=[h,k]^x[x,k]\), and so \([h,k]^x=[hx,k][x,k]^{-1}\). Since [hxk] and \([x,k]^{-1}\) belong to [HK], it follows that \([h,k]^x\in [H,K]\). Also, let \(y\in K\), then by Theorem 10.1 (2), we can write \([h,ky]=[h,y][h,k]^y\), and hence \([h,k]^y=[h,y]^{-1}[h,ky]\). Since \([h,y]^{-1}\) and [hky] lies in [HK], it follows that \([h,k]^y\in [H,K]\). Therefore, we conclude that \([H,K]\unlhd H\vee K\). \(\blacksquare \)

Theorem 10.9

If G is a group and HK are normal subgroups of G, then \([H,K]\le H\cap K\).

Proof

Suppose that \(h\in H\) and \(k\in K\) are arbitrary. Since \(H\unlhd G\), it follows that \(h^{-1}(k^{-1}hk)\in H\). Similarly, since \(K\unlhd G\), it follows that \((h^{-1}k^{-1}h)k\in K\). Thus, we obtain \([h,k]\in H\cap K\). Consequently, we have \([H,K]\le H\cap K\). \(\blacksquare \)

Theorem 10.10

If H, K, and N are normal subgroups of a group G, then \([HK,N]=[H,N][K,N]\).

Proof

Assume that \(a\in H\), \(b\in K\) and \(c\in N\). Then, by Theorem 10.1 (1), we can write

$$\begin{aligned}{}[ab,c]=[a,b]^b[b,c]=[a^b,c^b][b,c]. \end{aligned}$$
(10.4)

Since H and N are normal subgroups of G, it follows that \([a^b,c^b]\in [H,N]\). On the other hand, we have \([b,c]\in [K,N]\). Now, by (10.4), we conclude that \([HK,N]\subseteq [H,N][K,N]\).

Conversely, since H and K are subgroups of HK, it follows that

$$ \begin{array}{l} [H,N]\subseteq [HK,N]\ \mathrm{and} \ [K,N]\subseteq [HK,N]. \end{array} $$

Consequently, we get \([H,N][K,N]\subseteq [HK,N]\), and this completes the proof. \(\blacksquare \)

Exercises

  1. 1.

    Let G be a group and H be a cyclic subgroup of G. If \(G^\prime \le H\), show that \(H\unlhd G\).

  2. 2.

    For any group G, show that \(G^\prime \) is the subset of all “long commutators”:

    $$ \begin{array}{l} G^\prime =\{a_1a_2\ldots a_na_1^{-1}a_2^{-1}\ldots a_n^{-1} \ | \ a_i\in G \ \mathrm{and} \ n\ge 2\}. \end{array} $$

    Hint: \((aba^{-1}b^{-1})(cdc^{-1}d^{-1})=a (ba^{-1})b^{-1}c(dc^{-1})d^{-1}a^{-1}(ab^{-1})bc^{-1}\) \((cd^{-1})d\).

  3. 3.

    Let G be a group and suppose that H and K are subgroups of G. Prove that

    1. (1)

      \(H\unlhd G\) if and only if \([H,G]\le H\);

    2. (2)

      If K is a subgroup of H and \(K\unlhd G\), then \([H,G]\le K\) if and only if \(H/K\le Z\big (G/K \big )\).

  4. 4.

    Let \(G=HK\), where H and K are abelian subgroups of G. Prove that \(G^\prime \) is abelian.

  5. 5.

    Let N be a normal subgroup of a group G such that \(N\cap G^\prime =\{e\}\). Show that \(N\le Z(G)\).

  6. 6.

    If H is a subgroup of a metabelian group G, prove that H is metabelian.

10.2 Derived Subgroups of Some Special Groups

In this section, we investigate the derived subgroups of symmetric groups, alternating groups, quaternion group, general linear groups, special linear groups, and dihedral groups.

Lemma 10.11

Let \(n\ge 5\) and \(N\unlhd A_n\). If N contains a cycle of length 3, then \(N=A_n\).

Proof

Suppose that \((x \ y \ z)\in N\). By Theorem 5.43, each permutation in \(A_n\) is a product of cycles of length 3. Hence, it suffices to prove that every cycle of length 3 lies in N. Assume that \((a \ b \ c)\in S_n\) is an arbitrary cycle. Take \(\sigma \in S_n\) such that \(x \sigma =a\), \(y\sigma =b\) and \(z\sigma =c\). Then, we have \(\sigma ^{-1} (x \ y \ z)\sigma =(a \ b \ c)\).

If \(\sigma \in A_n\), then \((a \ b \ c)\in N\), because \(N\unlhd A_n\).

If \(\sigma \not \in A_n\), then \(\sigma \) is an odd permutation. Since \(n\ge 5\), we can consider r and s distinct from x, y, and z. So, we have \((r \ s)\sigma \in A_n\). Consequently, we have \(\big ( (r \ s)\sigma \big )^{-1}(x \ y \ z)(r \ s) \sigma \in N\), or equivalently \(\sigma ^{-1}(r \ s) (x \ y \ z) (r \ s) \sigma \in N\). Since \((r \ s)\) and \((x \ y \ z)\) are disjoint cycles, it follows that \(\sigma ^{-1}(x \ y \ z)\sigma \in N\). This shows that \((a \ b \ c)\in N\). \(\blacksquare \)

Theorem 10.12

For all \(n\ge 2\), \(S^\prime _n=A_n\).

Proof

Let \(n\ge 2\) and suppose that \(\sigma \) and \(\tau \) are two arbitrary permutations in \(S_n\). Let \(\sigma =\sigma _1\sigma _2 \ldots \sigma _r\) and \(\tau =\tau _1 \tau _2 \ldots \tau _s\), where \(\sigma _i\)’s and \(\tau _i\)’s are transpositions. Then, we have

$$ \begin{array}{l} [\sigma , \tau ]=\sigma ^{-1}\tau ^{-1}\sigma \tau = \sigma _r \ldots \sigma _2\sigma _1 \tau _s \ldots \tau _2 \tau _1\sigma _1\sigma _2 \ldots \sigma _r\tau _1 \tau _2 \ldots \tau _s. \end{array} $$

Hence, \([\sigma , \tau ]\) is a product of \(2(r+s)\) transpositions, and so it is an even permutation. This shows that \([\sigma , \tau ]\in A_n\). Therefore, we conclude that \(S^\prime _n\le A_n\).

Now, in order to show that \(A_n=S^\prime _n\), we consider the following cases:

Case 1: \(n=2\). In this case, clearly, we have \(S^\prime _n=A_2=\{id\}\).

Case 2: \(n=3\). We have \(A_3=\{ id, \ (1 \ 2 \ 3), \ (1 \ 3 \ 2)\}\). Since there are permutations in \(S_3\) that don’t commute, it follows that the derived subgroup is not just \(\{id\}\). Hence, it must be all of \(A_3\).

Case 3: \(n=4\). For distinct \(i,j,k\in \{1, \ 2, \ 3, \ 4\}\), we have

$$ \begin{array}{l} [(i \ j), (i \ k)]=(i \ j)(i \ k)(i \ j)(i \ k)=(i \ j \ k)^2=(i \ k \ j), \end{array} $$

that is, every cycle of length 3 belongs to derived subgroup. Therefore, we conclude that \(S^\prime _4=A_4\).

Case 4: \(n\ge 5\). We know that \(S^\prime _n\) is a normal subgroup of \(S_n\). Moreover, we proved that \(S_n^\prime \le A_n\). On the other hand, we have

$$ \begin{array}{l} [(1 \ 2), (2 \ 3)]=(1 \ 2)(2 \ 3)(1 \ 2)(2 \ 3)=(1 \ 2\ 3)\in S^\prime _n. \end{array} $$

This means that \(S^\prime _n\) contains a cycle of length 3, and so by Lemma 10.11, we deduce that \(S^\prime _n=A_n\). \(\blacksquare \)

Theorem 10.13

We have

  1. (1)

    \(A^\prime _2=A^\prime _3=\{id\}\);

  2. (2)

    \(A^\prime _4=\{id, \ (1 \ 2)(3 \ 4), \ (1 \ 3)(2 \ 4), \ (1 \ 4)(2 \ 3)\}\);

  3. (3)

    \(A^\prime _n=A_n\), for all \(n\ge 5\).

Proof

(1) It is straightforward.

(2) Assume that \(N=\{id, \ (1\ 2)(3 \ 4), \ (1 \ 4)(2 \ 3), \ (1 \ 3)(2 \ 4)\}\). It can be checked that N is a normal subgroup of \(A_4\). Since \(|A_4/N|=3\), it follows that \(A_4/N\) is abelian, and so by Theorem 10.4, we have \(A^\prime _4\le N\). Since \(A_4\) is non-abelian, it follows that \(A^\prime _4\not = \{id\}\). Hence, there exists a non-identity permutation \((a \ b)(c \ d)\) in \(A^\prime _4\). Since \(A^\prime _4\unlhd A_4\) and every product of disjoint transpositions are conjugate in \(A_4\), we conclude that \(A^\prime _4=N\).

(3) Since \(A^\prime _n\unlhd A_n\), by Lemma 10.11, it is enough to prove that \(A^\prime _n\) contains a cycle of length 3. Since \(n\ge 5\), assume that \(\sigma =(a \ c \ b)\) and \(\tau =(b \ c)(d \ e)\). Then, we have \([\tau \ \sigma ]=\tau ^{-1}\sigma ^{-1}\tau \sigma =(\tau ^{-1}\sigma ^{-1}\tau )\sigma \). Also, we have

$$ \begin{array}{l} \tau ^{-1}\sigma ^{-1}\tau =\tau ^{-1}(a \ b \ c) \tau = (a \tau \ b\tau \ c\tau )=(a \ c \ b)=\sigma . \end{array} $$

Consequently, we obtain \([\tau , \sigma ]=\sigma ^2=(a \ b \ c)\) as desired. \(\blacksquare \)

Theorem 10.14

If \(Q_8\) is the quaternion group, then \(Q_8^\prime =\{1,1\}\).

Proof

We know that \(Q_8=\{1, \ -1, \ I, \ -I, \ J, \ -J, \ K, \ -K\}\). The commutator subgroup contains the element

$$ \begin{array}{l} [I,J]=I^{-1}J^{-1}IJ=(-I)(-J)IJ=(IJ)(IJ)=K^2=-1. \end{array} $$

Similarly, \([J,K]=-1\) and \(K,I]=-1\). On the other hand, 1 and \(-1\) commute with all elements of \(Q_8\), hence \([a,-1]=[a,1]=1\), for all \(a\in Q_8\). Therefore, the commutator subgroup is the subgroup of \(Q_8\) generated by \(-1\) and 1, which is \(Q_8^\prime =\{1,1\}\). \(\blacksquare \)

Theorem 10.15

Let \(\mathbb {F}\) be a field and \(n\ge 2\) be an integer. Then,

$$ \begin{array}{l} SL_n(\mathbb {F})^\prime = GL_n(\mathbb {F})^\prime =SL_n(\mathbb {F}), \end{array} $$

except when \(n=2\) and \(\mathbb {F}\) consists of 2 or 3 elements. Thus, \(SL_n(\mathbb {F})\) is perfect (with the exceptions listed).

Proof

Let A and B are two arbitrary matrices in \(GL_n(\mathbb {F})\). Then, we have \(A^{-1}B^{-1}\) \(AB\in GL_n(\mathbb {F})^\prime \). Since

$$ \begin{array}{l} \det (A^{-1}B^{-1}AB)=\det (A^{-1})\det (B^{-1}) \det (A) \det (B)=1, \end{array} $$

it follows that \(A^{-1}B^{-1}AB\in SL_n(\mathbb {F})\). This yields that \(GL_n(\mathbb {F})^\prime \subseteq SL_n(\mathbb {F})\). On the other hand, since \(SL_n(\mathbb {F})\subseteq GL_n(\mathbb {F})\), it follows that \(SL_n(\mathbb {F})^\prime \subseteq GL_n(\mathbb {F})^\prime \). So, we have \(SL_n(\mathbb {F})^\prime \subseteq GL_n(\mathbb {F})^\prime \subseteq SL_n(\mathbb {F})\). Consequently, it is enough to show that \(SL_n(\mathbb {F})^\prime = SL_n(\mathbb {F})\). To establish this equality, by Theorem 7.41, it is enough to show that each matrix of the form \(E_{ij}(\lambda )\) with \(i\not =j\) and \(\lambda \in \mathbb {F}^*\), is a product of commutators. It is easy to check that for any distinct indices ijk,

$$ \begin{array}{l} \ [E_{ik}(\lambda ), E_{kj}(1)]=E_{ik}(-\lambda )E_{kj}(-1)B_{ik}(\lambda )E_{kj}(1)=E_{ij}(\lambda ). \end{array} $$

his expresses each \(E_{ij}(\lambda )\) as a commutator when \(n\ge 3\). For \(n=2\), we have

$$ \begin{array}{ll} \ \left[ \left[ \begin{array}{lc} a^{-1} \ &{} \ 0 \ \\ \ 0 \ &{} \ a \ \end{array} \right] , \left[ \begin{array}{cc} 1 \ &{} \ b \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] \right] &{} = \left[ \begin{array}{cc} a^{-1} \ &{} \ 0 \ \\ \ 0 \ &{} \ a \ \end{array} \right] ^{-1} \left[ \begin{array}{cc} 1 \ &{} \ b \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] ^{-1} \left[ \begin{array}{cc} a^{-1} \ &{} \ 0 \ \\ \ 0 \ &{} \ a \ \end{array} \right] \left[ \begin{array}{cc} 1 \ &{} \ b \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] \\ &{} = \left[ \begin{array}{cc} a \ &{} \ 0 \ \\ \ 0 \ &{} \ a^{-1} \ \end{array} \right] ^{-1} \left[ \begin{array}{cc} 1 \ &{} \ -b \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] ^{-1} \left[ \begin{array}{cc} a^{-1} \ &{} \ 0 \ \\ \ 0 \ &{} \ a \ \end{array} \right] \left[ \begin{array}{cc} 1 \ &{} \ b \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] \\ &{} =\left[ \begin{array}{cc} 1 \ &{} \ b(1-a^2) \ \\ \ 0 \ &{} \ 1 \ \end{array} \right] . \end{array} $$

So, if \(\mathbb {F}\) contains an element a such that \(a\not =0\) and \(a^2\not =1\), then we can express any matrix \(E_{12}(\lambda )\) as a commutator by taking \(b=(1-a^2)^{-1}\lambda \), and similarly for \(E_{21}(\lambda )\). This yields that \(SL_n(\mathbb {F})^\prime =SL_n(\mathbb {F})\) except when \(n=2\) and \(a^3=a\), for all \(a\in \mathbb {F}\), and this happens only when \(|\mathbb {F}|=2\) or 3. \(\blacksquare \)

Theorem 10.16

If \(D_n\) is the dihedral group of order 2n, then by Corollary 7.74, we have \(D_n=\langle R,S\rangle \), where S is a reflection and R is a rotation. Then, \(D^\prime _{2n}=\langle R^2\rangle \).

Proof

Since [SR] is \(S^{-1}R^{-1}SR=SSRR=R^2\), it follows that \(R^2\) is a commutator. Moreover, since \([S,R^i]=S^{-1}R^{-i}SR^i=SSR^iR^i=R^{2i}\), it follows that every element of \(\langle R^2 \rangle \) is a commutator. Now, we prove that every commutator belong to \(\langle R^2\rangle \). Suppose that A and B are two arbitrary elements of \(D_{2n}\). We consider the following cases:

Case 1: Both A and B are rotations. In this case, we can write \(A=R^i\) and \(B=R^j\), for some integers i and j. Since \(R^i\) and \(R^j\) commute, it follows that \([A,B]=I_n\).

Case 2: A is a rotation and B is a reflection. Then, we have \(A=R^i\) and \(B=R^jS\), for some integers i and j. Since \(B=B^{-1}\), it follows that

$$ \begin{array}{l} [A,B]=A^{-1}B^{-1}AB=A^{-1}BAB=R^{-i}R^jSR^iR^jS=R^{j-i}SR^{i+j}S. \end{array} $$

By Theorem 7.75, since \(RS=SR^{-1}\), it follows that \([A,B]=R^{-2i}\in \langle R^{2} \rangle \).

Case 3: A is a reflection and B is a rotation. In this case, we have \([A,B]^{-1}=B^{-1}A^{-1}BA\). By the case (2), \([A,B]^{-1}\in \langle R^{2} \rangle \), and this implies that \([A,B]\in \langle R^{2} \rangle \).

Case 4: Both A and B are reflections. Then, we have \(A=R^iS\) and \(B=R^jS\). Hence, we obtain \(A=A^{-1}\) and \(B=B^{-1}\). So, we conclude that

$$ \begin{array}{ll} [A,B]&{} =A^{-1}B^{-1}AB=ABAB=(AB)^2=(R^iSR^jS)^2\\ &{} =(R^iR^{-j}SS)^2=R^{2(i-j)}\in \langle R^{2} \rangle . \end{array} $$

This completes the proof. \(\blacksquare \)

Exercises

  1. 1.

    Compute \([GL_n(\mathbb {F}):SL_n(\mathbb {F})]\), where \(\mathbb {F}\) is finite.

  2. 2.

    Suppose that \(O_n(\mathbb {F})=\{A\in GL_n(\mathbb {F}) \ | \ A^tA=I_n\}\) and \(SO_n(\mathbb {F})=\{A \in O_n(\mathbb {F})\ |\) \(\det (A)=1\}\). Prove that \(SO_n(\mathbb {F}) \unlhd O_n(\mathbb {F})\) and compute \([O_n(\mathbb {F}) :SO_n(\mathbb {F})]\).

  3. 3.

    Determine the derived subgroup of \(SO_n(\mathbb {F})\).

10.3 Maximal Subgroups

A maximal subgroup of a given group G is a proper subgroup of G, that is, a subgroup which is neither the identity nor G itself, and in order to be maximal, it can not be contained in a larger proper subgroup of G. More precisely:

Definition 10.17

A subgroup M of a group G is called a maximal subgroup if \(M\not =G\) and the only subgroups containing M are M and G.

Example 10.18

We know that the subgroups of \(\mathbb {Z}\) are of the form \(n\mathbb {Z}\). Note that \(n\mathbb {Z}\subseteq m\mathbb {Z}\) if and only if m|n, and so the maximal subgroups of \(\mathbb {Z}\) are precisely \(p\mathbb {Z}\) for p prime.

Example 10.19

In each Hasse diagram, the lower subgroup is a maximal subgroup in the upper one.

Theorem 10.20

If H is a proper subgroup of a finite group G, then there is a maximal subgroup of G containing H.

Proof

If H is a maximal subgroup of G, then we have nothing to do. If H is not a maximal subgroup, we pick a proper subgroup \(H_1\) of G such that \(H\subset H_1\). Again, if \(H_1\) is maximal in G,then we are done; otherwise, we pick a proper subgroup \(H_2\) of G such that \(H\subset H_1\subset H_2\). Continuing in this way, we construct a chain of proper subgroups of G:

$$ \begin{array}{l} H\subset H_1\subset H_2\subset \ldots , \end{array} $$

and this gives a strictly increasing chain of integers

$$ \begin{array}{l} |H|<|H_1|<|H_2| < \ldots , \end{array} $$

which is bounded above by |G|, and so must terminate. Consequently, we obtain a maximal subgroup containing H. \(\blacksquare \)

We conclude that in non-trivial finite groups, maximal subgroups will always exist. However, not all groups will have maximal subgroups.

Example 10.21

The rational numbers \(\mathbb {Q}\) under addition have no maximal subgroups. To see the reason, suppose that M is a maximal subgroup of \(\mathbb {Q}\). There exists a rational number r/s (\(r,s\in \mathbb {Z}\) and \(s\not =0\)) such that \(r/s\not \in M\). Since \(\mathbb {Z}\) is a proper subgroup of \(\mathbb {Q}\), it follows that \(M\not =\{0\}\). Let \(m/n\in M\) with \(m,n\in \mathbb {Z}\) and \(mn\not =0\). Since \(m/n\in M\), it follows that \(nm/n\in M\), and so \(m\in M\). Since M is a maximal subgroup, it follows that

$$\begin{aligned} M+\langle r/s \rangle =\mathbb {Q}. \end{aligned}$$
(10.5)

Consider \(r/sms \in \mathbb {Q}\). Then, by (10.5), there exist \(h\in M\) and \(t\in \mathbb {Z}\) such that \(r/sms=h+tr/s\). Hence, we have \(r/s=msh+tmr=m(sh+tr)\). Since \(m\in M\), it follows that \(r/s\in M\). This is a contradiction.

Theorem 10.22

The center of a group G is properly contained in every maximal subgroup of G having a composite index.

Proof

Let M be a maximal subgroup of G with composite index. Suppose that Z(G) is not contained in M. Then, there exists \(a\in Z(G)\) such that \(a\not \in M\). We consider \(\langle M, a\rangle \), the subgroup generated by \(M\cup \{a\}\). Since M is a maximal subgroup and \(M\subset \langle M, a\rangle \), it follows that \(\langle M, a\rangle =G\). Assume that \(x\in G\) is an arbitrary element. Since \(a\in Z(G)\), it follows that \(x=a^kb\), for some \(b\in M\) and \(k\in \mathbb {Z}\). Now, for every \(y\in M\), we have

$$ \begin{array}{l} xyx^{-1}=(a^kb)y(a^kb)^{-1}=a^kbyb^{-1}a^{-k}=byb^{-1}, \end{array} $$

because \(a\in Z(G)\). Since \(b,y\in M\), it follows that \(byb^{-1}\in M\). This means that \(xyx^{-1}\in M\), and so M is a normal subgroup of G. Since G/M has composite order, we conclude that G/M has a non-trivial subgroup, say H/M. Hence, \(\{M\}\subset H/M \subset G/M\). This yields that \(M\subset H\subset G\), and this is a contradiction with the maximality of M.

Note that if \(Z(G)=M\), then M is a normal subgroup of G and by the same arguments, we obtain a contradiction. \(\blacksquare \)

Lemma 10.23

If an element of a group is not contained in any maximal subgroup, then the group is cyclic.

Proof

Suppose that G is a group and \(x\in G\) with x is not in any maximal subgroup of G. Since x is not in any maximal subgroup, it follows that \(\langle x \rangle \) is not maximal. Moreover, since x is not in any maximal subgroup, it follows that \(\langle x \rangle \) is not a subset of any maximal subgroup. This yields that \(\langle x \rangle = G\). \(\blacksquare \)

Theorem 10.24

Let G be a finite group. If G has exactly one maximal subgroup, then the order of G is a power of a prime.

Proof

First, we show that G is cyclic. Assume that M is the unique maximal subgroup of G. Let \(x\in G-M\). Then, the subgroup generated by x is contained in no maximal subgroup and hence is equal to G. Consequently, G is a cyclic group.

Clearly, we have \(|G|>1\). Assume that |G| has more than one prime factor. Then, it may be written \(|G|=p^km\), where p is prime, \(k, \ m\) are positive integers and \((p,m)=1\). Since G is cyclic, if k is a divisor of |G|, then G has a subgroup of order k. In particular, let \(m=qr\) with q prime. So, G has a subgroup of order \(p^kr\) and \(p^{k-1}m\). Since \((p^kr,p^{k-1}m)=1\), neither of these is a subgroup of the other. Since the order of each of these is less than that of order G, it follows that each of them is a proper subgroup of G. Since there are no proper divisors of |G| greater than the order of either of these, no proper subgroup contains either of them. Therefore, we conclude that both of these are maximal subgroups. Since G is defined to have exactly one maximal subgroup, the assumption that |G| has more than one prime factor must be invalid. \(\blacksquare \)

Theorem 10.25

If a finite group has exactly two maximal subgroups, then it is a cyclic group.

Proof

Suppose that G is a finite group. Let \(M_1\) and \(M_2\) be the only distinct maximal subgroups of G. By Lagrange’s Theorem, if a group is finite, the order of any of its proper subgroups can be no greater than half the order of the group. So, we have \(|M_1|\le |G|/2\) and \(M_2|\le |G|/2\). Therefore, we can write

$$ \begin{array}{l} |M_1\cup M_2|\le |M_1|+|M_2|-1\le |G|/2+|G|/2-1=|G|-1<|G|. \end{array} $$

Consequently, there exists \(x\in G\) such that \(x\not \in M_1\cup M_2\). Hence, by Lemma 10.23, we conclude that G is cyclic. \(\blacksquare \)

If M is a maximal subgroup of a group G, then also every conjugate \(aMa^{-1}\) of M is maximal in G. Indeed, \(aMa^{-1}\subset H<G\) implies that \(M<a^{-1}Ha<G\). For this reason the maximal subgroups are studied up to conjugation.

Exercises

  1. 1.

    Let M be a maximal subgroup of a group G. Prove that if \(M\unlhd G\), then [G : M] is finite and equal to a prime number.

  2. 2.

    Let G be the additive group consisting of all rational numbers. Prove that G contains no maximal subgroup.

  3. 3.

    Let G be a group in which each proper subgroup is contained in maximal subgroup of finite index in G. If every two maximal subgroups on G are conjugate in G, prove that G is a cyclic group.

  4. 4.

    If G is a perfect group and M is a maximal subgroup of G, prove that

    1. (1)

      M contains the center of G;

    2. (2)

      M/Z(G) is maximal in G/Z(G).

10.4 Worked-Out Problems

Problem 10.26

If \(n\ge 5\), prove that \(A_n\) is the unique proper non-trivial normal subgroup of the symmetric group \(S_n\).

Solution Suppose that N is a proper non-trivial normal subgroup of \(S_n\) and let \(\sigma \) be a non-identity permutation in N. Then, there exists i such that \(i\sigma \not = i\). We choose \(j\not =i, \ i\sigma \). Now, if \(\tau =(i \ j)\), then \(\alpha =\sigma \tau \sigma ^{-1}\tau ^{-1}\) is non-identity and lies in N. Moreover, \(\alpha \) is a product of the transpositions \(\sigma \tau \sigma ^{-1}\) and \(\tau \). Consequently, it is either a cycle of length 3 or a permutation of the form \((a \ b)(c \ d)\). Since N is normal, it contains either all cycles of length 3 or all permutations of type \((a \ b)(c \ d)\). Therefore, by Theorem 5.43, we deduce that \(N=A_n\). \(\blacksquare \)

Problem 10.27

The fact that the set of all commutators in a group need not be a subgroup is an old result; the following example is due to P.J. Cassidy (1979).

  1. (a)

    Let \(\mathbb {F}[x,y]\) denote the ring of all polynomials in two variables over a field \(\mathbb {F}\), and let \(\mathbb {F}[x]\) and \(\mathbb {F}[y]\) denote the subrings of all polynomials in x and in y, respectively. Define G to be the set of all matrices of the form

    $$ \begin{array}{l} A=\left[ \begin{array}{ccc} \ 1 \ \ &{} \ \ f(x) \ \ &{} \ h(x,y) \ \\ 0 &{} 1 &{} g(y) \\ 0 &{} 0 &{} 1 \end{array} \right] , \end{array} $$

    where \(f(x)\in \mathbb {F}[x]\) and \(g(y)\in \mathbb {F}[y]\) and \(h(x,y)\in \mathbb {F}[x,y]\). Prove that G is a multiplicative group and that \(G^\prime \) consists of all those matrices for which \(f(x)=0=g(y)\).

  2. (b)

    If (0, 0, h) is a commutator, show that there are polynomials \(f(x),f^\prime (x)\in \mathbb {F}[x]\) and \(g(x),g^\prime (x)\in \mathbb {F}[y]\) with \(h(x,y)=f(x)g^\prime (y)-f^\prime (x)g(y)\).

  3. (c)

    Show that \(h(x,y)=x^2+xy+y^2\) does not posses a decomposition as in part (b), and conclude that \((0,0,h)\in G^\prime \) is not a commutator.

solution (a) If A denoted by triple (fgh), then

$$\begin{aligned} (f,g,h)(f^\prime ,g^\prime , h^\prime )=(f+f^\prime ,g+g^\prime ,h+h^\prime +fg^\prime ). \end{aligned}$$
(10.6)

Since G is a subset of a matrix group, we need only to check the subgroup axioms, i.e., it is not necessary to check the associativity. The operation in (10.6) shows that the multiplication of two elements of G is also in G. It is clear that \(I_3\in G\) with \(f(x)=h(x,y)=g(y)=0\). Moreover, if \(f+f^\prime =0\), \(g+g^\prime =0\) and \(h+h^\prime +fg^\prime =0\), then we get \(f^\prime =f\), \(g^\prime =-g\) and \(h^\prime =-h+fg\). This yields that the element \((-f,-g, -h+fg)\) is the inverse of (fgh). Hence, we conclude that G is a group.

If \(h=h(x,y)=\displaystyle \sum \limits a_{ij}x^iy^j\), then

$$ (0,0,h)=\prod \limits _{i,j}[(a_{ij}x^i,0,0), (0,y^j,0)]. $$

This shows that \(G^\prime \) consists of all those matrices for which \(f(x)=0=g(y)\).

(b) Let (0, 0, h) be a commutator. Then we can write

$$ \begin{array}{l} (0,0,h) = (f,g,h_1)(f^\prime ,g^\prime ,h_2) (f,g,h_1)^{-1}(f^\prime ,g^\prime ,h_2)^{-1}\\ =(f,g,h_1)(f^\prime ,g^\prime ,h_2)(-f,-g,-h_1+fg)(-f^\prime ,-g^\prime ,-h_2+f^\prime g^\prime )\\ = (f+f^\prime , g+g^\prime , h_1+h_2+fg^\prime ) (-f-f^\prime , -g-g^\prime , -h_1-h_2+fg+f^\prime g^\prime +fg^\prime )\\ = \big (0,0, fg^\prime +fg^\prime +fg+f^\prime g^\prime +(f+f^\prime )(-g-g^\prime )\big )\\ = (0,0,fg^\prime -f^\prime g). \end{array} $$

(c) If \(f(x)=\displaystyle \sum b_ix^i\) and \(f^\prime (x)=\displaystyle \sum c_ix^i\), then there are equations

$$ \begin{array}{l} h(0,y)=f(0)g^\prime (y)-f^\prime (0)g(y), \\ \displaystyle \frac{\partial }{\partial x} h(x,y)\Big | _{x=0}=\displaystyle \frac{\partial }{\partial x} f(x)\Big | _{x=0}g^\prime (y)-g(y) \displaystyle \frac{\partial }{\partial x} f^\prime (x)\Big | _{x=0}, \\ \displaystyle \frac{\partial ^2}{\partial x^2} h(x,y)\Big | _{x=0}=\displaystyle \frac{\partial ^2}{\partial x^2} f(x)\Big | _{x=0}g^\prime (y)-g(y) \displaystyle \frac{\partial ^2}{\partial x^2} f^\prime (x)\Big | _{x=0}. \end{array} $$

Hence, we obtain the following three equations

$$ \begin{array}{l} b_0g^\prime (y)-c_0g(y)=y^2,\\ b_1g^\prime (y)-c_1g(y)=y,\\ b_2g^\prime (y)-c_2g(y)=1. \end{array} $$

Considering \(\mathbb {F}[x,y]\) as a vector space over \(\mathbb {F}\), one obtains the contradiction that the independent set \(\{1, y, y^2\}\) is in the subspace spanned by \(\{g, g^\prime \}\).

10.5 Supplementary Exercises

  1. 1.

    If N is a normal subgroup of the group G, prove that \((G/N)^\prime =G^\prime N/N\).

  2. 2.

    Let a and b be two elements of order m and n, respectively, in group G. Prove that if a and b both commute with [ab], and d is the greatest common divisor of m and n, then \([a,b]^d=e\).

  3. 3.

    Suppose that G is a group and \(|G|=p^n\), where p is a prime number. If \([G:C_G(x)]\le p\), for all \(x\in G\), prove that

    1. (a)

      \(C_G(x)\unlhd G\), for all \(x\in G\);

    2. (b)

      \(G^\prime \le Z(G)\);

    3. (c)

      \(|G^\prime |\le p\).

  4. 4.

    Let G be a subgroup of a group G. If \([H,G^\prime ]=e\), prove that \([H^\prime ,G]=e\).

  5. 5.

    Let p be prime and let G be a non-abelian group of order \(p^3\). Show that \(Z(G)=[G, G]\) and this is a subgroup of order p.

  6. 6.

    Let \(K\le M<G\), with \(K\trianglelefteq G\). Prove that M/K is a maximal subgroup of G/K if and only if M is a maximal subgroup of G.