Elizabeth Browning probably didn’t realize that she was really talking about mathematics when she penned her 43rd sonnet, How Do I Love Thee? This chapter provides a more comprehensive answer to this question than Browning was able to present in the remaining stanzas where she enumerates the ways she loves the veiled object of her sonnet. With the power of mathematics, equations are derived that provide a thorough enumeration, leaving no stone untouched. This is done through the simple expedient of selecting a set of items from a set. It is surprising, as when one falls in love, how fast innocent simplicity explodes into a tangled web of complexity. Perhaps this is what makes love stories, and mathematics, so enduringly interesting.

Assume there are n distinguishable items in a set from which k items are selected.Footnote 1 The object of this chapter is to count the number of possible ways to make such a selection. There are four different counting paradigms that depend upon the order items are selected and whether selected items are removed or returned to the set. Selection with replacement occurs when selected items are returned, otherwise the selection is termed selection without replacement. A permutation occurs when the order of selected items is maintained, otherwise the selection corresponds to a combination.

To illustrate the four different selection paradigms, consider the case where 2 items are selected from the set {1, 2, 3}. A selection can be represented by an integer so that 31 corresponds to first selecting item 3 followed by item 1. The first line of the table below shows that there are 6 possible permutations when items are not returned to the set (possibilities represented by {12, 21, 13, 31, 23, 32}). When order is not maintained (so that the selection 12 is counted as being the same as the selection 21) then the number of possibilities reduces to 3 (possibilities represented by {12, 13, 23}). When items are returned after selection, the number of possibilities in each of the above cases increases by 3 corresponding to the addition of possibilities given by {11, 22, 33}.

 

#

Permutation

#

Combination

Without Replacement

6

{12, 21, 13, 31, 23, 32}

3

{12, 13, 23}

With Replacement

9

{12, 21, 13, 31, 23, 32, 11, 22, 33}

6

{12, 13, 23, 11, 22, 33}

The rest of this chapter derives equations for the number of possible selections for each of the four counting paradigms, establishes relationships between them, and derives identities that arise from the resultant equations.Footnote 2 Throughout this chapter, let the number of different items in a set be denoted by n from which k items are selected.

2.1 Permutations: With and Without Replacement

We first consider permutations when items are not returned to the set, a quantity that is denoted by \(p^r_{k,n}\).Footnote 3 This value satisfies the recurrence

$$\displaystyle \begin{aligned} p^r_{k,n} = n p^r_{k-1,n}, \ \ n \ge 1, \ k \ge 1 \end{aligned} $$
(2.1)

Initial values of the recurrence are \(p^r_{0,n} = 1\) and \(p^r_{k,n} = 0\) for k < 0 or n < 0. To explain (2.1) note that the first of the k selections can be done in n different ways, leaving k − 1 items left to be selected. Since the selected item is returned to the step, this leaves \(p^r_{k-1,n}\) remaining possibilities. Recurrence (2.1) can be solved to yield

$$\displaystyle \begin{aligned} p^r_{k,n} = n^k \end{aligned} $$
(2.2)

Values of \(p^r_{k,n}\) for small parameter values are given in the following table:

 

Values of \(p^r_{k,n} = n^k \)

kn

2

3

4

5

6

7

8

9

1

2

3

4

5

6

7

8

9

2

 

9

16

25

36

49

64

81

3

  

64

125

216

343

512

729

4

   

625

1,296

2,401

4,096

6,561

5

    

7,776

16,807

32,768

59,049

6

     

117,649

262,144

531,441

7

      

2,097,152

4,782,969

8

       

43,046,721

The number of different possibilities for selecting permutations without replacement, denoted by p k,n, satisfies the recursion

$$\displaystyle \begin{aligned} p_{k,n} = n p_{k-1,n-1}, \ \ n \ge 1, \ k \ge 1 \end{aligned} $$
(2.3)

with initial values of p 0,n = 1 and p k,n = 0 for k < 0 of n < 0.Footnote 4 To explain this, note again that the first selection can be done in n different ways. Since the selected item now is removed from the set, this leaves the remaining k − 1 items to be selected from a set of n − 1 items, thus the quantity p k−1,n−1. This recurrence can be solved to yield

$$\displaystyle \begin{aligned} p_{k,n} = n^{\underline{k}} \end{aligned} $$
(2.4)

where the lower factorial is defined by

$$\displaystyle \begin{aligned} n^{\underline{k}} = n(n-1) \cdots (n-k+1) \end{aligned} $$
(2.5)

As an aside, note that two algebraic identities follow directly definition (2.5):

$$\displaystyle \begin{aligned} n^{\underline{k}} = n^{\underline{\ell}} \ \left (n-\ell \right ) ^{\underline{k-\ell}} , \ \ \ell \le k \end{aligned} $$
(2.6)

and

$$\displaystyle \begin{aligned} n \ n^{\underline{k}} = n^{\underline{k+1}} + k n^{\underline{k}} \end{aligned} $$
(2.7)

Values of p k,n for small parameter values are given in the following table (notice the size differences between this and the table above):

 

Values of \(p_{k,n} = n^{ \underline {k}} \)

kn

2

3

4

5

6

7

8

9

1

2

3

4

5

6

7

8

9

2

 

6

12

20

30

42

56

72

3

  

24

60

120

210

336

504

4

   

120

360

840

1,680

3,024

5

    

720

2,520

6,720

15,120

6

     

5,040

20,160

60,480

7

      

40,320

181,440

8

       

362,880

So far the analysis is straightforward; however, a counterintuitive result follows from these the two defining equations (2.3) and (2.4). The birthday problem is typically stated by calculating the probability that, from a set of k people, at least two people have the same birthday. To calculate this, assume that birthdays occur uniformly throughout the year.Footnote 5 If all birthdays are unique, then a selection from 365 days of k items is a permutation without replacement. Setting n = 365 in equation (2.4) shows that there are \(365^{ \underline {k}}\) such permutations. The number of total possible selections from 365 days that allows duplicate days is equivalent to selecting k days with replacement which is given by equation (2.2), 365k. Thus, the fraction of random k selections where there are no duplicate birthdays equalsFootnote 6

$$\displaystyle \begin{aligned} \frac{p_{k,n}}{p^r_{k,n}} = \prod_{j=1}^{k-1} \left (1 - \frac{j}{n} \right ) \end{aligned} $$
(2.8)

To determine the probability that there are multiple birthdays, subtract (2.8) from 1. The numerical results are surprising as shown in the following table:

k

10

15

20

25

30

35

40

45

50

55

\(1-p_{k,n}/p^r_{k,n}\)

.117

.253

.411

.569

.706

.814

.891

.941

.97

.986

The break-even point occurs when k = 23, showing that there is a 50.72% probability that two people have the same birthday! The table shows how fast this percentage increases with k so that the 99% level is breached at k = 57. A fun way to see mathematics in action is to grab the microphone at a typical wedding and ask people having a birthday today to raise their hands. It will almost never fail that at least a couple of hands shoot up!

2.1.1 Dearrangements

To explain another problem that can be solved only using equations (2.3) and (2.4), consider a list of the integers (1, 2, …, n) that are permuted to a new ordering (q 1, q 2, …, q n) so that no integer is in its original position, q ii, i = 1, …, n. Such a rearrangement is termed a dearrangement. Denote the number of such possibilities, by g n which has boundary values: g 0 = 1 and g 1 = 0. Consider integer j and assume that after permutation it is in position k, and hence g k = j. There are n − 1 possibilities to select k if kj. The number of remaining dearrangements that are possible depend on where integer k is permuted. If k exchanges position with j, so that g j = k, then this leaves n − 1 remaining items in the dearrangement, a value given by g n−1. If this is not the case and integer k is found in position j so that g  = k, then two slots of the permutation are taken leaving n − 2 items left in the dearrangement, a value given by g n−2. Summing these disjoint possibilities shows that

$$\displaystyle \begin{aligned} g_n = (n-1) (g_{n-1} + g_{n-2}) \end{aligned} $$
(2.9)

The number of dearrangements grows quickly with n as shown in the following table:

n

2

4

6

8

10

12

14

g n

1

9

265

14,833

1,334,961

176,214,841

32,071,101,049

A billion dearrangements are surpassed with n = 13 (2,290,792,932 ways to be exact). Thus, if you take a fresh pack of cards, separate out one suit in its numeric order and shuffle these 13 cards thoroughly, then about 36.787% of 13! possibilities will correspond to dearrangements. In fact, it is not too difficult to show that as n gets large, the fraction of random permutations of n items that are dearrangements converges to 1∕e where e ≈ 2.718281828 is called Euler’s number after the Swiss mathematician Leonard Euler (1707–1783) who, among his vast achievements, studied properties of the exponential function.Footnote 7

The difficulty with solving recurrence (2.9) lies in the multiplicative factor (n − 1) found in the equation. To counteract this, consider a scaled version where f n = g nn!. This produces the recurrence

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad f_n &\displaystyle =&\displaystyle \frac{g_n}{n!} = \frac{n-1}{n!} \ \left ( g_{n-1} + g_{n-2} \right ) \\ &\displaystyle =&\displaystyle \frac{n-1}{n!} \ \left ((n-1)! f_{n-1} + (n-2)! f_{n-2} \right ) \\ &\displaystyle =&\displaystyle \left ( 1 - \frac{1}{n} \right ) f_{n-1} + \frac{1}{n} f_{n-2} \end{array} \end{aligned} $$

Rewriting this reveals a difference that can be formed between successive index values

$$\displaystyle \begin{aligned} f_n - f_{n-1} = - \frac{1}{n} \left ( f_{n-1} - f_{n-2} \right ) \end{aligned}$$

This can be iterated to yield

$$\displaystyle \begin{aligned} f_n - f_{n-1} = (-1)^n \frac{1}{n!} \end{aligned}$$

This telescopes to the boundary f 0 = 1 leading to the following summation:

$$\displaystyle \begin{aligned} f_n = \sum_{i=0}^n \frac{(-1)^i}{i!} \end{aligned}$$

Converting back to the original recursion produces a lovely result: the number of dearrangements equals an alternating sum of permutations:

$$\displaystyle \begin{aligned} g_n = n! f_n = \sum_{i=0}^n (-1)^i p_{i,n} \end{aligned} $$
(2.10)

2.2 Combinations: Without Replacement

Consider a particular combination obtained from selecting k items from a set of size n. The ordering of the items in this combination can be permuted in k! ways without adding to the number of combinations. Thus an equation for the number of possible combinations, denoted by c k,n, is given by

$$\displaystyle \begin{aligned} c_{k,n} = \frac{n^{\underline{k}}}{k!} = \prod_{j=0}^{k-1} \frac{n-j}{k-j} \end{aligned} $$
(2.11)

It is customary to write this using a binomial coefficient Footnote 8

$$\displaystyle \begin{aligned} c_{k,n} = \left ( \begin{array}{c} n \\ k \end{array} \right ) = \frac{n!}{(n-k)! \ k!} \end{aligned} $$
(2.12)

Values of c k,n for small parameter values are given in the following table:

 

Values of \(c_{k,n} = \left ( \begin {array}{c} n \\ k \end {array} \right ) \)

kn

1

2

3

4

5

6

7

8

9

0

1

1

1

1

1

1

1

1

1

1

1

2

3

4

5

6

7

8

9

2

 

1

3

6

10

15

21

28

36

3

  

1

4

10

20

35

56

84

4

   

1

5

15

35

70

126

5

    

1

6

21

56

126

6

     

1

7

28

84

7

      

1

8

36

8

       

1

9

9

        

1

A recursive derivation of (2.11) is instructive and proves to be useful in future derivations. Initial conditions are easily calculated: c k,n = 0 for k > n or k < 0, c 1,n = n and c n,n = 1. To derive a general recurrence for c k,n, partition selections into two disjoint sections. If item j is selected, then the remaining k − 1 items must be selected from the n − 1 remaining items, a quantity given by c k−1,n−1. If item j is not selected, then it is equivalent to not being in the set, a quantity given by c k,n−1. The total number of combinations without replacement is the sum of these two disjoint possibilities and thus equals

$$\displaystyle \begin{aligned} c_{k,n} = c_{k-1,n-1} + c_{k,n-1} \end{aligned} $$
(2.13)

Simple algebra establishes that equation (2.11) (or equation (2.12)) satisfies recursion (2.13). The value of c k,n equals the number of ways to pick k items from a set of n or, equivalently, equals the number of possible subsets of size k that can be formed from a set of n items.

2.2.1 Binomial Identities

Straightforward algebra establishes the following identities between binomial coefficients:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \quad \qquad \left ( \begin{array}{c} n \\ k \end{array} \right ) &=& \left ( \begin{array}{c} n \\ n-k \end{array} \right ) \end{array} \end{aligned} $$
(2.14)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \left ( \begin{array}{c} n \\ k \end{array} \right ) &=& \left ( \begin{array}{c} n-1 \\ k-1 \end{array} \right ) + \left ( \begin{array}{c} n-1 \\ k \end{array} \right ) \end{array} \end{aligned} $$
(2.15)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \left ( \begin{array}{c} n \\ k \end{array} \right ) &=& \frac{n}{k} \ \left ( \begin{array}{c} n-1 \\ k-1 \end{array} \right ) = \frac{n-k+1}{k} \ \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) \end{array} \end{aligned} $$
(2.16)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \left ( \begin{array}{c} n \\ k \end{array} \right ) \left ( \begin{array}{c} k \\ j \end{array} \right ) &=& \left ( \begin{array}{c} n \\ j \end{array} \right ) \left ( \begin{array}{c} n-j \\ k-j \end{array} \right ) \end{array} \end{aligned} $$
(2.17)

Identity (2.14) shows the symmetry of binomials coefficients and identity (2.15) is a direct restatement of recurrence (2.13). Identity (2.16) follows from the product expansion of (2.11) and from setting j = 1 in (2.17). A typical application of (2.17) is to separate variables j and k in a summation. As an example, consider the following derivation which uses (2.17) in the first step:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \qquad \quad \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{a^k}{1+k} &=& \frac{1}{n+1} \sum_{k=0} ^ n \left ( \begin{array}{c} n+1 \\ k+1 \end{array} \right ) a^k \\ &=& \frac{1}{n+1} \sum_{j=1}^{n+1} \left ( \begin{array}{c} n+1 \\ j \end{array} \right ) a^{j-1} \\ &=& \frac{(1+a)^{n+1} - 1}{a (1+n)} \end{array} \end{aligned} $$
(2.18)

Two special cases of this identity arise when a = 1 or a = −1:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \qquad \qquad \qquad \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{1}{1+k} &=& \frac{2^{n+1} - 1}{n+1} \end{array} \end{aligned} $$
(2.19)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n (-1)^k \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{1}{1+k} &=& \frac{ 1}{n+1} \end{array} \end{aligned} $$
(2.20)

Summations of binomial coefficients are typically derived using induction from an easily calculated base class. There are two ways to interpret the recursion of equation (2.13). The backward view starts from c k,n and recurses to a lower value of n with the values c k−1,n−1 and c k,n−1. This generates an identity which starts from the base case, c 0,2 + c 1,2 + c 2,2 = 4, suggesting that

$$\displaystyle \begin{aligned} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) {=} 2^n \end{aligned} $$
(2.21)

Assume this is true for all values less than or equal to n. Using identity (2.15) implies that

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \qquad \sum_{k=0}^{n+1} \left ( \begin{array}{c} n+1 \\ k \end{array} \right ) &\displaystyle =&\displaystyle \sum_{k=0}^{n+1} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) + \left ( \begin{array}{c} n \\ k \end{array} \right ) \\ &\displaystyle =&\displaystyle \sum_{k=0}^{n+1} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) + 2^n \\ &\displaystyle =&\displaystyle \sum_{k=0}^{n} \left ( \begin{array}{c} n \\ k \end{array} \right ) + \left ( \begin{array}{c} n \\ -1 \end{array} \right ) + 2^n \\ &\displaystyle =&\displaystyle 2^n + 0 + 2^n \\ &\displaystyle =&\displaystyle 2^{n+1} \end{array} \end{aligned} $$

thus proving the claim.

A variation of identity (2.21) involves summing over even or odd indexed values of k. Notice that small examples include: c 0,3 + c 2,3 = 4 and c 0,4 + c 2,4 + c 4,4 = 8 which suggest that

$$\displaystyle \begin{aligned} \sum_{k \ \text{even}} \left ( \begin{array}{c} n \\ k \end{array} \right ) = 2^{n-1}\end{aligned} $$
(2.22)

Also note that c 1,3 + c 3,3 = 4 and c 1,4 + c 3,4 = 8 which suggests that

$$\displaystyle \begin{aligned} \sum_{k \ \text{odd}} \left ( \begin{array}{c} n \\ k \end{array} \right ) = 2^{n-1}\end{aligned} $$
(2.23)

Assume that both of these assumptions holds for all values up to n and calculate

$$\displaystyle \begin{aligned} \begin{array}{rcl} \quad \qquad \qquad \sum_{k \ \text{even}} \left ( \begin{array}{c} n+1 \\ k \end{array} \right ) &\displaystyle =&\displaystyle \sum_{k \ \text{even}} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) + \left ( \begin{array}{c} n \\ k \end{array} \right ) \\ &\displaystyle =&\displaystyle 2^{n-1} + \sum_{k \ \text{even}} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) \\ &\displaystyle =&\displaystyle 2^{n-1} + \sum_{k \ \text{odd}} \left ( \begin{array}{c} n \\ k \end{array} \right ) \\ &\displaystyle =&\displaystyle 2^{n-1} + 2^{n-1} = 2^n \end{array} \end{aligned} $$

A similar argument holds if the initial summation takes place over odd indices. This establishes the double induction and proves identities (2.22) and (2.23).

Equations (2.21), (2.22), and (2.23) are all special cases of a much deeper result—the binomial theorem. This states that

$$\displaystyle \begin{aligned} (x + y )^n = \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) x^k y^{n-k} \end{aligned} $$
(2.24)

This is easily established for small n values. Assume it holds for all values up to n. Then identity (2.15) shows that the pattern continues:

$$\displaystyle \begin{aligned} \begin{array}{rcl} (x+y)^{n+1} &\displaystyle =&\displaystyle \sum_{k=0}^{n+1} \left ( \begin{array}{c} n+1 \\ k \end{array} \right ) x^k y^{n+1-k}\\ &\displaystyle =&\displaystyle \sum_{k=0}^{n+1} \left ( \left ( \begin{array}{c} n \\ k \end{array} \right ) + \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) \right ) x^k y^{n+1-k}\\ &\displaystyle =&\displaystyle y \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) x^k y^{n-k} + x \sum_{k=1}^{n+1} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) x^{k-1} y^{n+1-k}\\ &\displaystyle = &\displaystyle y (x+y)^n + x \sum_{\ell=0}^n \left ( \begin{array}{c} n \\ \ell \end{array} \right ) x^{k} y^{n-k}\\ &\displaystyle =&\displaystyle y (x+y)^n + x (x+y)^n = (x+y)^{n+1}\\ \end{array} \end{aligned} $$

There are a countless number of identities that arise by varying parameters of the binomial theorem besides those just mentioned. Listing just a few:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \left ( 1 - \frac{1}{\ell} \right ) ^n &=& \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) \left ( \frac{-1}{\ \ell } \right )^k \end{array} \end{aligned} $$
(2.25)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \qquad \qquad (x + a)^n - (x - a)^n &=& 2 \sum_{k \ \text{odd} }\left ( \begin{array}{c} n \\ k \end{array} \right ) x^{n-k} a^k \end{array} \end{aligned} $$
(2.26)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \left ( \frac{1}{x} + \frac{1}{y} \right )^n &=& \frac{1}{y^n} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) \left ( \frac{y}{x} \right )^k \end{array} \end{aligned} $$
(2.27)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^{n-i} (-1)^{k} \left ( \begin{array}{c} n-i \\ k \end{array} \right ) &=& \left \{ \begin{array}{cl} 0, & i=0, \ldots , n-1 \\ 1, & i=n \end{array} \right. \end{array} \end{aligned} $$
(2.28)

A further identity can be obtained by forming a telescoping summation combined with the binomial theorem. To derive this, observe that

$$\displaystyle \begin{aligned} \begin{array}{rcl} n &\displaystyle =&\displaystyle \sum_{k=1}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) k - \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) (k-1) \\ &\displaystyle =&\displaystyle \sum_{k=1}^n \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) (n - 2k + 2) \ \ \ \ \ \ \ \ \ \text{From Identity } \mbox{(2.16)} \\ &\displaystyle = &\displaystyle n + \sum_{k=1}^{n+1} \left ( \begin{array}{c} n \\ k-1 \end{array} \right ) (n - 2k + 2) \\ &\displaystyle =&\displaystyle n + \sum_{k=0}^{n} \left ( \begin{array}{c} n \\ k \end{array} \right ) (n - 2k ) \end{array} \end{aligned} $$

Thus

$$\displaystyle \begin{aligned} \sum_{k=0}^{n} \left ( \begin{array}{c} n \\ k \end{array} \right ) (n - 2k ) =0 \end{aligned}$$

which, using the binomial theorem, establishes the identity

$$\displaystyle \begin{aligned} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) k =n 2^{n-1} \end{aligned} $$
(2.29)

A similar telescoping summation can be used to calculate

$$\displaystyle \begin{aligned} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) k (k-1) = n(n-1)2^{n-2} \end{aligned} $$
(2.30)

and

$$\displaystyle \begin{aligned} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) k^2 = n(n+1)2^{n-2} \end{aligned} $$
(2.31)

The next identity we derive is the forward view of recurrence (2.13) that proceeds from n − 1 to n and corresponds to a summation of the numerator of the combinatorial coefficient. This summation follows directly from the listing recursions from (2.13):

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \qquad \qquad c_{k+1,n+1} &\displaystyle =&\displaystyle c_{k,n} + c_{k+1,n} \\ c_{k+1,n} &\displaystyle =&\displaystyle c_{k,n-1} + c_{k+1,n-1} \\ c_{k+1,n-1} &\displaystyle =&\displaystyle c_{k,n-2} + c_{k+1, n-2} \\ \vdots &\displaystyle &\displaystyle \ \ \ \ \ \vdots \ \ \ \ \ \ \ \ \ \ \vdots \ \ \ \\ c_{k+1,k+2} &\displaystyle =&\displaystyle c_{k,k+1} + c_{k+1, k+1} \end{array} \end{aligned} $$

Collecting these yields

$$\displaystyle \begin{aligned} c_{k+1,n+1} = c_{k ,n} + c_{k,n-1} + \cdots + c_{k,k+1} + 1\end{aligned} $$

implies that

$$\displaystyle \begin{aligned} \left ( \begin{array}{c} n+1 \\ k+1 \end{array} \right ) = \sum_{\ell = k }^{n} \left ( \begin{array}{c} \ell \\ k \end{array} \right ) \end{aligned} $$
(2.32)

With the index substitution, j =  − k, the right-hand side of identity (2.32) can be rewritten as

$$\displaystyle \begin{aligned} \sum_{\ell = k }^{n} \left ( \begin{array}{c} \ell \\ k \end{array} \right ) = \sum_{j=0}^{n-k} \left ( \begin{array}{c} k+j \\ k \end{array} \right ) \end{aligned} $$
(2.33)

Making the substitution m − 1 = n − k, with equations (2.32) and (2.33), shows thatFootnote 9

$$\displaystyle \begin{aligned} \sum_{r=0}^{m-1} \left ( \begin{array}{c} k+r \\ k \end{array} \right ) = \left ( \begin{array}{c} m+k \\ k+1 \end{array} \right ) \end{aligned} $$
(2.34)

Combinatoric arguments can often provide insight into identities without the need for algebraic manipulation. Consider selecting k items from a list of n. Divide the list n into two sublists of size m and n − m where 0 ≤ m ≤ n and suppose that j items are selected from the first list and k − j items from the second list. There are c j,mc kj,nm ways in which such a selection can be made. Accounting for all such possibilities leads to the identity

$$\displaystyle \begin{aligned} \left ( \begin{array}{c} n \\ k \end{array} \right ) = \sum_{j=0}^n \left ( \begin{array}{c} m \\ j \end{array} \right ) \left ( \begin{array}{c} n-m \\ k-j \end{array} \right ) , \ \ 0 \le m \le n \end{aligned} $$
(2.35)

The conditions we have placed on binomial terms so that c k,n = 0 if k < 0, n < 0 or k > n allow us to write the right-hand side of equation (2.35) without needing to put restrictions on the binomial coefficients contained in the summation. This identity is termed Vandermonde convolution named after Alexandre-Théophile Vandermonde (1735–1796), a French mathematician who supported the French revolution of 1789 and is most known for his mathematical work in determinants.

2.3 Combinations with Replacement

This leaves the last problem—to calculate the number of combinations obtained when one uses a replacement strategy. To derive an equation, let \(c^r_{k,n}\) denote the number of combinations obtained when selecting k items from a set of n items using replacement.Footnote 10 We set \(c^r_{0,1}\) = 1 and it is clear that \(c^r_{1,n} = n\) and \(c^r_{k,1} = 1\). To derive a general equation for \(c^r_{k,n}\), partition selections into two disjoint sections. If item j is selected, then the number of combinations with replacement is the same as if it had been selected on the first selection, a quantity given by \(c^r_{k-1,n}\). On the other hand, if item j is not selected, then it is equivalent to not being in the set, a quantity given by \(c^r_{k,n-1}\). The total number of combinations is the sum of these two disjoint possibilities and thus equals

$$\displaystyle \begin{aligned} c^r_{k,n} = c^r_{k-1,n} + c^r_{k,n-1} \end{aligned} $$
(2.36)

It is interesting to compare equations (2.13) and (2.36). They differ only in the indices of the first term. The n index of \(c^r_{k-1,n}\) as compared to the n − 1 index of c k−1,n−1 is a result of putting a selected item back in the list when the replacement policy is utilized. This results in a substantial increase in the number of possibilities that a replacement policy has in comparison to a non-replacement policy.

To proceed with the derivation, consider equation (2.36) where k = 2:

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \qquad \qquad \qquad c^r_{2,n} &\displaystyle =&\displaystyle c^r_{1,n} + c^r_{2,n-1} \\ &\displaystyle =&\displaystyle c^r_{1,n} + c^r_{1,n-1} + c^r_{2,n-2} \end{array} \end{aligned} $$

Observe that \(c^r_{2,n-2}\) is of the same form as \(c^r_{2,n}\) except it moves 2 down on the n value which suggests iterating

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \qquad c^r_{2,n} &\displaystyle =&\displaystyle c^r_{1,n} + c^r_{1,n-1} + \cdots + c^r_{1,2} + c^r_{1,1} \\ &\displaystyle =&\displaystyle n + (n-1) + \cdots + 2 + 1\\ &\displaystyle = &\displaystyle \left ( \begin{array}{c} n+1 \\ 2 \end{array} \right ) \end{array} \end{aligned} $$

This case suggests the following guess for a general solution:

$$\displaystyle \begin{aligned} c^r_{k,n} = \left ( \begin{array}{c} n+k-1 \\ k \end{array} \right ) \end{aligned}$$

Assume this equation holds for all values less than or equal to some value k. Then, using the recurrence (2.36) permits

$$\displaystyle \begin{aligned} \begin{array}{rcl} c^{r}_{k+1,n} &\displaystyle =&\displaystyle c^r_{k,n} + c^r_{k+1,n-1}\\ &\displaystyle =&\displaystyle c^r_{k,n} + c^r_{k,n-1} + c^r_{k+1,n-2}\\ &\displaystyle =&\displaystyle c^r_{k,n} + c^r_{k,n-1} \cdots + c^r_{k,2} + c^r_{k+1,1}\\ &\displaystyle =&\displaystyle \left ( \begin{array}{c} k+n-1 \\ k \end{array} \right ) + \cdots + \left ( \begin{array}{c} k+2 \\ k \end{array} \right ) + \left ( \begin{array}{c} k+1 \\ k \end{array} \right ) + 1 \\ &\displaystyle =&\displaystyle \sum_{r=0}^{n-1} \left ( \begin{array}{c} k+r \\ k \end{array} \right ) = \left ( \begin{array}{c} n+(k+1) - 1 \\ k+1 \end{array} \right ) \end{array} \end{aligned} $$

where the last equation uses the identity (2.34). This shows that the induction is satisfied and thus that

$$\displaystyle \begin{aligned} c^r_{k,n} = \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle \end{aligned} $$
(2.37)

where binomial-R coefficients are defined by

$$\displaystyle \begin{aligned} \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle = \left ( \begin{array}{c} n+k-1 \\ k \end{array} \right ) \end{aligned} $$
(2.38)

Values of \(c^r_{k,n}\) for small parameter values are given in the following table:

 

Values of \(c^r_{k,n} = \left \langle \begin {array}{c} n \\ k \end {array} \right \rangle \)

kn

1

2

3

4

5

6

7

8

9

0

1

1

1

1

1

1

1

1

1

1

1

2

3

4

5

6

7

8

9

2

1

3

6

10

15

21

28

36

45

3

1

4

10

20

35

56

84

120

165

4

1

5

15

35

70

126

210

330

495

5

1

6

21

56

126

252

462

792

1,287

6

1

7

28

84

210

462

924

1,716

3,003

7

1

8

36

120

330

792

1,716

3,432

6,435

8

1

9

45

165

495

1,287

3,003

6,435

12,870

9

1

10

55

220

715

2,002

5,005

11,440

24,310

2.3.1 Binomial-R Identities

Some identities that are easy to verify include

$$\displaystyle \begin{aligned} \begin{array}{rcl} {}\qquad \qquad \qquad \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle &=& \left \langle \begin{array}{c} k+1 \\ n-1 \end{array} \right \rangle \\ \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle &=& \left \langle \begin{array}{c} n \\ k-1 \end{array} \right \rangle + \left \langle \begin{array}{c} n-1 \\ k \end{array} \right \rangle \\ \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle &=& \frac{n}{k} \ \left \langle \begin{array}{c} n+1 \\ k-1 \end{array} \right \rangle \\ \left ( \begin{array}{c} n \\ k \end{array} \right ) &=& \left \langle \begin{array}{c} n+1-k \\ k \end{array} \right \rangle \end{array} \end{aligned} $$
(2.39)

Adding one more term to (2.34) produces a similar identity expressed in terms of binomial-R coefficients:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {}\qquad \qquad \sum_{r=0}^n \left ( \begin{array}{c} k+r \\ k \end{array} \right ) & = & \sum_{r=0}^{n-1} \left ( \begin{array}{c} k+r \\ k \end{array} \right ) + \left ( \begin{array}{c} n+k \\ k \end{array} \right ) \\ &=& \left ( \begin{array}{c} n+k \\ k+1 \end{array} \right ) + \left ( \begin{array}{c} n+k \\ k \end{array} \right ) \\ &=& \left ( \begin{array}{c} n+k+1 \\ k+1 \end{array} \right ) \\ &=& \left \langle \begin{array}{c} n+1 \\ k+1 \end{array} \right \rangle \end{array} \end{aligned} $$
(2.40)

The flow of this derivation uses equations from (2.16) and (2.34). One can also derive an identity involving the sum binomial-R coefficients given by

$$\displaystyle \begin{aligned} \sum_{k=0}^m \left \langle \begin{array}{c} n \\ k \end{array} \right \rangle = \left \langle \begin{array}{c} n+1 \\ m \end{array} \right \rangle \end{aligned} $$
(2.41)

The size difference between combinations without and with replacement can be quantified, similar to that of equation (2.8), by forming their ratio:

$$\displaystyle \begin{aligned} \frac{c_{k,n}}{c^r_{k,n} } = \prod_{j=1}^{k-1} \left ( 1 - \frac{k}{n+k-j}\right ) \end{aligned} $$
(2.42)

With the same values as in the birthday problem, k = 23 and n = 365, the ratio of combinations given in equation (2.42) yields an answer of around 25% in comparison to 50% found with permutations.

2.3.2 Polynomial Solutions to Combinatorial Problems

To view the previous analysis within a general framework, suppose there are a set of n integers, m i, i = 1, …, n, each between 0 and k having a total sum that equals k:

$$\displaystyle \begin{aligned} m_1 + m_2 + \cdots + m_n = k , \ \ \ \ 0 \le m_i \le k\end{aligned} $$

How many ways can such numbers be selected to satisfy these constraints? This problem can be thought as the number of n partitions of the integer k. For example, if n = 3 and k = 2, then there are six possible partitions:

$$\displaystyle \begin{aligned} \{2,0,0\}, \ \{1,1,0\}, \ \{1, 0,1\},\ \{0,2,0\}, \ \{0,1,1\}, \ \{0,0,2\}\end{aligned} $$
(2.43)

To answer the general question, suppose that k balls are thrown randomly into n buckets. The value of m i counts the total number of balls that land in bucket i. Some thought shows that this is equivalent to the number of combinations for selecting k items from a set of n where a replacement strategy is used. To see this, associate a bucket with each item in the set of n items. With this association, selecting the i’th item in the set is equivalent to throwing a ball into the i’th bucket. The constraints on the buckets mean that only up to k balls can land in any particular bucket. Thus the solution to the posed question is that the number of possible partitions equals \(c^r_{k,n}\). This simple solution leads to an extremely useful concept which will be derived in the following paragraphs.

The great thing about being a mathematician is that your work, which is really like play, can be done almost anywhere so there is never a danger of becoming bored. Perhaps, for instance, you are stuck in the middle of a theater during a particularly uninteresting play. Then, as long as you have a pen, you can play with equations on the back or margins of the program. You might, during one of these occasions, jot down a simple infinite polynomial like

$$\displaystyle \begin{aligned} f(x) = 1 + x + x^2 + x^3 + \cdots\end{aligned} $$

If x is between 0 and 1, then such a sequence convergences and its sum is given by

$$\displaystyle \begin{aligned} \frac{1}{1-x} = \sum_{i=0}^\infty x^i \end{aligned}$$

The linkage between this and combinatorics arises when f is raised to a power

$$\displaystyle \begin{aligned} f^n(x) = \left( \frac{1}{1-x} \right )^n \end{aligned}$$

It is clear that this expression is another infinite polynomial

$$\displaystyle \begin{aligned} f^n(x) = \sum_{i=0}^\infty a_{i,n} x^i \end{aligned}$$

with integer coefficients a i,n. What are these coefficients?

As a concrete example, suppose that n = 3 and k = 2. Consider the value of a 2,3 which corresponds to the coefficient of x 2 in f 3(x). To facilitate the argument, write

$$\displaystyle \begin{aligned} &f^3(x) \\ &= \underbrace{(x^0 + x^1 + x^2 + \cdots )}_{\text{first group}} \times \ \underbrace{(x^0 + x^1 + x^2 + \cdots )}_{\text{second group} } \times \ \underbrace{(x^0 + x^1 + x^2 + \cdots )}_{\text{third group}} \end{aligned} $$

Let the exponent of x selected in the first group be denoted by m 1 and similarly define m 2 and m 3. When the sum of these x exponents, m 1 + m 2 + m 3, equals 2 then those factors contribute to the value of a 2,3. Since the coefficients of f(x) are all 1, the value of a 2,3 is the number of ways three non-negative integers sum to 2. In our example, this corresponds to the sets given in (2.43). The solution to the problem posed a couple of paragraphs back showed that this quantity equals \(c^r_{2,3}\). Clearly this argument generalizes, and thus \(a_{i,n} = c^r_{i,n}\) which establishes the equation:

$$\displaystyle \begin{aligned} f^n(x) = \sum_{i=0}^\infty \left \langle \begin{array}{c} n \\ i \end{array} \right \rangle x^i \end{aligned} $$
(2.44)

To link this analysis back to counting, recall that \(c^r_{k,n}\) is the number of ways that m 1 + m 2 + ⋯ + m n = k where m i was bounded below by 0 and above by k. Suppose that the value of m i has a different set of constraints. For example, consider a simple case where n = 3 and k = 2 that has the following constraints:

$$\displaystyle \begin{aligned} m_1 \in \{ 0,1 \} , \ \ \ \ m_2 \in \{ 1,2\}, \ \ \ \ m_3 \in \{ 0,2\} \end{aligned}$$

The number of combinations where m 1 + m 2 + m 3 = 2 can be easily listed

$$\displaystyle \begin{aligned} \begin{array}{c} \{1,1,0\},\ \ \ \{0,2,0\} {} \end{array} \end{aligned} $$
(2.45)

But is there a way to calculate that there are 2 possibilities without actually listing them all?

The answer lies in the previous analysis. Associate the polynomial x 0 + x 1 with the first value of m 1 which mathematically incorporates the constraint that m 1 ∈{0, 1}. Similarly associate the polynomial x 1 + x 2 with m 2 and x 0 + x 2 with m 3. The polynomial g(x) defined by the product of all these sub polynomials is given by

$$\displaystyle \begin{aligned} g(x) = (x^0 + x^1) (x^1 + x^2) (x^0 + x^2) = x + 2 x^2 + 2 x^3 + 2 x^4 + x^5 \end{aligned} $$
(2.46)

By construction, the coefficient of x corresponds to the number of possible combinations that result when m 1 + … + m k = . Hence, the expansion in equation (2.46) shows that the number of such possibilities equals 2 when  = 2, thus confirming the enumeration given in (2.45). Additionally, the expansion shows that there are no ways to sum to 0, one way to sum to 1 or 5, and two ways to sum to 2, 3, or 4.

To state the general case, suppose that m i can only have values

$$\displaystyle \begin{aligned} m_i \in \{v_{i,1}, v_{i,2}, \ldots , v_{i, n_i} \} \end{aligned} $$
(2.47)

and define

$$\displaystyle \begin{aligned} g_i(x) = x^{v_{i,1}} + x^{v_{i,2}} + \cdots + x^{v_{i,n_i}} \end{aligned}$$

from which the following product is formed:

$$\displaystyle \begin{aligned} g(x) = \prod_{i=1}^n g_i(x) \end{aligned}$$

Then ,the coefficient of x in the polynomial g(x) equals the number of ways that m 1 + m 2 + ⋯ + m n =  where each m i satisfies its particular set of constraints (2.47). In essence, this technique solves an entire class of difficult problems—a strikingly deep result considering that it arises from a simple combinatoric derivation.

As a final example, consider the number of different values that can be obtained by summing combinations of the first n primes, p i, i = 1, …, n. Each prime can either be selected in the sum or not and thus \(g_i(x)=1 + x^{p_i}, \ i=1, \ldots , n,\) and

$$\displaystyle \begin{aligned} g(x) = \prod_{i=1}^n 1 + x^{p_i} \end{aligned}$$

For the special case of n = 4 this expansion yields

$$\displaystyle \begin{aligned} g(x) = 1 + x^2 + x^3 + 2 x^5 + 2 x^7 + x^8 + x^9 + 2 x^{10} + 2 x^{12} + x^{14} + x^{15} + x^{17} \end{aligned}$$

There are multiple results contained in this expression: there are two ways to sum the first four primes (2,3,5,7) leading to integers in set {5, 7, 10, 12} and one way for integers in set {0, 2, 3, 8, 9, 14, 15, 17}. The set of integers {4, 6, 11, 13, 16, 18, 19, …} has been left out of the set of possibilities and there are 12 different sums that are possible since this equals the number of terms contained in g(x). The number of prime numbers contained in the set of possibilities equals 5 (the set {2, 3, 5, 7, 17}) and the largest consecutive sequence of numbers has four members (the series 7, 8, 9, 10). Clearly, these observations open new questions concerning how they scale as n increases. Is the longest consecutive series, for instance, bounded if n increases indefinitely?

2.4 Transforms and Identities

Let x = (x 0, …, x n) be a vector of length n + 1 and define functions f (x) and g (x) as follows:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {}\qquad \qquad f_\ell(\boldsymbol{x}) &= &\sum_{k=0}^\ell \left ( \begin{array}{c} \ell \\ k \end{array} \right ) x_k \end{array} \end{aligned} $$
(2.48)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} g_\ell(\boldsymbol{x}) &= & \sum_{k=0}^\ell(-1)^{\ell-k} \left ( \begin{array}{c} \ell \\ k \end{array} \right ) x_k, \ \ \ \ \ \ell = 0, \dots, n \end{array} \end{aligned} $$
(2.49)

Define a = (a 0, …, a n) and b = (b 0, …, b n) and set b  = f(a),  = 0, …, n. Expanding (2.49) yields

$$\displaystyle \begin{aligned} \begin{array}{rcl} g_\ell(\boldsymbol{b}) &\displaystyle =&\displaystyle \sum_{k=0}^\ell (-1)^{\ell-k} \left ( \begin{array}{c} \ell \\ k \end{array} \right ) b_k \\ &\displaystyle =&\displaystyle \sum_{k=0}^\ell (-1)^{\ell-k} \left ( \begin{array}{c} \ell \\ k \end{array} \right ) \sum_{i=0}^k \left ( \begin{array}{c} k \\ i \end{array} \right ) a_i \\ &\displaystyle =&\displaystyle \sum_{i=0}^\ell a_i \left ( \begin{array}{c} \ell \\ i \end{array} \right ) \sum_{k=i}^\ell \left ( \begin{array}{c} \ell-i \\ k-i \end{array} \right ) (-1)^{\ell-k}, \ \ \ \text{ From identity } \mbox{(2.17)}\\ &\displaystyle =&\displaystyle \sum_{i=0}^\ell a_i \left ( \begin{array}{c} \ell \\ i \end{array} \right ) \sum_{j=0}^{\ell-i} \left ( \begin{array}{c} \ell-i \\ j \end{array} \right ) (-1)^{\ell - j - i} \\ &\displaystyle =&\displaystyle \sum_{i=0}^\ell a_i (-1)^{\ell - i} \left ( \begin{array}{c} \ell \\ i \end{array} \right ) \sum_{j=0}^{\ell-i} \left ( \begin{array}{c} \ell-i \\ j \end{array} \right ) (-1)^{j} \\ &\displaystyle =&\displaystyle a_ \ell \;\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{From identity } \mbox{(2.28)} \end{array} \end{aligned} $$

The paired equations b  = f (a) and a  = g (b) show that functions f and g are inverse functions of each other. This observation creates a useful transformation termed binomial transformation. To distinguish these functions, f is typically termed a binomial transform and g a binomial inverse.

Binomial transformation can be used to create a paired set of identities whenever a sequence satisfies either (2.48) or (2.49). For example, equation (2.31) corresponds to the binomial transform of a  =  2. This implies that b  = ( + 1)2−2,  = 0, …, n thus creating the paired identity

$$\displaystyle \begin{aligned} n^2 = \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} k(k+1)2^{k-2} \end{aligned} $$
(2.50)

The inverse transforms of (2.21), (2.25), (2.29), and (2.30) are given by

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} 2^k &=& 1 \end{array} \end{aligned} $$
(2.51)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} \left ( 1 - \frac{1}{\ell} \right )^k&=& \left ( \frac{-1}{\ell}\right )^n \end{array} \end{aligned} $$
(2.52)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} k 2^{k-1} &=& n \end{array} \end{aligned} $$
(2.53)
$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \qquad {} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} k (k-1) 2^{k-2} &=& n(n-1) \end{array} \end{aligned} $$
(2.54)

The inverse transform of (2.18) provides the paired identity

$$\displaystyle \begin{aligned} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) (-1)^{n-k} \ \frac{(1+a)^{k+1} - 1}{k+1} = \frac{a^{n+1}}{n+1} \end{aligned} $$
(2.55)

A special case of (2.18) and (2.55) for a = 2 creates the following two identities:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {}\qquad \qquad \qquad \quad \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{2^{k+1}}{k+1} &=& \frac{3^{n+1} - 1}{n+1} \end{array} \end{aligned} $$
(2.56)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n \left ( \begin{array}{c} n \\ k \end{array} \right )(-1)^{n-k} \frac{3^{k+1} - 1}{k+1} &=& \frac{2^{n+1}}{n+1} \end{array} \end{aligned} $$
(2.57)

Another form of a binomial transform is given by

$$\displaystyle \begin{aligned} h_\ell(\boldsymbol{x}) = \sum_{k=0}^\ell (-1)^k \left ( \begin{array}{c} \ell \\ k \end{array} \right ) x_k \end{aligned} $$
(2.58)

which defines an involution, a function that is its own inverse. To establish this, let b  = h (a) and proceed as follows:

$$\displaystyle \begin{aligned} \begin{array}{rcl} \qquad \quad h_\ell(\boldsymbol{b}) &\displaystyle =&\displaystyle \sum_{k=0}^\ell (-1)^k \left ( \begin{array}{c} \ell \\ k \end{array} \right ) b_k\\ &\displaystyle =&\displaystyle \sum_{k=0}^\ell (-1)^k \left ( \begin{array}{c} \ell \\ k \end{array} \right ) \sum_{i=0}^k (-1)^i \left ( \begin{array}{c} k \\ i \end{array} \right ) a_i \\ &\displaystyle =&\displaystyle \sum_{i=0}^\ell a_i \left ( \begin{array}{c} \ell \\ i \end{array} \right ) \sum_{k=i}^\ell \left ( \begin{array}{c} \ell - i \\ k-i \end{array} \right ) (-1)^{k+i}\\ &\displaystyle =&\displaystyle \sum_{i=0}^\ell a_i \left ( \begin{array}{c} \ell \\ i \end{array} \right ) \sum_{j=0}^{\ell-i} \left ( \begin{array}{c} \ell - i \\ j \end{array} \right ) (-1)^{j}\\ &\displaystyle =&\displaystyle a_\ell \end{array} \end{aligned} $$

To illustrate a use of this form of binomial transformation, substitute a = −b in (2.18) which then implies the two identities

$$\displaystyle \begin{aligned} \sum_{k=0}^n (-1)^k \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{b^k}{k+1} = - \frac{(1-b)^{n+1} - 1}{b(n+1)} \end{aligned} $$
(2.59)

and

$$\displaystyle \begin{aligned} \sum_{k=0}^n (-1)^k \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{(1-b)^{k+1} - 1}{b(k+1)} = - \frac{ b^{n}}{(n+1)} \end{aligned} $$
(2.60)

As a special case of these identities, set b = 2 which then creates identities

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \sum_{k=0}^n (-1)^k \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{2^k}{k+1} &=& \left \{ \begin{array}{cl} 0 , & n \ \text{odd,} \\ \frac{1}{n+1}, & n \ \text{even} \end{array} \right. \end{array} \end{aligned} $$
(2.61)
$$\displaystyle \begin{aligned} \begin{array}{rcl} {}\qquad \qquad \sum_{k=0, \ k \ \text{even} }^n (-1)^k \left ( \begin{array}{c} n \\ k \end{array} \right ) \frac{1}{ (k+1)} &=& \frac{2^n}{n+1} \end{array} \end{aligned} $$
(2.62)