An Introduction to Orthogonal Polynomials

Foupouagnigni, Mama

doi:10.1007/978-3-030-36744-2_1

Mama Foupouagnigni^3,4

Part of the book series: Tutorials, Schools, and Workshops in the Mathematical Sciences ((TSWMS))

Included in the following conference series:

AIMS-Volkswagen Stiftung Workshops

1444 Accesses

Abstract

In this introductory talk, we first revisit with proof for illustration purposes some basic properties of a specific system of orthogonal polynomials, namely the Chebyshev polynomials of the first kind. Then we define the notion of orthogonal polynomials and provide with proof some basic properties such as: The uniqueness of a family of orthogonal polynomials with respect to a weight (up to a multiplicative factor), the matrix representation, the three-term recurrence relation, the Christoffel-Darboux formula and some of its consequences such as the separation of zeros and the Gauss quadrature rules.

The research of the author was partially supported by the AIMS-Cameroon Research Allowance 2018–2019.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Orthogonal Polynomials

Orthogonal Polynomials and Applications

Classical Orthogonal Polynomials Revisited

Article Open access 31 May 2023

Keywords

Mathematics Subject Classification (2000)

Primary 33C45; Secondary 42C05

1 Introduction: An Example of a Family of Orthogonal Polynomials

Univariate orthogonal polynomials (or orthogonal polynomials for short) are systems of polynomials (p _n)_n with deg(p _n) = n, satisfying a certain orthogonality relation. They are very useful in practice in various domains of mathematics, physics, engineering and so on, because of the many properties and relations they satisfy. As examples of areas where orthogonal polynomials play important roles, I could list approximation theory (see [5, 23]) and also numerical analysis (see for example [9, 10]). Among those relations, we can mention the following, with the first seven valid for all families of orthogonal polynomials. The last three are in general valid for some specific families of orthogonal polynomials, the so-called classical orthogonal polynomials (see [1,2,3, 6, 7, 12, 14]) and the preliminary training given by S. Mboutngam, M. Kenfack Nangho and P. Njionou Sadjang of these proceedings):

–
Orthogonality relation
–
Matrix representation
–
Three-term recurrence relation
–
Christoffel-Darboux formula
–
Separation of zeros
–
Gauss quadrature
–
Generating functions
–
Second-order holonomic differential, difference or q-difference equation
–
Rodrigues formula
–
Expansion of functions which are continuous differentiable and square integrable, in terms of Fourier series of OP.

Before going into details and for illustration purposes, let us give a concrete example of a family of orthogonal polynomials, then state and prove some of its properties most of which are common to any family of orthogonal polynomials.

Theorem 1.1 (Chebyshev Polynomials of the First Kind [17, 21])

The polynomial family (T _n)_ndefined by (and called Chebyshev polynomials of the first kind or Chebyshev polynomials for short as we will study only this family in this article)

$$\displaystyle \begin{aligned} T_n(x)=\cos{}(n\theta),\;x=\cos\theta,\;0<\theta<\pi,\,n\in\mathbb{N}, \end{aligned} $$

(1.1)

fulfills the following properties:

1.
T _nis a polynomial of degree n in x with leading coefficient a _n = 2 ⁿ⁻¹, satisfying the following recurrence relation (called three-term recurrence relation)
$$\displaystyle \begin{aligned} T_{n+1}(x)+T_{n-1}(x)=2xT_n(x),\;n\geq 1,\;T_0(x)=1,\;T_1(x)=x; \end{aligned} $$
(1.2)
2.
(T _n)_nsatisfies the following relation (called orthogonality relation)
$$\displaystyle \begin{aligned} \int_0^\pi \cos{}(n\theta)\cos{}(m\theta)d\theta=k_n\delta_{n,m}=\int_{-1}^1T_n(x)\,T_m(x){dx\over \sqrt{1-x^2}}, \end{aligned} $$
(1.3)

with $k_0=\pi ,\;k_n={\pi \over 2},\;n\geq 1$.
3.
T _n satisfies the second-order holonomic differential equation:
$$\displaystyle \begin{aligned} (1-x^2)\,T_n^{\prime\prime}(x)-x\,T_n^{\prime}(x)+n^2\,T_n(x)=0,\; n\geq 0. \end{aligned} $$
(1.4)
4.
For any n ≥ 1, T _nhas exactly n zeros, all belonging to the interval of orthogonality (−1, 1). Those zeros, ranked in increasing order, are given by:
$$\displaystyle \begin{aligned} x_{n,k}=\cos\left( {2(n-k)+1\over 2n}\pi\right),\;1\leq k\leq n,\, n\geq 1. \end{aligned} $$
(1.5)
5.
The zeros x _n,k of T _n satisfy
$$\displaystyle \begin{aligned} \begin{array}{rcl}{} x_{n,j}\neq x_{n+1,k},\;\forall n\geq 1, \;1\leq j\leq n,\; 1\leq k\leq n+1; \end{array} \end{aligned} $$
(1.6)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} x_{n+1,k}<x_{n,k}<x_{n+1,k+1},\;1\leq k\leq n. \end{array} \end{aligned} $$
(1.7)
6.
The monic Chebyshev polynomial of degree n ≥ 1 is the polynomial deviating least from zero on [−1, 1] among all monic polynomials of degree n:
$$\displaystyle \begin{aligned} \min\left\{ \max_{-1\leq x\leq 1} |q_n(x)|,\;q_n\in\mathbb{R}[x], q_n(x)=x^n+\dots\right\}=\max_{-1\leq x\leq 1} \left|{T_n(x)\over 2^{n-1}}\right|={1\over 2^{n-1}}, \end{aligned} $$
(1.8)

where $\mathbb {R}[x]$ is the ring of polynomials with real coefficients.
7.
The following property, which is called the Gauss quadrature formula for the specific case of the Chebyshev polynomials, is valid
$$\displaystyle \begin{aligned} \int_{-1}^1 {f(x)\over \sqrt{1-x^2}}dx={\pi\over n}\sum_{k=1}^n f(x_{n,k}),\,\forall f\in \mathbb{R}_{2n-1}[x],\,n\geq 1, \end{aligned} $$
(1.9)

$\mathbb {R}_{2n-1}[x]$is the ring of polynomials of degree at most 2n − 1, with real coefficients. In addition, the integral of any function continuous on the compact interval [−1, 1] can be approximated by the previous formula:
$$\displaystyle \begin{aligned} \int_{-1}^1 {f(x)\over \sqrt{1-x^2}}dx=\lim_{n\rightarrow \infty} {\pi\over n}\sum_{k=1}^n f(x_{n,k}),\,\forall f\in {\mathcal C}[-1,\,1], \end{aligned} $$
(1.10)

where ${\mathcal C}[-1,\,1]$is the set of continuous functions on the interval [−1, 1].

Proof

Let us provide a quick proof of the first six above properties.

Proof of Property 1 Equation (1.2) is obtained by direct computation:

$$\displaystyle \begin{aligned} T_0(x)=\cos{}(0)=1,\;T_1(x)=\cos\theta=x, \end{aligned}$$

and

$$\displaystyle \begin{aligned} T_{n+1}(x)+T_{n-1}(x)=\cos{}(n+1)\theta+\cos{}(n-1)\theta=2\,\cos\theta\,\cos{}(n\theta)=2\,x\,T_n(x), \end{aligned}$$

using the cosine addition formula $\cos {}(a+b)=\cos a\,\cos b-\sin a\,\sin b$.

Next, we now prove by induction that T _n is a polynomial of degree n in the variable x with 2ⁿ⁻¹ as leading coefficient, that is

$$\displaystyle \begin{aligned} T_n(x)=2^{n-1}\,x^n +\text{lower degree terms},\,n\geq 1. \end{aligned} $$

(1.11)

For n = 1, Eq. (1.11) is satisfied as T ₁(x) = x = 2¹⁻¹x and its degree is 1. By assuming that Eq. (1.11) is satisfied for a fixed integer n ≥ 1, we can then write T _n as T _n(x) = 2ⁿ⁻¹x ⁿ + A _n−1(x) where A _n−1 is a polynomial of degree at most n − 1 in the variable x. We complete the proof by using relation (1.2) to obtain that

$$\displaystyle \begin{aligned} T_{n+1}(x)=2x\,T_n(x)-T_{n-1}(x)=2x\,(2^{n-1}\,x^n+A_{n-1}(x))-T_{n-1}(x)=2^n\,x^{n+1}+\tilde{A}_{n}(x), \end{aligned}$$

where $\tilde {A}_{n}$ is a polynomial of degree at most n in x. Therefore, T _n is a polynomial of degree n in the variable x with 2ⁿ⁻¹ as leading coefficient.

From the three-term recurrence relation (1.2), one can generate any T _n; and in particular, the first 10 Chebyshev polynomials are given by:

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} T_0(x)&\displaystyle =&\displaystyle 1,\\ T_1(x)&\displaystyle =&\displaystyle x;\\ T_{{2}}(x) &\displaystyle =&\displaystyle 2\,{x}^{2}-1,\\ T_{{3}} (x) &\displaystyle =&\displaystyle 4\,{x}^{3}-3\,x,\\ T_{{4}} (x) &\displaystyle =&\displaystyle 8\,{x}^{4}-8\,{x}^{2}+1,\\ T_{{5}} (x) &\displaystyle =&\displaystyle 16\,{x}^{5}-20\,{x}^{3}+5\,x,\\ T_{{6}} (x) &\displaystyle =&\displaystyle 32\,{x}^{6}-48\,{x}^{4}+18\,{x}^{2}-1,\\ T_{{7}} (x) &\displaystyle =&\displaystyle 64\,{x}^{7}-112\,{x}^{5}+56\,{x}^{3}-7\,x,\\ T_{{8}} (x) &\displaystyle =&\displaystyle 128\,{x}^{8}-256\,{x}^{6}+160\,{x}^{4}-32\,{x}^{2}+1,\\ T_{{9}} (x) &\displaystyle =&\displaystyle 256\,{x}^{9}-576\,{x}^{7}+432\,{x}^{5}-120\,{x}^{3}+9\,x. \end{array} \end{aligned} $$

(1.12)

Proof of Property 2 Relation (1.3) is proved by direct computation using again the addition formula

$$\displaystyle \begin{aligned} 2\cos{}(n\theta)\,\cos{}(m\theta)=\cos{}(n+m)\theta+\cos{}(n-m)\theta, \end{aligned}$$

and the fact that $x=\cos \theta ,\,0<\theta <\pi \Longrightarrow dx=-\sin \theta \,d\theta =-\sqrt {1-\cos ^2\theta }d\theta $.

Proof of Property 3 Relation (1.4) is also proved by direct computation. In fact

$$\displaystyle \begin{aligned} \begin{array}{rcl} T_n^{\prime}(x)&\displaystyle =&\displaystyle {d\over dx}T_n(x)={d\theta\over dx}{d\over d\theta} T_n(x)={-1\over \sin\theta}{d\over d\theta} \cos{}(n\theta)={n\,\sin{}(n\theta)\over \sin\theta},\,n\geq 1,\\ T_n^{\prime\prime}(x)&\displaystyle =&\displaystyle {d\over dx}{d\over dx}T_n(x)\\ &\displaystyle =&\displaystyle {d\theta\over dx}{d\over d\theta}\left( {d\theta\over dx}{d\over d\theta} T_n(x)\right)\\ &\displaystyle =&\displaystyle {-1\over \sin\theta}{d\over d\theta}\left( {-1\over \sin\theta}{d\over d\theta} \cos{}(n\theta)\right)\\ &\displaystyle =&\displaystyle {n\cos\theta\,\sin{}(n\theta)\over \sin\theta\,\sin^2\theta}+{-n^2\cos{}(n\theta)\over\sin^2\theta}\\ &\displaystyle =&\displaystyle {x\,T_n^{\prime}(x)\over 1-x^2}+{-n^2\,T_n(x)\over 1-x^2},\,n\geq 1. \end{array} \end{aligned} $$

Proof of Property 4 To obtain the zeros of T _n, we solve the following equation for a fixed n ≥ 1, x ∈ (−1, 1) and θ ∈ (−ππ).

$$\displaystyle \begin{aligned} T_n(x)=0\Longleftrightarrow \cos{}(n\theta)=0 \Longleftrightarrow n\theta= {\pi\over 2}+k\pi,\;k\in \mathbb{Z}. \end{aligned}$$

Since 0 < θ < π, then 0 ≤ k ≤ n − 1. Therefore, T _n has exactly n zeros which are $\cos \left ({(2k+1)\pi \over 2n}\right ),\,0\leq k\leq n-1$. But since those zeros are ranked by decreasing order for the function $\theta \rightarrow \cos \theta $ is decreasing on (−π, π) and the sequence $k\rightarrow {(2k+1)\pi \over 2n}$ is increasing, there is a need to reverse the order. This is done by replacing k by n − k. Therefore we obtain the following zeros ranked by increasing order

$$\displaystyle \begin{aligned} x_{n,k}=\cos\theta_{n,k},\;\text{with} \;\theta_{n,k}={2(n-k)+1\over 2n}\,\pi,\,1\leq k\leq n. \end{aligned}$$

The zeros also belong to the interval of orthogonality (−1, 1).

Proof of Property 5 Equation (1.6) is satisfied since the cosine function is a bijection from (0, π) into (−1, 1) and

$$\displaystyle \begin{aligned} \theta_{n,j}\neq \theta_{n+1,k},\,\forall n\geq 1, \;1\leq j\leq n,\; 1\leq k\leq n+1. \end{aligned}$$

The inequalities (1.7) are deduced using the fact that the cosine function is strictly decreasing in (−π, π) combined with the following inequalities which can be obtained by a direct and quick computation

$$\displaystyle \begin{aligned} \theta_{n+1,k+1}<\theta_{n,k}<\theta_{n+1,k},\,1\leq k\leq n. \end{aligned}$$

The interlacing properties of the zeros of the Chebyshev polynomials can be observed on the above graph of the first ten Chebyshev polynomials (Fig. 1).

Proof of Property 6 Let us first denote the monic Chebyshev polynomial of degree n by t _n: $t_n(x)={T_n(x)\over 2^{n-1}},\,n\geq 1,\,t_0(x)=T_0(x)=1$. Next, we define the set of monic polynomials of degree n, ${\mathcal P}_n$, the sup-norm ||.||_max and the subset I of the set of real numbers ${\mathbb R}$, respectively, by

$$\displaystyle \begin{aligned} \begin{array}{rcl} {\mathcal P}_n&\displaystyle =&\displaystyle \left\{q_n\in\mathbb{R}_n[x], q_n(x)=x^n+\text{lower degree terms}\right\}\\ ||p||{}_{\mathrm{max}}&\displaystyle =&\displaystyle \max_{-1\leq x\leq 1} |p(x)|,\\ I&\displaystyle =&\displaystyle \left\{ ||p||{}_{\mathrm{max}},\,p\in {\mathcal P}_n\right\}. \end{array} \end{aligned} $$

To prove that $\min I=||t_n||{ }_{\mathrm{max}}={1\over 2^{n-1}}$, for a fixed but arbitrary integer n ≥ 1, we proceed as follows:

–
In the first step, we derive the extrema for the function t _n:
$$\displaystyle \begin{aligned} t^{\prime}_n(x)=0\Longleftrightarrow {n\sin{}(n\theta)\over \sin{}(\theta)}=0\Longleftrightarrow \sin{}(n\theta)=0,\,\sin\theta \neq 0. \end{aligned}$$

Since 0 ≤ θ ≤ π, we get $\theta ={k\pi \over n},\,1\leq k\leq n-1$. We have excluded k = 0 and k = n to make sure that $\sin \theta \neq 0$. The extrema for t _n are therefore
$$\displaystyle \begin{aligned} z_{n,k}=\cos\left({k\pi\over n}\right),\,1\leq k\leq n-1. \end{aligned}$$
–
In the second step, we study the sign of t _n(x) on the extrema. Before this, we remark that for θ = 0, x = 1 := z _n,0 and for θ = π, x = −1 := z _n,n, enabling us to get the following information on the action of t _n on z _n,k:
$$\displaystyle \begin{aligned} \begin{array}{rcl}{} t_n(-1)&\displaystyle =&\displaystyle {\cos{}(n\,\pi)\over 2^{n-1}}={(-1)^n\over 2^{n-1}}=t_n(z_{n,n}), \end{array} \end{aligned} $$
(1.13)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \,t_n(1)&\displaystyle =&\displaystyle {\cos{}(n\,0)\over 2^{n-1}}= {1\over 2^{n-1}}=t_n(z_{n,0}), \end{array} \end{aligned} $$
(1.14)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \,t_n(z_{n,k})&\displaystyle =&\displaystyle {(-1)^k\over 2^{n-1}},\,1\leq k\leq n-1. \end{array} \end{aligned} $$
(1.15)

Equations (1.13) and (1.14) confirm that Eq. (1.15) which was initially valid for 1 ≤ k ≤ n − 1 is also valid for k = 0, n and can then be written as
$$\displaystyle \begin{aligned} \,t_n(z_{n,k})={(-1)^k\over 2^{n-1}},\,0\leq k\leq n. \end{aligned}$$

The previous equation, combined with the fact that $t_n(x)={\cos {}(n\theta )\over 2^{n-1}}$, allows us to deduce that
$$\displaystyle \begin{aligned} ||t_n||{}_{\mathrm{max}}={1\over 2^{n-1}}. \end{aligned}$$
1.
In the third step, we remark that the set I is not empty since it contains ||t _n||_max. In addition, it has zero as a lower bound. Let us assume that ||t _n||_max is not the minimum element of I. Then there exists a polynomial q belonging to ${\mathcal P}_n$ such that
$$\displaystyle \begin{aligned} -{1\over 2^{n-1}}<q(x)<{1\over 2^{n-1}},\,-1\leq x\leq 1. \end{aligned}$$

We next set P _n−1(x) = t _n(x) − q(x) and observe, taking into account the previous inequalities, that P _n−1 which is a polynomial of degree at most n − 1 fulfills the following properties:
$$\displaystyle \begin{aligned} \begin{array}{rcl} P_{n-1}(z_{n,2j})&\displaystyle =&\displaystyle t_n(z_{n,2j})-q(z_{n,2j})={1\over 2^{n-1}}-q(z_{n,2j})>0,\\ P_{n-1}(z_{n,2j+1})&\displaystyle =&\displaystyle t_n(z_{n,2j+1})-q(z_{n,2j+1})={-1\over 2^{n-1}}-q(z_{n,2j+1})<0, \end{array} \end{aligned} $$

for any integer j such that 0 ≤ 2j + 1 ≤ n. We obtain a contradiction to the fact that the polynomial P _n−1 which is of degree at most n − 1 will have n zeros for it will change its sign n times in the intervals (z _n,k, z _n,k+1), k = 0…n − 1. We therefore conclude that $||t_n||{ }_{\mathrm{max}}={1\over 2^{n-1}}$ is the minimum of I.

Proof Illustration of Property 7 The Gauss formula (1.9) is given in the general case in the paper by A. S. Jooste in these proceedings (see also [3, 6, 12, 22], but one would need to proceed with additional careful computations to verify that the Christoffel number λ _n,k in the general Gauss quadrature formula is given by $\lambda _{n,k}={\pi \over n}$ for the specific case of the Chebyshev polynomials T _n (see also [15], Theorem 8.4, where the Christoffel numbers have been given explicitly for Chebyshev polynomials of the first, second, third and fourth kinds). We refer to [6], page 33 and also to [3], page 252 for the proof of relation (1.10) and other approximation formulas. □

2 Construction of a System of Orthogonal Polynomials

In this section, after having provided a concrete example of a family of orthogonal polynomials with proof of some of its nice properties—some of which are common for any family of orthogonal polynomials—, we will now show how to construct a family of orthogonal polynomials from a scalar product and then relate this with the definition of orthogonal polynomials.

Let us consider a scalar product (, ) defined on $\mathbb {R}[x]\times \mathbb {R}[x]$ in terms of a Stieltjes integral as

$$\displaystyle \begin{aligned} (p,q)=\int_a^b p(x)\,q(x)\,d\alpha(x), \end{aligned} $$

(2.1)

where $\mathbb {R}[x]$ is the ring of polynomials with a real variable and dα is a non-negative Borel measure supported in the interval (a, b). As scalar product, it fulfills the following properties:

$$\displaystyle \begin{aligned} \begin{array}{rcl} (p,p)&\displaystyle \geq &\displaystyle 0,\,\forall p\in\mathbb{R}[x],\;\text{and}\;(p,p)= 0\Longrightarrow p=0,\\ (p,q)&\displaystyle =&\displaystyle (q,p),\,\forall p,\,q\in\mathbb{R}[x],\\ (\lambda\,p,q)&\displaystyle =&\displaystyle \lambda\,(p,q),\,\forall \lambda\in\mathbb{R},\,\forall p,\,q\in\mathbb{R}[x],\\ (p+q,r)&\displaystyle =&\displaystyle (p,r)+(q,r),\,\forall p,\,q,\,r\in\mathbb{R}[x]. \end{array} \end{aligned} $$

As an example of scalar product on $\mathbb {R}[x]$ with connection to known systems of orthogonal polynomials, we mention:

$$\displaystyle \begin{aligned} (p,q)=\int_{-1}^1 p(x)\,q(x){dx\over \sqrt{1-x^2}}, \end{aligned} $$

(2.2)

which yields the Chebyshev orthogonal polynomials.

The following theorem provides a method for construction of a family of polynomials, orthogonal with respect to a given scalar product. It is called Gram-Schmidt orthogonalisation process.

Theorem 2.1 (Gram-Schmidt Orthogonalisation Process [6, 12, 22])

The polynomial systems (q _n)_nand (p _n)_ndefined recurrently by the relations

$$\displaystyle \begin{aligned} q_0=1,\;q_n=x^n-\sum_{k=0}^{n-1} {(x^n,q_k)\over (q_k,q_k)}\,q_k,\;n\geq 1,\;p_k={q_k\over \sqrt{(q_k,q_k)}},\;k\geq 0, \end{aligned} $$

(2.3)

satisfy the relations

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \deg(q_n)&\displaystyle =&\displaystyle \deg(p_n)=n,\forall n\geq 0, \end{array} \end{aligned} $$

(2.4)

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} (q_n,q_m)&\displaystyle =&\displaystyle 0,\,n\neq m,\;(q_n,q_n)\neq 0,\,\forall n\geq 0,\; \end{array} \end{aligned} $$

(2.5)

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} (p_n,p_m)&\displaystyle =&\displaystyle 0,\,n\neq m,\;(p_n,p_n)=1,\,\forall n\geq 0. \end{array} \end{aligned} $$

(2.6)

The polynomials (q _n)_nand (p _n)_nare said to be orthogonal or orthonormal with respect to the scalar product (, ), respectively. In fact, they represent the same polynomial system with different normalisation: (q _n)_nis monic —to say the coefficient of the leading monomial is equal to 1; while (p _n)_nis orthonormal—to say (p _n, p _n) = 1 or the corresponding norm of p _nis equal to 1.

Proof

Equation (2.4) is obvious while Eq. (2.6) is a direct consequence of Eq. (2.5). We will prove Eq. (2.5) by induction. Because of the properties of the scalar product, we just need to prove the following:

$$\displaystyle \begin{aligned} (q_n,q_m)=0,\,\forall n\geq 1,\,0\leq m\leq n-1. \end{aligned} $$

(2.7)

For n = 1, we have, using relations (2.3)

$$\displaystyle \begin{aligned} (q_1,q_0)=\left(x-{(x,q_0)\over (q_0,q_0)}\,q_0,q_0\right)=(x,q_0) -{(x,q_0)\over (q_0,q_0)}\,(q_0,q_0)=0. \end{aligned}$$

We now assume that relation (2.7) is satisfied up to a given n ≥ 1. Let $m\in \mathbb {N}$, 0 ≤ m ≤ n.

$$\displaystyle \begin{aligned} \begin{array}{rcl} (q_{n+1},q_m)&\displaystyle =&\displaystyle \left(x^{n+1}-\sum_{k=0}^{n} {(x^{n+1},q_k)\over (q_k,q_k)}\,q_k,q_m\right)\\ &\displaystyle =&\displaystyle (x^{n+1},q_m)-\sum_{k=0}^{n} {(x^{n+1},q_k)\over (q_k,q_k)}\,\left(q_k,q_m\right)\\ &\displaystyle =&\displaystyle (x^{n+1},q_m)-{(x^{n+1},q_m)\over (q_m,q_m)}\,\left(q_m,q_m\right)=0, \end{array} \end{aligned} $$

since from the induction hypothesis (Eq. (2.7)) and the symmetry of the scalar product, (q _k, q _m) = 0, 0 ≤ k ≠ m ≤ n − 1. □

Definition 2.2 (Orthogonal Polynomials [6])

Any sequence of polynomials (p _n)_n satisfying Eqs. (2.4) and (2.5) (rewritten as follows with q _n replaced by p _n)

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \deg(p_n)&\displaystyle =&\displaystyle n, \end{array} \end{aligned} $$

(2.8)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \int_a^b p_n(x)\,p_m(x)\,d\alpha(x)&\displaystyle =&\displaystyle 0,\;n\neq m, \end{array} \end{aligned} $$

(2.9)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \int_a^b p_n(x)\,p_n(x)\,d\alpha(x)&\displaystyle \neq&\displaystyle 0,\;\forall n\geq 0, \end{array} \end{aligned} $$

(2.10)

is said to be orthogonal with respect to the measure dα, and called an orthogonal polynomial system or an orthogonal polynomial for short.

Definition 2.3 (Orthogonal Polynomials w.r.t. to a Weight Function [3, 6, 12, 17, 18, 21])

When the measure dα is absolutely continuous, that is dα(x) = ρ(x) dx where ρ is an appropriate function—called weight function, then relations (2.8)–(2.10) read

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \deg(p_n)&\displaystyle =&\displaystyle n, \end{array} \end{aligned} $$

(2.11)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \int_a^b p_n(x)\,p_m(x)\,\rho(x)\,dx&\displaystyle =&\displaystyle 0,\;n\neq m, \end{array} \end{aligned} $$

(2.12)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \int_a^b p_n(x)\,p_n(x)\,\rho(x)\,dx&\displaystyle \neq&\displaystyle 0,\;\forall n\geq 0. \end{array} \end{aligned} $$

(2.13)

The polynomial system (p _n)_n is said to be orthogonal with respect to the weight functionρ. Because of the form of the orthogonality relation, the variable here is continuous. We therefore obtain orthogonal polynomials of a continuous variable.

Definition 2.4 (Orthogonal Polynomials of a Discrete Variable [12, 18])

When the measure dα is discrete and supported in $\mathbb {N}$, that is, α = ρ on $\mathbb {N}$, then the relations (2.8), (2.9) and (2.10) become

$$\displaystyle \begin{aligned} \begin{array}{rcl} \deg{p_n}&\displaystyle =&\displaystyle n,\,n\geq 0,\\ \sum_{k=0}^N \rho(k)\,p_n(k)\,p_m(k)&\displaystyle =&\displaystyle 0,\,\forall n,\,m\in \mathbb{N},\,n\neq m,\\ \sum_{k=0}^N \rho(k)\,p_n(k)\,p_n(k)&\displaystyle \neq&\displaystyle 0,\,n\geq 0, \end{array} \end{aligned} $$

where the parameter N belongs to $\mathbb {N}\cup \{\infty \}$. (p _n)_n is said to be orthogonal with respect to the discrete weight ρ. It is also called a sequence of orthogonal polynomials of a discrete variable.

Notice that if N is finite, then there exist only a finite number of orthogonal polynomials, this because the bilinear application defined in (2.1) is positive definite not on the entire $\mathbb {R}[x]$ but rather on its linear subspace, $\mathbb {R}_l[x]$, for an appropriate choice of the positive integer l.

Definition 2.5 (Orthogonal Polynomials of a q-Discrete Variable [8, 11, 13, 18])

When the measure dα is q-discrete and supported in $q^{\mathbb {Z}}$, that is, α = ρ on $q^{\mathbb {Z}}$, where $\mathbb {Z}$ is the set of integers, then the relations (2.8), (2.9) and (2.10) become

$$\displaystyle \begin{aligned} \begin{array}{rcl} \deg{p_n}&\displaystyle =&\displaystyle n,\,n\geq 0,\\ \sum_{k=0}^N \rho(q^k)\,p_n(q^k)\,p_m(q^k)&\displaystyle =&\displaystyle 0,\,\forall n,\,m\in \mathbb{N},\,n\neq m,\\ \sum_{k=0}^N \rho(q^k)\,p_n(q^k)\,p_n(q^k)&\displaystyle \neq&\displaystyle 0,\,n\geq 0, \end{array} \end{aligned} $$

where the parameter N belongs to $\mathbb {N}\cup \{\infty \}$. (p _n)_n is said to be orthogonal with respect to the q-discrete weight ρ. It is also called a sequence of orthogonal polynomials of a q-discrete variable.

Remark 2.6

When the measure dα is discrete or q-discrete supported on a quadratic or a q-quadratic lattice, this gives the orthogonal polynomials of a quadratic or a q-quadratic variable. As examples of such polynomials, we mention the Wilson and the Askey-Wilson polynomials [4, 8, 12, 13].

3 Basic Properties of Orthogonal Polynomials

3.1 The Uniqueness of a Family of Orthogonal Polynomials

Before stating the result about the uniqueness of a family of orthogonal polynomials, let us start with the following remarks:

1.
If (p _n)_n is a family of orthogonal polynomials, then due to the fact that the degree of each p _n is equal to n, any subset of $\{p_n,n\in \mathbb {N}\}$ is a linearly independent subset of the linear space $\mathbb {R}[x]$.
2.
Moreover, for any n ≥ 1, the set {p _k, 0 ≤ k ≤ n}, like the canonical basis of monomials {x ^k, 0 ≤ k ≤ n}, constitutes a basis of the linear space $\mathbb {R}_n[x]$ of polynomials of degree at most n.

The following result states an equivalent orthogonality relation.

Lemma 3.1

Let (p _n)_nbe a sequence of polynomials with deg(p _n) = n, n ≥ 0. Then Eqs.(2.9) and (2.10) are equivalent to the two following equations

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \int_a^b p_n(x)\,x^m\,d\alpha(x)&\displaystyle =&\displaystyle 0,\;\forall n\geq 1, \,0\leq m\leq n-1, \end{array} \end{aligned} $$

(3.1)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \int_a^b p_n(x)\,x^n\,d\alpha(x)&\displaystyle \neq&\displaystyle 0,\;\forall n\geq 0. \end{array} \end{aligned} $$

(3.2)

Proof

The proof is obtained by combining the orthogonality relations (2.9) and (2.10) or (respectively (3.1) and (3.2)) with the expansion of the polynomial p _m in the canonical basis of monomials (respectively the expansion of x ^m in the basis {p _k, 0 ≤ k ≤ n}). □

The uniqueness of a family of polynomials orthogonal with respect to a measure dα can then be stated as follows.

Theorem 3.2 (Uniqueness of a Family of Orthogonal Polynomials)

To the measure dα corresponds a unique (up to a multiplicative factor) family of orthogonal polynomials. Or equivalently, if (p _n)_nand (q _n)_nare two families of polynomials satisfying relations (2.8)–(2.10), then they are proportional, to say that there exists a sequence (b _n)_nsuch that p _n = b _nq _n, n ≥ 0, with b _n ≠ 0, n ≥ 0.

Proof

The proof is obtained by expanding the polynomial q _n in the basis {p _k, 0 ≤ k ≤ n} of $\mathbb {R}_n[x]$ and using the orthogonality relations (2.9) and (2.10) to show that the other coefficients, except the leading one, are equal to zero. □

3.2 The Matrix Representation

The following results give information about the Hankel determinant and a matrix representation of a given family of orthogonal polynomials. Before that, let us define what we mean by linear functional and orthogonality with respect to a linear functional.

Definition 3.3 (Linear Functional)

Linear functional here means any linear mapping from $\mathbb {R}[x]$ to $\mathbb {R}$.

The sequence of polynomials (p _n)_n will be said to be orthogonal with respect to the linear functional $\mathcal {U}$ if deg(p _n) = n and

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \langle \mathcal{U},x^mp_n\rangle &\displaystyle =&\displaystyle 0,\,n\geq 0,\,0\leq m\leq n, \end{array} \end{aligned} $$

(3.3)

$$\displaystyle \begin{aligned} \begin{array}{rcl} {} \langle \mathcal{U},x^np_n\rangle &\displaystyle \neq&\displaystyle 0,\,\forall n\geq 0. \end{array} \end{aligned} $$

(3.4)

In this case, the linear functional $\mathcal {U}$ is said to be quasi-definite, to say that there exists a family of polynomials orthogonal with respect to $\mathcal {U}$.

As example, we define a linear functional $\mathcal {L}$ by

$$\displaystyle \begin{aligned} \langle \mathcal{L},p\rangle=\int_{-1}^1 {p(x)\over \sqrt{1-x^2}}dx, \end{aligned}$$

corresponding to the Chebyshev orthogonal polynomials (T _n)_n. The definition of orthogonality by means of a linear functional is very useful in practice because it enables an elegant proof of equivalent properties of standard orthogonal polynomials [6, 7, 12, 16] in addition to providing the proof of the so-called Favard Theorem [6] stating that any sequence of polynomials satisfying a three-term recurrence relation with some specific restriction on one of its coefficients is orthogonal with respect to a quasi-definite functional.

Theorem 3.4 ([21])

Let (p _n)_nbe a sequence of polynomials with deg(p _n) = n, n ≥ 0 and satisfying the orthogonality conditions (2.9) and (2.10).

1.
Then for any integer n ≥ 0, the following relation holds
$$\displaystyle \begin{aligned} \Delta_n> 0,\,n\geq 0, \end{aligned} $$
(3.5)

where Δ _nis the Hankel determinant defined by
$$\displaystyle \begin{aligned} \Delta_n=\det(\mu_{k+j})_{0\leq k,j\leq n}= \left|\begin{array}{ccccc} \mu_0 & \mu_1& \cdots & \mu_{n-1} & \mu_n\\ \mu_1 & \mu_1& \cdots & \mu_{n-1} & \mu_{n+1}\\ \vdots & \vdots & \vdots & \vdots & \vdots \\ \mu_{n-1} & \mu_n& \cdots & \mu_{2n-2} & \mu_{2n-1}\\ \mu_{n} & \mu_{n+1}& \cdots & \mu_{2n-1} & \mu_{2n} \end{array}\right|, \,\,n\geq 1,\,\Delta_0:=\mu_0. \end{aligned} $$
(3.6)

The number μ _nwhich is given by
$$\displaystyle \begin{aligned} \mu_n=\int_a^b x^n\,d\alpha(x),\;n\geq 0,\end{aligned} $$

denotes the canonical moment with respect to the measure dα.
2.
For any positive integer n, the polynomial p _nhas the following matrix representation
$$\displaystyle \begin{aligned} p_n(x)=\,{a_{n,n}\over \Delta_{n-1}} \left|\begin{array}{ccccc} \mu_0 & \mu_1& \cdots & \mu_{n-1} & \mu_n\\ \mu_1 & \mu_1& \cdots & \mu_{n-1} & \mu_{n+1}\\ \vdots & \vdots & \vdots & \vdots & \vdots \\ \mu_{n-1} & \mu_n& \cdots & \mu_{2n-2} & \mu_{2n-1}\\ 1& x& \cdots & x^{n-1} & x^n \end{array}\right| \,,\end{aligned} $$
(3.7)

where a _n,nis the leading coefficient of p _n.
3.
Conversely, given any sequence of real numbers (μ _n)_nsatisfying relation (3.5), then since Δ _n ≠ 0, n ≥ 0, there exists a sequence of polynomials orthogonal with respect to the quasi-definite linear functional $\mathcal {U}$defined on the canonical basis of monomials by
$$\displaystyle \begin{aligned} \langle\mathcal{U},x^n\rangle=\mu_n,\,n\geq 0.\end{aligned} $$

In addition, from (3.5) the linear functional $\mathcal {U}$is positive-definite and as a consequence (see [ 6]) there exists a positive Borel measure associated with it.

The corresponding family is given explicitly by (3.7).

Proof

1.
For the proof of the first property, let (p _n)_n be a sequence of polynomials with deg(p _n) = n, n ≥ 0 and satisfying the orthogonality conditions (2.9) and (2.10) which are equivalent to orthogonality conditions (3.1) and (3.2). Writing for a fixed integer n ≥ 1
$$\displaystyle \begin{aligned} p_n(x)=\sum_{k=0}^n a_{n,k}\,x^k,\end{aligned} $$

in the orthogonality relation (3.1) for the integers m = 0…n and then for orthogonality relation (3.2), we obtain the following system of linear equations for the unknowns (a _n,k)_k whose matrix form is given by
$$\displaystyle \begin{aligned} \left( \begin{array}{ccccc} \mu_0 & \mu_1& \cdots & \mu_{n-1} & \mu_n\\ \mu_1 & \mu_1& \cdots & \mu_{n-1} & \mu_{n+1}\\ \vdots & \vdots & \vdots & \vdots & \vdots \\ \mu_{n-1} & \mu_n& \cdots & \mu_{2n-2} & \mu_{2n-1}\\ \mu_{n} & \mu_{n+1}& \cdots & \mu_{2n-1} & \mu_{2n} \end{array} \right)\left( \begin{array}{c} a_{n,0}\\a_{n,1}\\ \vdots\\ a_{n,n-1}\\ a_{n,n} \end{array} \right)=\left( \begin{array}{c} 0\\ 0\\ \vdots \\ 0\\ k_n \end{array} \right), \end{aligned} $$
(3.8)

where $k_n=\int _a^b p_n(x)\,x^nd\alpha \neq 0$. Since the polynomial sequence (p _n)_n not only exists and is uniquely determined by fixing k _n, then necessarily, the Hankel determinant is different from zero. The positiveness of the Hankel’s determinant will be deduced in the following paragraph.
2.
To prove the second property, we first use (3.7) to obtain for 0 ≤ m ≤ n that
$$\displaystyle \begin{aligned} \int_a^b p_n(x)\,x^m\,d\alpha(x)=\,{a_{n,n}\over \Delta_{n-1}} \left|\begin{array}{ccccc} \mu_0 & \mu_1& \cdots & \mu_{n-1} & \mu_n\\ \mu_1 & \mu_1& \cdots & \mu_{n-1} & \mu_{n+1}\\ \vdots & \vdots & \vdots & \vdots & \vdots \\ \mu_{n-1} & \mu_n& \cdots & \mu_{2n-2} & \mu_{2n-1}\\ \mu_{m} & \mu_{m+1}& \cdots & \mu_{m+n-1} & \mu_{m+n} \end{array}\right| \,. \end{aligned} $$
(3.9)

The previous relation reads
$$\displaystyle \begin{aligned} \int_a^b p_n(x)\,x^m\,d\alpha(x)=0,\,0\leq m\leq n-1 \end{aligned} $$
(3.10)

since the m + 1-th row and the last row of the determinant will be identical. Also, use of (3.9) for m = n taking combined with the following relation
$$\displaystyle \begin{aligned} \int_a^b p_n(x)\,x^n\,d\alpha(x)={1\over a_{n,n}}\,\int_a^b p_n(x)\,p_n(x)\,d\alpha(x)={d_n^2\over a_{n,n}} \end{aligned}$$

obtained using orthogonality, leads to
$$\displaystyle \begin{aligned} \int_a^b p_n(x)\,x^n\,d\alpha(x)=a_{n,n}\,{\Delta_n\over \Delta_{n-1}}={d_n^2\over a_{n,n}}\neq 0,\, n\geq 1. \end{aligned} $$
(3.11)

We then deduce from Eqs. (3.10) and (3.11) combined with (3.1) and (3.2) that (p _n)_n is orthogonal with respect to dα(x).

The positiveness of Δ_n is seen from the relation
$$\displaystyle \begin{aligned} \Delta_n=\Delta_0\,\prod_{k=1}^n {d_k^2\over a_{k,k}^2}=\mu_0\,\prod_{k=1}^n {d_k^2\over a_{k,k}^2}>0, \end{aligned}$$

deduced from (3.11).
3.
The third property is proved by showing, in a similar way as done in the proof of Property 2 above, that the polynomial sequence given by (3.7) satisfies orthogonality relations (3.3) and (3.4).

□

3.3 The Three-Term Recurrence Relation

Theorem 3.5 (Three-Term Recurrence Relation [6, 21, 24])

Any polynomial sequence (p _n)_n, orthogonal with respect to the measure dα or fulfilling the orthogonality relations (2.8)–(2.10), satisfies the following relation called three-term recurrence relation

$$\displaystyle \begin{aligned} x\,p_n(x)={a_n\over a_{n+1}}p_{n+1}+\left({b_n\over a_n}-{b_{n+1}\over a_{n+1}}\right)p_n+{a_{n-1}\over a_n}\,{d_n^2\over d_{n-1}^2}p_{n-1},\;p_{-1}=0,\,p_0=1, \end{aligned} $$

(3.12)

with

$$\displaystyle \begin{aligned} p_n=a_n\,x^n+b_n\,x^{n-1}+\,\mathit{\text{lower degree terms}}, \;\mathit{\text{ and}}\; d_n^2=(p_n,p_n). \end{aligned} $$

(3.13)

When (p _n) is monic (i.e. a _n = 1) or orthonormal (ie. d _n = 1), then Eq.(3.12) can be written in the following forms, respectively:

$$\displaystyle \begin{aligned} p_{n+1}=(x-\beta_n)\,p_n-\gamma_n\,p_{n-1},\;p_{-1}=0,\,p_0=1, \end{aligned} $$

(3.14)

with $\beta _n=b_n-b_{n+1},\;\gamma _n={d_n^2\over d_{n-1}^2}$ , and

$$\displaystyle \begin{aligned} x\,p_n=\alpha_{n+1}\,p_{n+1}+\eta_n\,p_n+\alpha_n\,p_{n-1},\;p_{-1}=0,\,p_0=1, \end{aligned} $$

(3.15)

with $\alpha _n={a_{n-1}\over a_n},\;\eta _n={b_n\over a_n}-{b_{n+1}\over a_{n+1}}.$

Also, the recurrence coefficients of the monic and orthonormal forms of the orthogonal polynomial system are connected by

$$\displaystyle \begin{aligned} \eta_n=\beta_n,\;\gamma_n=\alpha_n^2. \end{aligned} $$

(3.16)

Proof

For fixed n ≥ 0, we expand xp _n in the basis {p ₀, p ₁, …, p _n+1}

$$\displaystyle \begin{aligned} x\,p_n=\sum_{k=0}^{n+1} c_{k,n}\,p_k, \end{aligned}$$

and then use orthogonality to obtain

$$\displaystyle \begin{aligned} c_{k,n}={\int_a^b xp_{n}(x)\,p_k(x)\,d\alpha(x)\over \int_a^b p_{k}(x)\,p_k(x)\,d\alpha(x)}= {\int_a^b p_{n}(x)\,xp_k(x)\,d\alpha(x)\over \int_a^b p_{k}(x)\,p_k(x)\,d\alpha(x)}=0,\;\text{for } 0\leq k\leq n-2. \end{aligned}$$

Hence

$$\displaystyle \begin{aligned} x\,p_n=c_{n+1,n}\,p_{n+1}+c_{n,n}\,p_n +c_{n-1,n}\,p_{n-1}. \end{aligned} $$

(3.17)

Substituting (3.13) into (3.12) and identifying the leading coefficients of the monomials x ⁿ⁺¹ and x ⁿ yields

$$\displaystyle \begin{aligned} c_{n+1,n}={a_n\over a_{n+1}},\;c_{n,n}=\left({b_n\over a_n}-{b_{n+1}\over a_{n+1}}\right). \end{aligned} $$

(3.18)

Using (3.17) twice combined with the orthogonality properties (2.9) and (2.10) gives

$$\displaystyle \begin{aligned} \begin{array}{rcl} c_{n-1,n}\,d_{n-1}^2&\displaystyle =&\displaystyle \int_a^b p_{n}(x)(x)\,xp_{n-1}(x)\,d\alpha(x)\\ &\displaystyle =&\displaystyle \int_a^b p_{n}(x)(x)\, [ c_{n,n-1}\,p_{n}+c_{n-1,n-1}\,p_{n-1} +c_{n-2,n-1}\,p_{n-2} ]d\alpha(x)\\ &\displaystyle =&\displaystyle c_{n,n-1}\,d_n^2, \end{array} \end{aligned} $$

from which we deduce using (3.18) that

$$\displaystyle \begin{aligned} c_{n-1,n}=c_{n,n-1}\,{d_n^2\over d_{n-1}^2}={a_{n-1}\over a_n}\,{d_n^2\over d_{n-1}^2}. \end{aligned}$$

Equations (3.16) are obtained by identifying the coefficients of Eq. (3.14) with those of the monic form of Eq. (3.15). □

3.4 The Christoffel-Darboux Formula

The following formulas are consequences of the three-term recurrence relation.

Theorem 3.6 (Christoffel-Darboux Formula [6, 21])

Any system of orthogonal polynomials satisfying the three-term recurrence relation (3.12), satisfies also a so-called Christoffel-Darboux formula given, respectively, in its initial and confluent forms as

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \sum_{k=0}^n{p_k(x)p_k(y)\over d_k^2}&\displaystyle =&\displaystyle {a_n\over a_{n+1}}\,{1\over d_n^2}\,{p_{n+1}(x)\,p_n(y)-p_{n+1}(y)\,p_n(x)\over x-y},\;x\neq y,\quad \end{array} \end{aligned} $$

(3.19)

$$\displaystyle \begin{aligned} \begin{array}{rcl}{} \sum_{k=0}^n{p_k(x)p_k(x)\over d_k^2}&\displaystyle =&\displaystyle {a_n\over a_{n+1}}\,{1\over d_n^2}\,\left(p_{n+1}^{\prime}(x)\,p_n(x)-p_{n+1}(x)\,p_n^{\prime}(x)\right). \end{array} \end{aligned} $$

(3.20)

Proof

For the proof of Eq. (3.19), we multiply by p _k(y), Eq. (3.17) in which n is replaced by k, to obtain

$$\displaystyle \begin{aligned} x\,p_k(x)\,p_k(y)=c_{k+1,k}\,p_{k+1}(x)\,p_k(y)+c_{k,k}\,p_k(x)\,p_k(y)+c_{k-1,k}\,p_{k-1}(x)\,p_k(y). \end{aligned}$$

Interchanging the role of x and y in the previous equation, we obtain

$$\displaystyle \begin{aligned} y\,p_k(x)\,p_k(y)=c_{k+1,k}\,p_{k+1}(y)\,p_k(x)+c_{k,k}\,p_k(x)\,p_k(y)+c_{k-1,k}\,p_{k-1}(y)\,p_k(x). \end{aligned}$$

Subtracting the last two equations from each other, we obtain that

$$\displaystyle \begin{aligned} {p_k(x)\,p_k(y)\over d_k^2}={A_{k}(x,y)-A_{k-1}(x,y)\over x-y}, \end{aligned}$$

where

$$\displaystyle \begin{aligned} A_{k}(x,y)={c_{k+1,k}\over d_k^2}\,(p_{k+1}(x)\,p_k(y)-p_{k+1}(y)\,p_k(x)), \end{aligned}$$

after taking into account the relation ${c_{k+1,k}\over d_k^2}={a_k\over a_{k+1}}{1\over d_k^2}={c_{k,k+1}\over d_{k+1}^2}.$ Equation (3.19) is obtained by summing the previous equation for k from 0 to n, taking into account that A ₋₁(x, y) = 0 as p ₋₁(x) = 0. Equation (3.20) is obtained by taking the limit of (3.19) when y tends to x. □

3.5 The Interlacing Properties of the Zeros

The following properties of the zeros of orthogonal polynomials are direct consequences of the confluent form of the Christoffel-Darboux formula (3.20). Their proof is given in the lecture notes of A. Jooste in these proceedings.

Theorem 3.7 (On the Zeros of Orthogonal Polynomials [3, 12, 21, 22])

If (p _n)_nis a polynomial system, orthogonal with respect to the positive Borel measure dα supported on the interval (a, b), then we have the following properties:

1.
p _nhas n simple real zeros x _n,ksatisfying a < x _n,k < b, 1 ≤ k ≤ n.
2.
p _nand p _n+1have no common zero. The same applies for P _nand $P_n^{\prime }$;
3.
if x _n,1 < x _n,2 < ⋯ < x _n,nare the n zeros of p _n, then
$$\displaystyle \begin{aligned} x_{n+1,k}<x_{n,k}<x_{n+1,k+1},\;1\leq k\leq n. \end{aligned}$$

Remark 3.8

It should be noticed that the three-term recurrence relation in the current section yields a matrix representation of the multiplication operator, called the Jacobi matrix (see for instance [25]). From this fact one can deduce that the zeros of the n-th orthogonal polynomials are the eigenvalues of the leading principal submatrix of size n × n of such a Jacobi matrix. This provides a method to find in an efficient numerical way such zeros even in the quasi-definite case.

3.6 Solution to the L ²(α) Extremal Problem

Theorem 3.9 (Minimal Property)

Let (p _n)_nbe a sequence of monic polynomials orthogonal with respect to a positive Borel measure dα(x) supported on the real line. For any fixed positive integer n, p _nis the minimal polynomial with respect to the L ²-norm

$$\displaystyle \begin{aligned} ||p||{}_\alpha=\sqrt{\int p^2(x)\,d\alpha(x)} \end{aligned}$$

associated with the corresponding orthogonality measure:

$$\displaystyle \begin{aligned} \min\left\{ \int q_n^2(x)\,d\alpha(x),\;q_n\in\mathbb{R}[x], q_n(x)=x^n+\mathit{\text{ lower degree terms }}\right\}=\int p_n^2(x)\,d\alpha(x) \end{aligned} $$

(3.21)

where $\mathbb {R}[x]$ is the ring of polynomials with real coefficients.

Proof

Let n ≥ 1 and q _n be a monic polynomial of degree n. Combining the expansion of q _n in terms of the (p _k)_k

$$\displaystyle \begin{aligned} q_n(x)=\sum_{k=0}^n a_{n,k}\,p_k(x), \end{aligned}$$

with the orthogonality give

$$\displaystyle \begin{aligned} \int q_n^2(x)\,d\alpha(x)=\sum_{k=0}^n a_{n,k}^2\,d_k^2. \end{aligned}$$

Therefore,

$$\displaystyle \begin{aligned} \int q_n^2(x)\,d\alpha(x)\geq a_{n,n}^2\,d_n^2=d_n^2=\int p_n^2(x)\,d\alpha(x). \end{aligned}$$

In addition, there is equality if and only if a _n,k = 0, 0 ≤ k ≤ n − 1. □

It should be noticed that relation (3.21) which is valid for any sequence of polynomial orthogonal to the positive Borel measure dα(x), is similar to relation (1.8), given for the specific case of the Chebyshev polynomials of the first kind, with the Sup-norm (instead of the corresponding L ²-norm).

3.7 Gauss Quadrature Formula

The following property, which is called the Gauss quadrature formula is valid for any sequence of polynomials orthogonal with respect to the weight function ρ.

Theorem 3.10 ([3, 12, 21, 22])

Let (p _n)_nbe a family of polynomials satisfying orthogonality relations (2.11)–(2.13). Then there exists a sequence of positive real numbers (λ _n,k)_n, called Christoffel numbers, such that

$$\displaystyle \begin{aligned} \int_a^b \rho(x)\,f(x)dx=\sum_{k=1}^n \lambda_{n,k}\,f(x_{n,k}),\,\forall f\in \mathbb{R}_{2n-1}[x],\,n\geq 1,\end{aligned} $$

(3.22)

where the x _n,k, 1 ≤ k ≤ n are the zeros of p _nranked by increasing order. In addition, the integral of any function continuous on the compact interval [a, b] can be approximated by the previous formula:

$$\displaystyle \begin{aligned} \int_a^b \rho(x)\,f(x)dx=\lim_{n\rightarrow \infty} \sum_{k=1}^n \lambda_{n,k}\,f(x_{n,k}),\,\forall f\in {\mathcal C}[a,\,b].\end{aligned} $$

(3.23)

Proof

The proof of Eq. (3.22) which generalises property number 7 of the first theorem, is given in the paper by A. Jooste. It can also be found in [3, 12, 21, 22]. The proof of Eq. (3.23) is given in [3, 6]. □

3.8 Concluding Remarks

We would like to complete this paper with the following information and remark which will help to connect this lecture notes with the forthcoming ones, especially with those involved with classical and semi-classical orthogonal polynomials, as well as orthogonal polynomials of the Sobolev type:

1.
Among the classes of orthogonal polynomials, we mention the classical orthogonal polynomials and the semi-classical orthogonal polynomials. The first class is contained in the second one.
2.
Classical orthogonal polynomials of a continuous, discrete, q-discrete, quadratic and q-quadratic variable, respectively, are those orthogonal with respect to a weight function satisfying a so-called Pearson equation which is a first-order linear homogeneous differential, difference, q-difference, divided-difference or q-divided-difference equation with polynomial coefficients of degree one and at most 2, respectively, with some boundary conditions at the ends of the interval. Depending on the type of the variable, we get classical orthogonal polynomials of continuous, a discrete, a q-discrete, a quadratic and a q-quadratic variable.

Semi-classical orthogonal polynomial are defined in the same way like the classical ones, but with less restriction on the degree of the polynomial coefficients of the Pearson equation which can take higher values.
3.
The properties such as the uniqueness of a family of polynomials orthogonal with respect to a measure, the matrix representation, the three-term recurrence relation, the Christoffel-Darboux formula and its confluent form, the interlacing properties of the zeros and the Gauss quadrature formula are valid for any family of orthogonal polynomials. In addition, it should also be noticed that Theorems 3.4 (Matrix representation), 3.5 (Three-term recurrence relation), and 3.6 (Christoffel-Darboux formula) are also valid if we replace the positive Borel measure by a quasi-definite linear functional. In this case and for Theorem 3.4, the positiveness of the Hankel’s determinant is to be replaced by the fact that this determinant does not vanish.
4.
The Chebyshev polynomials of the first, second, third and fourth kinds are up to now the only known families of orthogonal polynomials for which the zeros are explicitly known. In addition to the Chebyshev polynomials of the first kind which have been studied here, the three other families are, respectively, given for $z=\cos \theta ,\, 0<\theta <\pi $, by [15, 21]
$$\displaystyle \begin{aligned} U_n(z)=\frac{\sin{}((n+1)\theta)}{\sin \theta},\, V_n(z)=\frac{\cos{}((n+{1\over 2})\theta)}{\cos{}(\frac{\theta}{2})},\, W_n(z)=\frac{\sin{}((n+{1\over 2})\theta)}{\sin{}(\frac{\theta}{2})}. \end{aligned}$$

The zeros of U _n(z), V _n(z) and W _n(z) are given in increasing order, respectively, by
$$\displaystyle \begin{aligned} \begin{array}{rcl} z_{n,k}&\displaystyle =&\displaystyle \cos \theta_{n,k},\text{ with } \theta_{n,k}={n+1-k\over n+1}\pi,\, k=1,\,2,\ldots,n, \\ z_{n,k}&\displaystyle =&\displaystyle \cos\theta_{n,k},\text{ with } \theta_{n,k}=\frac{2(n-k)+1}{2n+1}\pi,\, k=1,2,\ldots,n, \\ z_{n,k}&\displaystyle =&\displaystyle \cos\theta_{n,k},\text{ with } \theta_{n,k}=\frac{2(n+1-k)}{2n+1}\pi,\, k=1,2,\ldots,n. \end{array} \end{aligned} $$
5.
Additional information on general orthogonal polynomials can be found for example in [5, 6, 12, 19,20,21].

References

W.A. Al-Salam, Characterization theorems for orthogonal polynomials, in Orthogonal Polynomials: Theory and Practice, ed. by P. Nevai. NATO ASI Series C, vol. 294 (Kluwer Academic Publishers, Dordrecht, 1990), pp. 1–24
Google Scholar
G.E. Andrews, R. Askey, Classical orthogonal polynomials, in Polynômes orthogonaux et applications, ed. by C. Brezinski et al. Lecture Notes in Mathematics, vol. 1171 (Springer, Berlin, 1985), pp. 36–62
Google Scholar
G.E. Andrews, R. Askey, R. Roy, Special Functions. Encyclopedia of Mathematics and Its Applications 71 (Cambridge University Press, Cambridge, 1999)
Google Scholar
N.M. Atakishiyev, M. Rahman, S.K. Suslov, On classical orthogonal polynomials. Constr. Approx. 11, 181–223 (1995)
Article MathSciNet Google Scholar
C. Brezinski, Padé-type Approximation and General Orthogonal Polynomials. International Series of Numerical Mathematics, vol. 50 (Birkhäuser Verlag, Basel, 1980)
Google Scholar
T.S. Chihara, An Introduction to Orthogonal Polynomials (Gordon and Breach, New York, 1978)
MATH Google Scholar
A.G. García, F. Marcellán, L. Salto, A distributional study of discrete classical orthogonal polynomials. J. Comput. Appl. Math. 57, 147–162 (1995)
Article MathSciNet Google Scholar
G. Gasper, M. Rahman, Basic Hypergeometric Series. Encyclopedia of Mathematics and its Applications, vol. 35 (Cambridge University Press, Cambridge, 1990)
Google Scholar
W. Gautschi, Orthogonal Polynomials: Computation and Approximation. Numerical Mathematics and Scientific Computation (Oxford Science Publications, Oxford University Press, New York, 2004)
MATH Google Scholar
W. Gautschi, Orthogonal polynomials, quadrature, and approximation: computational methods and software (in Matlab), in Orthogonal Polynomials and Special Functions. Lecture Notes in Mathematics, vol. 1883 (Springer, Berlin, 2006), pp. 1–77
Google Scholar
W. Hahn, Über Orthogonalpolynome, die q-Differenzengleichungen genügen. Math. Nachr. 2, 4–34 (1949)
Article MathSciNet Google Scholar
M.E.H. Ismail, Classical and Quantum Orthogonal Polynomials in One Variable. Encyclopedia Mathematics and its Applications, vol. 98 (Cambridge University Press, Cambridge, 2005)
Google Scholar
R. Koekoek, P.A. Lesky, R.F. Swarttouw, Hypergeometric Orthogonal Polynomials and Theirq-Analogues. Springer Monographs in Mathematics (Springer-Verlag, Berlin, 2010)
Book Google Scholar
F. Marcellàn, A. Branquinho, J. Petronilho, Classical orthogonal polynomials: a functional approach. Acta Appl. Math. 34, 283–303 (1994)
Article MathSciNet Google Scholar
J.C. Mason, D.C. Handscomb, Chebyshev Polynomials (Chapman and Hall/CRC, New York, 2003)
MATH Google Scholar
J.C. Medem, On the q-polynomials: a distributional study. J. Comput. Appl. Math. 135, 157–196 (2001)
Article MathSciNet Google Scholar
A.F. Nikiforov, V.B. Uvarov, Special Functions of Mathematical Physics (Birkhäuser, Boston, 1984)
MATH Google Scholar
A.F. Nikiforov, S.K. Suslov, V.B. Uvarov, Classical Orthogonal Polynomials of a Discrete Variable (Springer, Berlin, 1991)
Book Google Scholar
H. Stahl, V. Totik, General orthogonal polynomials, in Encyclopedia of Mathematics and Its Applications, vol. 43 (Cambridge University Press, Cambridge, 1992)
Book Google Scholar
P.K. Suetin, Orthogonal Polynomials in Two Variables (Gordon and Breach Science Publishers, Amsterdam, 1999)
MATH Google Scholar
G. Szegö, Orthogonal Polynomials, American Mathematical Society Colloquium Publications, vol. 23, 4th edn. (American Mathematical Society, Providence, 1975)
Google Scholar
N.M. Temme, Special Functions: An Introduction to the Classical Functions of Mathematical Physics (Wiley, New York, 1996)
Book Google Scholar
L.N. Trefethen, Approximation Theory and Approximation Practice (Society for Industrial and Applied Mathematics, Philadelphia, 2013)
MATH Google Scholar
F. Tricomi, Vorlesungen über Orthogonalreihen (Springer, Berlin, 1955)
Book Google Scholar
W. Van Assche, Non-symmetric linear difference equations for multiple orthogonal polynomials, in CRM Proceedings and Lecture Notes, vol. 25 (American Mathematical Society, Providence, 2000), pp. 391–405
Google Scholar

Download references

Acknowledgements

The author would like to thank the anonymous reviewer for his careful reading of the manuscript and his many insightful comments and suggestions.

Author information

Authors and Affiliations

Department of Mathematics, Higher Teachers’ Training College, University of Yaounde I, Yaounde, Cameroon
Mama Foupouagnigni
The African Institute for Mathematical Sciences, Limbe, Cameroon
Mama Foupouagnigni

Authors

Mama Foupouagnigni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mama Foupouagnigni .

Editor information

Editors and Affiliations

University of Yaoundé I, Yaoundé, Cameroon; African Institute for Mathematical Sciences, Limbe, Cameroon
Mama Foupouagnigni
Institute for Mathematics, University of Kassel, Kassel, Germany
Wolfram Koepf

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Foupouagnigni, M. (2020). An Introduction to Orthogonal Polynomials. In: Foupouagnigni, M., Koepf, W. (eds) Orthogonal Polynomials. AIMSVSW 2018. Tutorials, Schools, and Workshops in the Mathematical Sciences . Birkhäuser, Cham. https://doi.org/10.1007/978-3-030-36744-2_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-36744-2_1
Published: 12 March 2020
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-030-36743-5
Online ISBN: 978-3-030-36744-2
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

An Introduction to Orthogonal Polynomials

Abstract

Similar content being viewed by others

Orthogonal Polynomials

Orthogonal Polynomials and Applications

Classical Orthogonal Polynomials Revisited

Keywords

Mathematics Subject Classification (2000)

1 Introduction: An Example of a Family of Orthogonal Polynomials

Theorem 1.1 (Chebyshev Polynomials of the First Kind [17, 21])

Proof

2 Construction of a System of Orthogonal Polynomials

Theorem 2.1 (Gram-Schmidt Orthogonalisation Process [6, 12, 22])

Proof

Definition 2.2 (Orthogonal Polynomials [6])

Definition 2.3 (Orthogonal Polynomials w.r.t. to a Weight Function [3, 6, 12, 17, 18, 21])

Definition 2.4 (Orthogonal Polynomials of a Discrete Variable [12, 18])

Definition 2.5 (Orthogonal Polynomials of a q-Discrete Variable [8, 11, 13, 18])

Remark 2.6

3 Basic Properties of Orthogonal Polynomials

3.1 The Uniqueness of a Family of Orthogonal Polynomials

Lemma 3.1

Proof

Theorem 3.2 (Uniqueness of a Family of Orthogonal Polynomials)

Proof

3.2 The Matrix Representation

Definition 3.3 (Linear Functional)

Theorem 3.4 ([21])

Proof

3.3 The Three-Term Recurrence Relation

Theorem 3.5 (Three-Term Recurrence Relation [6, 21, 24])

Proof

3.4 The Christoffel-Darboux Formula

Theorem 3.6 (Christoffel-Darboux Formula [6, 21])

Proof

3.5 The Interlacing Properties of the Zeros

Theorem 3.7 (On the Zeros of Orthogonal Polynomials [3, 12, 21, 22])

Remark 3.8

3.6 Solution to the L 2(α) Extremal Problem

Theorem 3.9 (Minimal Property)

Proof

3.7 Gauss Quadrature Formula

Theorem 3.10 ([3, 12, 21, 22])

Proof

3.8 Concluding Remarks

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation

3.6 Solution to the L ²(α) Extremal Problem