1 Introduction

Let \(\mathcal {H}\) denote the class of analytic functions in the unit disk \(\mathbb {D}:=\{z\in \mathbb {C}:\, |z|<1\}\). Let \(\mathcal {A}\) be the class of functions \(f\in \mathcal {H}\) such that \(f(0)=0\) and \(f'(0)=1\), and denote by \(\mathcal {S}\) the class of functions \(f\in \mathcal {A}\) which are univalent (i.e., one-to-one) in \(\mathbb {D}\). Thus, \(f\in \mathcal {S}\) has the following representation

$$\begin{aligned} f(z)= z+\sum _{n=2}^{\infty }a_n z^n,\, |z|<1. \end{aligned}$$
(1.1)

In the late 1960s, Zalcman posed the conjecture that if \(f\in \mathcal {S}\), then

$$\begin{aligned} |a_n^2-a_{2n-1}|\le (n-1)^2 \quad \text{ for } n\ge 2, \end{aligned}$$
(1.2)

with equality only for the Koebe function \(k(z)=z/(1-z)^2\), or its rotation. It is important to note that the Zalcman conjecture implies the celebrated Bieberbach conjecture \(|a_n|\le n\) for \(f\in \mathcal {S}\) (see [3]), and a well-known consequence of the area theorem shows that (1.2) holds for \(n=2\) (see [6]). The Zalcman conjecture remains an open problem, even after de Branges’ proof of the Bieberbach conjecture [5].

For \(f\in \mathcal {S}\), Krushkal has proved the Zalcman conjecture for \(n=3\) (see [10]), and recently for \(n=4,5\) and 6 (see [11]). For a simple and elegant proof of the Zalcman conjecture for the case \(n=3\), see [11]. However, the Zalcman conjecture for \(f\in \mathcal {S}\) is still open for \(n>6\). On the other hand, using complex geometry and universal Teichmüller spaces, Krushkal claimed in an unpublished work [12] to have proved the Zalcman conjecture for all \(n\ge 2\). Personal discussions with Prof. Krushkal indicates that there is a gap in the proof of Krushkal’s unpublished work [12], and so the Zalcman conjecture remains open for the class \(\mathcal {S}\) for \(n>6\).

If \(f\in \mathcal {S}\), then the coefficients of \([f(z^2)]^{1/2}\) and 1/f(1/z) are polynomials in \(a_n\), which contains the expression of the form \(\lambda a_n^2-a_{2n-1}\), pointed out by Pfluger [17].

1.1 Generalized Zalcman conjecture

In 1999, Ma [15] proposed the following generalized Zalcman conjecture: If \(f\in \mathcal {S}\), then

$$\begin{aligned}|a_{n}a_{m}-a_{n+m-1}|\le (n-1)(m-1),\,\,\, n\ge 2, m\ge 2\end{aligned}$$

Clearly, for \(n=m\), the generalized Zalcman conjecture reduces to the Zalcman conjecture. which remains an open problem till date. However Ma [15] proved this generalized Zalcman conjecture for classes \(\mathcal {S}^{*}\) and \(\mathcal {S}_{\mathbb {R}}\), where \(\mathcal {S}_{\mathbb {R}}\) denotes the class of all functions in \(\mathcal {S}\) with real coefficients.

Further, Ma [15] asked for what positive real values of \(\lambda \) does the following inequality hold?

$$\begin{aligned} |\lambda a_na_m-a_{n+m-1}|\le \lambda nm -n-m+1 \,\,\,\,\, (n,\,m=2,3,\ldots ). \end{aligned}$$
(1.3)

Clearly equality holds for the Koebe function \(k(z) = z/(1 - z)^2\) and its rotations.

Remark 1.1

It is obvious that if (1.3) holds, then \(\lambda nm -n-m+1\ge 0\) i.e.,

$$\begin{aligned}\lambda \ge \frac{n+m-1}{nm}.\end{aligned}$$

Note that if \(\lambda \ge \dfrac{n+m-1}{nm}\), \(\mu \ge \lambda \) and (1.3) holds, then

$$\begin{aligned} |\mu a_na_m-a_{n+m-1}|\le & {} (\mu -\lambda )|a_na_m|+|\lambda a_na_m-a_{n+m-1}|\\ \nonumber\le & {} (\mu -\lambda )nm+\lambda nm-n-m+1\,\,\,\,\, (\text{ since } |a_n|\le n)\\ \nonumber= & {} \mu nm-n-m+1. \end{aligned}$$

That is if (1.3) holds for some \(\lambda \), then it holds for every \(\mu \ge \lambda \).

In view of Remark 1.1, it is natural to consider the following problem

Problem 1.4

Let \(f\in \mathcal {S}\), fix nm and consider the following set \(\{\lambda : \lambda \text{ satisfies } \)(1.3)}. The problem is to find the infimum of this set.

At this juncture we must mention that the Zalcman conjecture and its other generalized form has been proved for some subclasses of \(\mathcal {S}\), such as starlike functions, typically real functions, close-to-convex functions [3, 13, 14]. For basic properties of starlike functions, typically real functions and close-to-convex functions we refer to [6].

2 Main results

Using the variational method, together with the Bombieri method [2], we prove the following two results.

Theorem 2.1

Let \(f\in \mathcal {S}\) be an extremal function for the extremal problem \(|2a_2a_3-a_4|\) and let \(\Gamma \) be the image of \(|z|=1\) under 1/f(z). If \(Re\, a_2>0\) and \(Im\, a_2\ne 0\) then \(\Gamma \) lies either in upper or lower half plane.

Theorem 2.2

Let \(f\in \mathcal {S}\) be given by \(f(z)=z+\sum _{n=2}^{\infty } a_n z^n\) then

$$\begin{aligned} |3a_2a_3-a_4|\le 14, \end{aligned}$$
(2.1)

with equality only for functions of the form

$$\begin{aligned}\frac{z}{(1-e^{i \theta }z)^2},\,\,\, \text{ where } \,\, \theta \text{ is } \text{ real }.\end{aligned}$$

For the proof of Theorem 2.2, we follow the technique of Ozawa [16].

3 Preliminaries

In this section we discuss some preliminary ideas which will be useful to prove our main results.

3.1 Schiffer’s Variational method

In 1938, Schiffer developed variational method as a tool for treating the extremal problems arising in univalent function theory. Schiffer’s boundary variation [23, 24] which is applicable to very general extremal problems, showed that any function in \(\mathcal {S}\) which maximizes \(|a_{n}|\) must map the disk onto the complement of a single analytic arc [25] which lies on the trajectory of a certain quadratic differential. The omitted arc was found to have monotonic modulus and other nice properties. Bieberbach’s conjecture asserted that this must be a radial half-line. In 1955, Garabedian and Schiffer [7] finally succeeded in using this approach, in combination with Loewner’s method, to prove that \(|a_{4}|\le 4\). The work in [7] gives a general method to attack coefficient problems for univalent functions, but involves a great amount of computational work. Later in 1960, Z. Charzynski and M. Schiffer [4] gave a greatly simplified proof of \(|a_4|\le 4\). In [4], the authors give a new proof of the Grunsky inequality based on variational methods.

In the application of Schiffer’s variational method, the complement \(\Gamma \) of the range of an extremal function consists of analytic curves satisfying a differential inequality of the form \(Q(w)dw^2<0\). More precisely, the Schiffer’s differential equation is of the form

$$\begin{aligned}\left( \frac{dw}{dt}\right) ^2\,\frac{Q(w)}{w^{n+1}}=-1.\end{aligned}$$

In general, such an expression is called a quadratic differential, where Q is meromorphic and the arcs for which \(Q(w)dw^2>0\) are called its trajectories. In many important cases the function Q is a rational function. The zeros and poles of Q are referred to as the singularities of the quadratic differential. In order to emphasise the geometric point of view, consider a metric \(ds^2=|Q(w)||dw|^2\), which is euclidean except at the singularities. The trajectories are the geodesics of this metric. For the detailed study of this quadratic differential and its local and global trajectories we refer to Jenkins [8, 9] and Strebel [26].

By Schiffer’s variational method, the extremal function of certain extremal problems satisfy a differential equation of the form

$$\begin{aligned}z^2 \frac{w'^2}{w^{m+1}}P^*(w)=Q^*(z),\end{aligned}$$

where \(P^*\) is a polynomial in w and \(Q^*\) is a rational function in z. The exact formulation for this differential equation was established by Schaffer and Spencer [18,19,20,21,22] in a series of papers on coefficient regions for univalent functions. The problem they considered is to characterize the sequences \(\{a_n\}\) which define such functions and can be solved if one can determine the region \(V_n\) in \((2n-2)\)-dimensional space to which the point \((a_2, a_3,\ldots , a_n)\) is confined. The most likely way to success is to determine the boundary of \(V_n\) through the extremal properties of the corresponding functions, which can be found by developing a specific variational method for geometric and rigorous meaning of the interior variation, see [1].

If \(f(z)=w\) maximizes an extremal problem \(\mathcal {J}(a_2, a_3, a_4, \ldots , a_n)\), then f satisfies the following differential equation (see [20])

$$\begin{aligned}(zw')^2\sum _{v=2}^{n}A_vw^{-v-1}=B+\sum _{v=1}^{n-1}(B_vz^{-n+v}+\overline{B_v}z^{n-v}),\end{aligned}$$

where

$$\begin{aligned}A_v=\sum _{k=v}^{n}a_k^{(v)}\mathcal {J}_k,\,\,\,\, B_v=\sum _{k=1}^{v}ka_k\mathcal {J}_{n+k-v},\,\,\,\, B=\sum _{k=2}^{n}(k-1)a_k\mathcal {J}_k.\end{aligned}$$

Here \(a_k^{(v)}\) are the coefficients of

$$\begin{aligned}f(z)^v=\sum _{k=v}^{\infty }a_k^{(v)}z^k,\end{aligned}$$

and

$$\begin{aligned}\mathcal {J}_v=\frac{\partial \mathcal {J}}{\partial a_v}=\frac{1}{2}\left( \frac{\partial \mathcal {J}}{\partial x_v}-i\frac{\partial \mathcal {J}}{\partial y_v}\right) ,\,\,\,\,\,\,\, \text{ where } a_v=x_v+iy_v.\end{aligned}$$

3.2 Bombieri method

Bombieri [2] has proved a general result about critical trajectories of a quadratic differential \(Q(\xi )d\xi ^2\) on the \(\xi \)-sphere, arising from the following problem.

Let there be given a quadratic differential \(Q (\xi ) d\xi ^2\) on the \(\xi \)- sphere. Then a "good" subset \(T_0\) (\(T_0\) is said to be good if it satisfies a certain connectedness condition) of the set \(\overline{T}\) where T is the set of critical trajectories of \(Q (\xi ) d\xi ^2\), is a continuously differentiable Jordan arc J on the \(\xi -\) sphere. Now the question is can we assert that \(J\cap T_0\) is either empty set or a single point under aferesaid conditions on J ? The answer to this question is given by Bombieri [2].

Theorem A

[2] Let R be the \(\xi -\) sphere, \(Q (\xi )d\xi ^2\) be a quadratic differential on it with at most three distint poles, only one of which has order at least 2. This point is called B. \(T_0\) be a connected component of \(\overline{T}\setminus B\), and let J be a continuously differentiable Jordan arc on R not containing poles of \(Q (\xi ) d\xi ^2\), such that \(B\notin \overline{J}\) and

$$\begin{aligned}\Im (Q(\xi )d\xi ^2)\ne 0\,\,\,\, \text{ on } J.\end{aligned}$$

Then \(\overline{J}\) can meet \(T_0\) at most in one point.

Corollary

Theorem A remains true if J contains one simple pole A of \(Q(\xi )d\xi ^2\), provided \(T_0\) is the connected component of \(\overline{T}\setminus B\) containing A, and we have

$$\begin{aligned}T_0 \cap \overline{J}=A.\end{aligned}$$

Remark 3.1

Theorem A and and its corollary remains true if the condition

$$\begin{aligned}\Im (Q(\xi )d\xi ^2)\ne 0\,\,\,\, \text{ on } J.\end{aligned}$$

is weakened to

$$\begin{aligned}\Re (Q(\xi )d\xi ^2)<0\end{aligned}$$

at every point where \(\Im (Q(\xi )d\xi ^2)=0\,\,\,\, \text{ on } J.\)

We will use this in our proof to obtain the differential equation satisfied by the extremal function. The main aim of this paper is to solve generalized Zalcman conjcture for the initial coefficients for functions in \(\mathcal {S}\).

4 Proof of the main results

We first prepare some material which will be used in the proof of our main results.

Let \(a_2=x_2+iy_2\), \(a_3=x_3+iy_3\), \(a_4=x_4+iy_4\), and \(\mathcal {J}=a_4-\lambda a_2a_3\). Then a simple computation gives

$$\begin{aligned}\mathcal {J}=(x_4-\lambda x_2x_3+\lambda y_2y_3)+i(y_4-\lambda x_2y_3-\lambda y_2x_3).\end{aligned}$$

Further let

$$\begin{aligned}\mathcal {J}_{2}:=\frac{1}{2}\left( \frac{\partial \mathcal {J}}{\partial x_2}-i\frac{\partial \mathcal {J}}{\partial y_2}\right) =\frac{1}{2}(-2\lambda x_3-2i\lambda y_3)=-\lambda a_3\end{aligned}$$
$$\begin{aligned}{} & {} \mathcal {J}_{3}:=\frac{1}{2}\left( \frac{\partial \mathcal {J}}{\partial x_3}-i\frac{\partial \mathcal {J}}{\partial y_3}\right) =\frac{1}{2}(-2\lambda x_2-2i\lambda y_2)=-\lambda a_2. \end{aligned}$$
(4.1)
$$\begin{aligned}{} & {} \mathcal {J}_{4}:=\frac{1}{2}\left( \frac{\partial \mathcal {J}}{\partial x_4}-i\frac{\partial \mathcal {J}}{\partial y_4}\right) =\frac{1}{2}(1+1)=1 \end{aligned}$$
(4.2)
$$\begin{aligned} B= & {} a_2\mathcal {J}_2+2a_{3}\mathcal {J}_3+3a_4\mathcal {J}_4 \nonumber \\= & {} -a_2\lambda a_3-2a_3\lambda a_2+3a_4 \nonumber \\= & {} 3(a_4-\lambda a_2a_3) \end{aligned}$$
(4.3)
$$\begin{aligned} A_2= & {} a_2^{(2)}\mathcal {J}_2 +a_3^{(2)}\mathcal {J}_3+a_4^{(2)}\mathcal {J}_4 \nonumber \\= & {} \mathcal {J}_2+2a_2\mathcal {J}_3+(a_2^2+2a_3)\mathcal {J}_4 \nonumber \\= & {} -\lambda a_3-2\lambda a_2^2+a_2^2+2a_3 \nonumber \\= & {} (1-2\lambda )a_2^2+(2-\lambda )a_3 \end{aligned}$$
(4.4)
$$\begin{aligned} A_3= & {} a_3^{(3)}\mathcal {J}_3+a_4^{(3)}\mathcal {J}_4 \nonumber \\= & {} -\lambda a_2+3a_2 \nonumber \\= & {} (3-\lambda )a_2 \end{aligned}$$
(4.5)
$$\begin{aligned} A_4= & {} a_4^{(4)}\mathcal {J}_4=1 \end{aligned}$$
(4.6)
$$\begin{aligned} B_1= & {} \mathcal {J}_4=1 \end{aligned}$$
(4.7)
$$\begin{aligned} B_2= & {} \mathcal {J}_3+2a_2\mathcal {J}_4 \nonumber \\= & {} -\lambda a_2+2a_2 \nonumber \\= & {} (2-\lambda )a_2 \end{aligned}$$
(4.8)
$$\begin{aligned} B_3= & {} \mathcal {J}_2+2a_2\mathcal {J}_3+3a_3\mathcal {J}_4 \nonumber \\= & {} - \lambda a_3-2\lambda a_2^2+3a_3 \nonumber \\= & {} (3-\lambda )a_3-2\lambda a_2^2. \end{aligned}$$
(4.9)

Since the functional \(a_4-\lambda a_2a_3\) is rotationally invariant, we can consider the extremal problem

$$\begin{aligned}\max _{\mathcal {S}}\,\Re \{a_4-\lambda a_2a_3\}.\end{aligned}$$

By Schiffer’s variational method, the extremal function satisfies the following differential equation

$$\begin{aligned} z^2 \frac{w'^2}{w^5}\left( ((2-\lambda )a_3+(1-2\lambda )a_2^2)w^2+(3-\lambda )a_2w+1\right) \nonumber \\ =\frac{1}{z^3}\left( 1+Pz+Qz^2+Rz^3+Sz^4+Tz^5+z^6\right) , \end{aligned}$$
(4.10)

where

$$\begin{aligned} P= & {} (2-\lambda )a_{2}, \nonumber \\ Q= & {} (3-\lambda )a_3-2\lambda a_{2}^2, \nonumber \\ R= & {} 3(a_4-\lambda a_2a_3), \nonumber \\ S= & {} \overline{(3-\lambda )a_3-2\lambda a_{2}^2}, \nonumber \\ T= & {} (2-\lambda )\overline{a_2}. \end{aligned}$$
(4.11)

We are now ready to give the proofs of Theorems 2.1 and 2.2.

Proof of Theorem 2.1

The proof of this theorem requires Bombieri’s method [2] together with Schiffer’s variational method. By Schiffer’s variational method the image of \(|z|=1\) by any extremal function satisfies

$$\begin{aligned}\left( \frac{dw}{dt}\right) ^2\frac{1}{w^5}\left( -3a_{2}^2w^2+a_2w+1\right) +1=0,\end{aligned}$$

with a suitable parameter t. Take \(Q^*(w)dw^2\) as the associated quadratic differential, so that

$$\begin{aligned}Q^*(w)dw^2=-\frac{dw^2}{w^5}\left( -3a_{2}^2w^2+a_2w+1\right) .\end{aligned}$$

Let \(w=1/\xi \) and \(Q(\xi )d\xi ^2\) be \(Q^*(1/\xi )d(1/\xi )^2\), then

$$\begin{aligned}Q(\xi )d\xi ^2=-\left( -3a_2^2+a_2\xi +\xi ^2\right) \frac{d\xi ^2}{\xi }.\end{aligned}$$

Let \(a_2=x_2+iy_2\), and \(\xi \) be real. Then we have

$$\begin{aligned}\Im Q(\xi )d\xi ^2=(6x_2y_2-y_2\xi )\frac{d\xi ^2}{\xi }.\end{aligned}$$

Since \(y_2\ne 0\), \(\Im Q(\xi )d\xi ^2=0\) only if \(\xi =6x_2\), In view of Bombieri’s Theorem 1, and its Remark, we have

$$\begin{aligned}\Re Q(\xi )d\xi ^2=(-3y_2^2+3x_2^2-x_2\xi -\xi ^2)\frac{d\xi ^2}{\xi },\end{aligned}$$

and at \(\xi =6x_2\) we have

$$\begin{aligned}\Re Q(\xi )d\xi ^2=-\frac{1}{6x_2}(3y_2^2+39x_2^2)<0,\,\,\,\,\,( \text{ since } x_2>0).\end{aligned}$$

Thus from Bombieri’s Theorem 1 and its Remark the image \(\Gamma \) of \(|z|=1\) by \(\xi \) intersects the real axis only at the origin. It is easy to observe that \(a_2\) can’t be zero for any extremal function. Thus, \(\xi =0\) is a simple pole of \(Q(\xi )d\xi ^2\). Hence \(\xi =0\) is an end point of \(\Gamma \). Hence \(\Gamma \) must lie in either the upper or lower half plane. \(\square \)

Proof of Theorem 2.2

Taking \(\lambda =3\) in (4.10), we get that the extremal function satisfies the following differential equation

$$\begin{aligned}z^2 \frac{w'^2}{w^5}\left( (-a_3-5a_{2}^2)w^2+1\right) =g(z)=\frac{1}{z^3}\left( 1+Pz+Qz^2+Rz^3+Sz^4+Tz^5+z^6\right) , \end{aligned}$$

where

$$\begin{aligned} P= & {} -a_{2}, \nonumber \\ Q= & {} -6a_{2}^2, \nonumber \\ R= & {} 3(a_4-3a_2a_3), \nonumber \\ S= & {} -6\overline{a_{2}^2}, \nonumber \\ T= & {} -\overline{a_{2}}. \end{aligned}$$
(4.12)

The image of the unit circle \(|z|=1\) under \(w=f(z)\) has at least one finite end point, and a point on \(|z|=1\), which corresponds to a finite end point is a double zero of g(z). Therefore g(z) can be rewritten as

$$\begin{aligned} g(z)=\frac{1}{z^3}(z-E)^2(z^4+Dz^3+Cz^2+Bz+A). \end{aligned}$$
(4.13)

Comparing (4.12) and (4.13), we obtain

$$\begin{aligned}{} & {} BE^2-2AE= -a_{2},\\{} & {} A-2EB+E^2C= -6a_{2}^2,\\{} & {} B-2EC+E^2D= 3(a_4-3a_2a_3),\\{} & {} C-2ED+E^2= -6\overline{a_{2}^2},\\{} & {} D-2E= -\overline{a_{2}}. \end{aligned}$$

Further g satisfies the following functional equation

$$\begin{aligned}g(z)=\overline{g\left( \frac{1}{\overline{z}}\right) }.\end{aligned}$$

It is easy to see that

$$\begin{aligned}{} & {} AE^2=\overline{A}\overline{E}^2=1,\,\,\,\, |A|=|E|=1,\\{} & {} BE^2-2AE= \overline{D}-2\overline{E},\\{} & {} A-2EB+E^2C= \overline{C}-2\overline{E}\overline{D}+\overline{E}^2,\\{} & {} B-2EC+E^2D= \overline{B}-2\overline{E}\overline{C}+\overline{E}^2\overline{D}. \end{aligned}$$

Since

$$\begin{aligned}BE^2-2AE= \overline{D}-2\overline{E},\end{aligned}$$

we have,

$$\begin{aligned}D=\overline{B}\overline{E}^2-2\overline{A}\overline{E}+2E=\overline{B}\frac{1}{\overline{A}}+2E-2\frac{1}{\overline{E}^2}\overline{E}=\overline{B}A+2E-2E,\end{aligned}$$

and hence

$$\begin{aligned} D=\overline{B}A. \end{aligned}$$
(4.14)

Further,

$$\begin{aligned}A-2EB+E^2C= \overline{C}-2\overline{E}\overline{D}+\overline{E}^2,\end{aligned}$$

which implies that

$$\begin{aligned} C= & {} \frac{\overline{C}}{E^2}-\frac{2\overline{E}\overline{D}}{E^2}+\frac{\overline{E}^2}{E^2}-\frac{A}{E^2}+\frac{2EB}{E^2}\\= & {} \overline{C}A-\frac{2\overline{E}B\overline{A}}{E^2}+\frac{1}{E^4}-\frac{1}{E^4}+\frac{2B}{E}\\= & {} \overline{C}A-\frac{2B}{A}+\frac{2B}{A}. \end{aligned}$$

Therefore,

$$\begin{aligned} C=\overline{C}A. \end{aligned}$$
(4.15)

By summarising we have the following relations:

$$\begin{aligned}AE^2=1,\,\,\,\, |A|=|E|=1,\,\,\,\, D=\overline{B}A,\,\,\,\, C=\overline{C}A\end{aligned}$$
$$\begin{aligned}{} & {} BE^2-2AE = -a_{2}\end{aligned}$$
(4.16)
$$\begin{aligned}{} & {} A-2EB+E^2C = -6a_{2}^2 \end{aligned}$$
(4.17)
$$\begin{aligned}{} & {} B-2EC+E^2D = 3(a_4-3a_2a_3). \end{aligned}$$
(4.18)

Let \( E=e^{i\theta }\, \text{ then } A= e^{-2i\theta },\,\,\,\, C=re^{i\beta },\,\,\,\, B=se^{i\alpha }.\) From (4.15) we obtain \(e^{-2i\beta }=e^{2i\theta }\), which implies that \(e^{i(\beta +\theta )}=e^{p\pi i}\), where \( p\in \mathbb {Z}.\)

From (4.16), (4.16) we obtain the following relations

$$\begin{aligned}{} & {} se^{i\alpha }-2e^{-3i\theta }=-a_2 e^{-2i\theta } \end{aligned}$$
(4.19)
$$\begin{aligned}{} & {} re^{i(\theta +\beta )}-2se^{i\alpha }+e^{-3i\theta }= -6\,a_2^{2}\,e^{-i\theta }. \end{aligned}$$
(4.20)

Also using (4.14) in (4.17) we obtain

$$\begin{aligned}3(a_4-3a_2a_3)=B+\overline{B}-2EC=2{\text {Re}}(B)-2re^{i(\theta +\beta )}=2\,s\cos \alpha -2\,r\cos p\pi ,\end{aligned}$$

Thus,

$$\begin{aligned} (3a_2a_3-a_4)=\frac{2}{3}(r\,\cos \,p\pi -s\,\cos \,\alpha ). \end{aligned}$$
(4.21)

Let \(-a_2=|-a_2|e^{i\phi }=|a_2|e^{i\phi }\), then from (4.19) we have

$$\begin{aligned} s\,\cos \,\alpha -\cos \,3\theta =|a_2|\,\cos \,(\phi -2\theta )+\cos \,3\theta . \end{aligned}$$
(4.22)

Also from (4.20) we obtain

$$\begin{aligned} r\,\cos \,p\pi -s\,\cos \,\alpha -s\,\cos \,\alpha +\cos \,3\theta =-6|a_2|^2\,\cos (2\phi -\theta ), \end{aligned}$$
(4.23)

and using (4.22) in (4.23), we obtain

$$\begin{aligned} r\,\cos \,p\pi -s\,\cos \,\alpha =-6|a_2|^2\,\cos (2\phi -\theta )+|a_2|\,\cos \,(\phi -2\theta )+\cos \,3\theta . \end{aligned}$$
(4.24)

Let \(|a_2|=R\), \(0<R\le 2\) and consider the function

$$\begin{aligned}G(R,\theta ,\phi )=-6R^2\,\cos \,(2\phi -\theta )+R\,\cos (2\theta -\phi )+\cos \,3\theta .\end{aligned}$$

To find the maximum of G, first we need to find critical points. It is easy to see that

$$\begin{aligned} \frac{\partial G}{\partial R}= & {} -12R\,\cos \,(2\phi -\theta )+\cos \,(2\theta -\phi )=0,\\ \frac{\partial G}{\partial \theta }= & {} -6R^2\,\sin \,(2\phi -\theta )+2R\sin \,(\phi -2\theta )-3\sin \, 3\theta =0, \end{aligned}$$

and

$$\begin{aligned}\frac{\partial G}{\partial \phi }=12R^2\,\sin \,(2\phi -\theta )-R\,\sin (\phi -2\theta )=0.\end{aligned}$$

Hence

$$\begin{aligned}-12R\,e^{i(\theta -2\phi )}+e^{i(2\theta -\phi )}=0,\end{aligned}$$

This gives us that at the points which gives the maximum of \(G(R,\theta ,\phi )\), we have

$$\begin{aligned}R=\frac{1}{12}, \text{ and } e^{i\phi }=e^{-i\theta }.\end{aligned}$$

So at \(R=1/12,\) we have

$$\begin{aligned}G=(-6R^2+R+1)\,\cos \,3\phi \le 1.\end{aligned}$$

Now we check the value of G under the condition \(e^{i\phi }=e^{-i\theta }\) and obtain

$$\begin{aligned}G=(-6R^2+R+1)\,\cos \,3\phi =(6R^2-R-1)\, (-\cos \,3\phi )\le 21.\end{aligned}$$

So the maximum is attained at \(R=2\), which gives \(G\le 21,\) and so \(\Re (3a_2a_3-a_4)\le 14,\) i.e., \(|3a_2a_3-a_4|\le 14,\) with equality only for the Koebe function \(k(z)=z/(1-z)^2\) and its rotations, which completes the proof. \(\square \)