Abstract
We present a method to prove an inequality concerning a linear combination of symmetric monomial functions. This is based on Muirhead’s inequality combining with a graph theoretical setting. As an application we prove some interesting inequalities motivated from extremal combinatorics.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We start with the following conjecture due to the last author.
Conjecture 1
Let \(r\ge 3\) be an integer, let \(a_1,\ldots ,a_r\in (0,1)\) be real numbers, and let \(a=a_1\cdots a_r\). Then we have
with equality holding if and only if \(a_1=\cdots =a_r\).
This conjecture is motivated by study of multiply intersecting hypergraphs, where one of the main tools is the so-called random walk method, see Chapter 15 of [1]. Here we briefly explain how Conjecture 1 is related to a problem of random walk. Let \(p\in (0,1-\frac{1}{r})\) be a real number, and let us define an infinite random walk \(W_p\) in the two-dimensional grid \({\mathbb {Z}}^2\). The walk \(W_p\) starts at the origin, and at each step it moves from (x, y) to \((x,y+1)\) (one step up) with probability p and from (x, y) to \((x+1,y)\) (one step right) with probability \(1-p\). Then \(W_p\) hits the line \(y=(r-1)x+1\) with probability \(\alpha _p\), where \(\alpha _p\in (0,1)\) is a unique root of the equation
Let \(p_1,\ldots ,p_r\in (0,1-\frac{1}{r})\) be real numbers. Let \(W'\) be another infinite random walk defined similarly to \(W_p\), but this time, at step j (\(j=1,2,\dots \)) the walk takes up with probability \(p_i\) and right with probability \(1-p_i\), where \(i:=j \bmod r\). This walk \(W'\) hits the line \(y=(r-1)x+r\) with probability \(\beta \), where \(\beta \in (0,1)\) is a unique root of the equation
We are interested in \(\beta \) because we can use it to bound the measure of multiply intersecting hypergraphs. We also mention that computing (or approximating) \(\beta \) is more difficult than that of \(\alpha _p\). Thus it is desirable that \(\beta \) is bounded in terms of \(\alpha _{p_i}\). Indeed we conjecture that \(\beta \le \alpha _{p_1} \alpha _{p_2}\cdots \alpha _{p_r}\), which follows from (1) (if true). See Appendix for the proof.
In this paper we prove Conjecture 1 for the cases \(3\le r\le 11\) with aid of computer search. For the proof we extend Muirhead’s inequality, and propose an approach to prove a more general inequality concerning a linear combination of symmetric monomial functions.
Let r and d be fixed positive integers, and let
where \(|\lambda |:=\lambda _1+\cdots +\lambda _r\), that is, \(\Lambda \) is a set of non-increasing sequences (vectors) representing a partition of d into r parts. For \(\lambda \in \Lambda \) we define the symmetric monomial functions \(m_\lambda (x)\) of degree d in variables \(x=(x_1,\ldots ,x_r)\) by
where the sums run over all distinct orderings (permutations) \(\sigma =(\sigma _1,\ldots ,\sigma _r)\) of the vector \(\lambda =(\lambda _1,\ldots ,\lambda _r)\). We also define the normalized symmetric monomial functions \({\bar{m}}_\lambda (x)\) by
For example, if \(r=d=4\) and \(\lambda =(2,2,0,0)\) then
and \({\bar{m}}_\lambda (x)=m_\lambda (x)/6\). (For more about monomial symmetric functions, see e.g., Chapter 7 of [5].) For \(\lambda ,\mu \in \Lambda \) we write \({\bar{m}}_\lambda \ge {\bar{m}}_\mu \) if \({\bar{m}}_\lambda (x)\ge {\bar{m}}_\mu (x)\) holds for all \(x\ge 0\), where \(x\ge 0\) means that \(x=(x_1,\ldots ,x_r)\) satisfies \(x_i\ge 0\) for all \(1\le i\le r\). In [4] (see also [2, 3]) Muirhead proved that
where \(\lambda\, \succ\, \mu \) means
Moreover \({\bar{m}}_\lambda (x)= {\bar{m}}_\mu (x)\) if and only if \(\lambda =\mu \) or \(x_1=\cdots =x_r\).
Now we define a bipartite graph \({\mathcal{G}}=(V({\mathcal{G}}),E({\mathcal{G}}))\) corresponding to \(\Lambda \) as follows. For the vertex set let \(V({\mathcal{G}})=U\sqcup U'\), where U and \(U'\) are distinct copies of \(\Lambda \). Then two vertices \(\lambda \in U\) and \(\lambda '\in U'\) are adjacent in \({\mathcal{G}}\) if and only if \(\lambda\, \succ\, \lambda '\). Let \(c:V({\mathcal{G}})\rightarrow {\mathbb {R}}_{\ge 0}\) be a given capacity function. We say that a flow \(\phi :E({\mathcal{G}})\rightarrow {\mathbb {R}}_{\ge 0}\) is optimal if
Let \(\alpha :\Lambda \rightarrow {\mathbb {R}}_{\ge 0}\), which can be viewed as a coefficient vector \(\alpha =(\alpha _\lambda :\lambda \in \Lambda )\in {\mathbb {R}}_{\ge 0}^{\Lambda }\). Then we define a linear combination of normalized symmetric monomial functions
Let \(\alpha ,\alpha '\in {\mathbb {R}}_{\ge 0}^{\Lambda }\) and let c be a capacity function on \(V({\mathcal{G}})=U\sqcup U'\) defined by \(c=\alpha \) on U and \(c=\alpha '\) on \(U'\). We say that \(\alpha \) suppresses \(\alpha '\), denoted by \(\alpha\, \succ\, \alpha '\), if \({\mathcal{G}}\) admits an optimal flow. While the Muirhead’s inequality says that if \(\lambda\, \succ\, \mu \) then \({\bar{m}}_\lambda\, \succ\, {\bar{m}}_\mu \), our result stated below claims that if \(\alpha\, \succ\, \alpha '\) then \({\bar{m}}(\alpha )\ge {\bar{m}}(\alpha ')\). This can be seen as an extension from the inequality about normalized symmetric monomial functions to the inequality about linear combinations of them.
Theorem 1
Let \(\alpha ,\alpha '\in {\mathbb {R}}_{\ge 0}^{\Lambda }\). If \(\alpha\, \succ\, \alpha '\) then \({\bar{m}}(\alpha )\ge {\bar{m}}(\alpha ')\) with equality holding if and only if the following two conditions are satisfied:
-
(i)
\(\sum _{\lambda \in e}\phi (e)=c(\lambda )\) for all \(\lambda \in U\), and
-
(ii)
\({\bar{m}}_\lambda \ge {\bar{m}}_{\lambda '}\) for all adjacent \(\lambda \in U\) and \(\lambda '\in U'\).
Proof
Let \({\mathcal{G}}\) be the bipartite graph on \(V({\mathcal{G}})=U\sqcup U'\) defined above, and let \(\phi \) be an optimal flow. If \(\lambda \in U\) and \(\lambda '\in U'\) are adjacent by an edge e in \({\mathcal{G}}\), then \({\bar{m}}_\lambda \ge {\bar{m}}_{\lambda '}\) and \(\phi (e){\bar{m}}_\lambda \ge \phi (e){\bar{m}}_{\lambda '}\). Using this trivial fact we have
One can readily verify the equality conditions. \(\square \)
We note that \({\bar{m}}(\alpha )\ge {\bar{m}}(\alpha ')\) does not necessarily imply \(\alpha \succ \alpha '\), see Example 1 in the last section. Using Theorem 1 we were able to verify Conjecture 1 for \(3\le r\le 11\).
Theorem 2
Conjecture 1 is true for \(3\le r\le 11\).
Though our proof of Theorem 2 is based on Theorem 1, we need two more ideas. First, we actually prove a stronger inequality (see Conjecture 2) holding for all non-negative variables, from which we derive the inequality (1). This is needed because (1) holds only for variables in the unit interval, while Theorem 1 only applies to inequalities valid for all non-negative variables. Second, as a bipartite graph applied to Theorem 1 we do not use the graph whose vertex set is \(\Lambda \) because it is too large. Instead we construct a graph on a much smaller vertex set which has a nicer poset structure induced by (3) (see Theorem 3). This reduces the computation markedly.
2 Proof of Theorem 2
Throughout this section let \(r\ge 3\) be a fixed integer. As mentioned in the end of the previous section the inequality (1) is not suitable for applying Theorem 1. So we should find an inequality which holds for all non-negative variables and implies (1). This inequality will be obtained by factorizing (1).
For \(s=(s_1,\ldots ,s_r)\) and \(1\le k\le r\) we define the elementary symmetric functions \(e_k(s)\) by
see, e.g., [5]. Let \(e_0(s):=1\). Then we have
To factorize the LHS of (1), we apply the above identity to \(s=(s_1,\ldots ,s_r)\), where
and \(z=1\) or \(z=a\). Then we have
Thus, writing \(e_k\) instead of \(e_k(s)\) for simplicity, the LHS of (1) is rewritten as
where
Letting
we have
and also
Consequently we obtain the following expression:
where
Finally to prove Conjecture 1 it suffices to show the following conjecture.
Conjecture 2
Let \(r\ge 3\) be an integer, and let \(a_1,\ldots ,a_r\) be non-negative real numbers. Then
where F and G are defined by (7) and (8) with (5) and (6). Moreover, equality holds if and only if \(a_1=\cdots =a_r\).
Note that Conjecture 2 is slightly stronger than Conjecture 1 because the condition for \(a_i\) is weaker. Moreover this condition is precisely what we need to apply Theorem 1. Note also that \(F-G\ge 0\) if
for every d, where \(F^{(d)}\) (resp. \(G^{(d)}\)) denote the degree d part of F (resp. G).
Now we translate the problem of showing \(F^{(d)}-G^{(d)}\ge 0\) (for fixed r and d) to a problem of finding an optimal flow. Recall the definitions of \(\Lambda \) and \({\bar{m}}(\alpha )\) from (2) and (4). Let \(\alpha ,\alpha '\in {\mathbb {R}}_{\ge 0}^\Lambda \) be such that
Then, by Theorem 1, \(F^{(d)}-G^{(d)}\ge 0\) follows from \(\alpha\, \succ\, \alpha '\). Thus our problem is translated to show \(\alpha\, \succ\, \alpha '\). However the number of non-zero entries in \(\alpha \) and \(\alpha '\) grows rapidly as r grows, and it is not so easy to verify \(\alpha\, \succ\, \alpha '\) in this naive setting in practice. To overcome the difficulty we look at the posets derived from the polynomials \(F^{(d)}\) and \(G^{(d)}\) in detail, and we reduce the complexity using the structure of the posets.
Let \(\Lambda (\alpha ')=\{\mu \in \Lambda :\alpha '_\mu >0\}\) be the set of partitions corresponding to \(G^{(d)}\). For \(\mu \in \Lambda (\alpha ')\) it follows from (8) that \(\mu _1\le r-2\). We partition \(\Lambda (\alpha ')\) by the value of \(\mu _r\). Let
Then we have \(\Lambda (\alpha ')=\bigsqcup _{h=0}^{r-2}{\tilde{Q}}_h\), where
Lemma 1
Let \(\Lambda (\alpha )=\{\lambda \in \Lambda :\alpha _\lambda >0\}\). Then we have \(\Lambda (\alpha )=\bigsqcup _{h=0}^{r-2}{\tilde{P}}_h\), where
Proof
In view of (7), \(\lambda \in {\tilde{P}}_h\) comes from \(f_{r-2-h}a^h\). Then, \(\lambda \) is decomposed into two parts \(\mu =(\mu _1,\ldots ,\mu _r)\) from \(f_{r-2-h}\) and \(\nu =(\nu _1,\ldots ,\nu _r)\) from \(a^{h}\). By (5) and (6) we have
We also have \(\nu _1=\cdots =\nu _r=h\). Then the set of sequences \(\lambda =\mu +\nu \) defines \({\tilde{P}}_h\). \(\square \)
We note that \(I:=\Lambda (\alpha )\cap \Lambda (\alpha ')\) is nonempty. For example we have
We want to look at \(\Lambda (\alpha )\setminus I\) and \(\Lambda (\alpha ')\setminus I\) rather than \(\Lambda (\alpha )\) and \(\Lambda (\alpha ')\). To this end we need some preparation. Recall that \(\Lambda \) itself is a poset, and the bipartite graph \({\mathcal{G}}\) is defined on \(V({\mathcal{G}})=U\sqcup U'\), where both U and \(U'\) are distinct copies of \(\Lambda \). For \(x\in {\mathbb {R}}^{\Lambda }\) let \(p(x):=(y_{\lambda }:\lambda \in \Lambda )\), where
that is, p(x) extracts positive entries from x. Let \(\beta :=p(\alpha -\alpha ')\) and \(\beta ':=p(\alpha '-\alpha )\). Then \(\alpha -\alpha '=\beta -\beta '\) and
So our aim is to show that \({\bar{m}}(\beta -\beta ')\ge 0\). These coefficient vectors \(\beta \) and \(\beta '\) define two subposets of \(\Lambda \):
These posets are equipped with a nice property as stated in the next theorem, and this is the reason why we can efficiently verify the existence of an optimal flow in \({\mathcal{G}}\) with the capacity function coming from \(\beta \) and \(\beta '\).
Theorem 3
There exist unique positive integer k and partitions
with representatives \(\tilde{\lambda }_1,\ldots ,\tilde{\lambda }_k\) and \(\tilde{\mu }_1,\ldots ,\tilde{\mu }_k\) satisfying \(\tilde{\lambda }_i=\min A_i\), \(\tilde{\mu }_i=\max B_i\), and \(\tilde{\lambda }_i\,\succ\, \tilde{\mu }_i\) for all \(1\le i\le k\).
For a concrete example of such partitions, see Example 2 in the next section. We remark that \(A_i\) (resp. \(B_i\)) does not necessarily have the maximum (resp. minimum) element, see Example 3.
We partition \(\Lambda (\beta )\) and \(\Lambda (\beta ')\) by the value of the last element. It follows from (9) and (10) that \(\Lambda (\beta )=\bigsqcup _{h=0}^{r-3}P_h\) and \(\Lambda (\beta ')=\bigsqcup _{h=0}^{r-3}Q_h\), where
Then the disjoint union \(P_h\sqcup Q_h\) is a proper subset of \(\Lambda _h\).
We have already fixed r and d, and now we fix h. We will define two maps \(D^\infty :P_h\rightarrow Q_h\) and \(U^\infty :Q_h\rightarrow P_h\), which will play a key role for the proof of Theorem 3. To this end we need two auxiliary maps D and U (down and up, respectively). Before going into the details of the proof we explain our plan. Let \(\lambda \in P_h\) and \(\mu \in Q_h\) with \(\lambda\,\succ\,\mu \). We can draw the Young diagrams corresponding to \(\lambda \) and \(\mu \), and starting from \(\lambda \) we can get \(\mu \) by moving a box at a right upper corner to a left lower corner one by one. (We will give a formal definition of such operations shortly.) In this process we can find \(D^\infty (\lambda )\) and \(U^\infty (\mu )\) such that \(\lambda\,\succ\,U^\infty (\mu )\,\succ\, D^\infty (\lambda )\,\succ\, \mu \) with some additional nice properties. Actually we will get \(D^\infty (\lambda )\) from \(\lambda \) by repeating the down map D finitely many times, and we will get \(U^\infty (\mu )\) from \(\mu \) by repeating the up map U finitely many times.
Definition 1
For \(\lambda \in \Lambda _h\setminus Q_h\) define p and q as follows.
(Case I) If \(\lambda _1\ge r-1\) then
(Case II) If \(\lambda _1<r-1\) and \(\lambda _{r-1-h}=\lambda _r\) then
Define a map \(D:\Lambda _h\rightarrow \Lambda _h\) by \(D(\lambda ):=\lambda '\) if \(\lambda \not \in Q_h\), where
and by \(D(\lambda ):=\lambda \) if \(\lambda \in Q_h\). (See Example 2.)
Definition 2
For \(\mu \in \Lambda _h\setminus P_h\) define p and q as follows.
(Case III) If \(\mu _{r-1-h}>\mu _r\) then
(Case IV) If \(\mu _{r-1-h}=\mu _r\) and \(\mu _1\le r-2\) then
Define a map \(U:\Lambda _h\rightarrow \Lambda _h\) by \(U(\mu ):=\mu '\) if \(\mu \not \in P_h\), where
and by \(U(\mu ):=\mu \) if \(\mu \in P_h\).
Lemma 2
Let \(\lambda \in \Lambda _h\setminus Q_h\) and \(\mu \in Q_h\). If \(\lambda\,\succ\,\mu \) then \(D(\lambda )\,\succ\, \mu \).
Proof
Let \(\lambda '=D(\lambda )\). By definition of \(\lambda '\) we only need to check that
for all \(p\le i<q\) because if \(i<p\) or \(i\ge q\) then the sum for \(\lambda '\) and the sum for \(\lambda \) are the same.
For Case I we have \(\lambda '_i\ge \lambda _i-1\ge r-2\ge \mu _i\) for each \(p\le i<q\), which yields (11).
For Case II we note that if \(p<j<q\) then \(h+1=\lambda _j\le \mu _j\) and \(h=\lambda _q<\mu _q\) (and if \(j>q\) then \(\lambda _j=\mu _j\)). Hence, for \(p\,\le\, i\,<\,q\), we have
Since \(|\lambda |=|\mu |=d\) it follows that
Thus we have
as needed. \(\square \)
Lemma 3
Let \(\mu \in \Lambda _h\setminus P_h\) and \(\lambda \in P_h\). If \(\lambda\,\succ\,\mu \) then \(\lambda\,\succ\,U(\mu )\).
Proof
Let \(\mu '=U(\mu )\). We only need to show that
for all \(p\le i<q\).
For Case III suppose, to the contrary, that
for some \(p\le i<q\). The RHS is equal to \(\mu _1+\cdots +\mu _i+1\), and at most \(\lambda _1+\cdots +\lambda _i+1\) because \(\lambda\,\succ\,\mu \). Thus we have
Since \(|\lambda |=|\mu |\) we also have \(\lambda _{i+1}+\cdots +\lambda _r=\mu _{i+1}+\cdots +\mu _r\). Moreover, noting \(\lambda _j<\mu _j\) for \(r-1-h\le j\le q\), and \(\lambda _j=\mu _j\) for \(q<j\le r\), we have that \(p\le i<r-1-h\) and
This together with \(\mu _{i+1}=\mu _{r-2-h}\) gives us \(\lambda _{i+1}>\mu _{i+1}\) and
On the other hand it follows from (13) and \(\lambda _1+\cdots +\lambda _{i-1}\ge \mu _1+\cdots +\mu _{i-1}\) that \(\lambda _i\le \mu _i\). This is a contradiction.
For Case IV suppose (12). Since \(\lambda _1\ge r-1\ge \mu _1+1=\mu '_1\) we may assume that \(2\le i<q\). Using \(\lambda _1>\mu _1\) and \(\mu _2=\mu _i\) with (13) we have \(\lambda _i<\mu _i\). Then \(\lambda _{i+1}\le \lambda _i<\mu _i=\mu _{i+1}\) and
which contradicts the assumption \(\lambda\,\succ\,\mu \). \(\square \)
Let \(D^n=(D\circ \cdots \circ D)\) (n times), and define \(D^\infty =\lim _{n\rightarrow \infty }D^n\). If \(\lambda \in P_h\) then \(D^\infty (\lambda )=D^n(\lambda )\) for some \(n\le r\). Indeed we first repeat Case I until \(\lambda _1<r-1\), and next repeat Case II until \(\lambda _{r-1-h}>\lambda _r\), and then eventually the resulting \(\lambda \) comes into \(Q_h\). Thus \(D^\infty \) is a map from \(P_h\) to \(Q_h\). Similarly we define a map \(U^\infty :Q_h\rightarrow P_h\) by \(U^\infty =\lim _{n\rightarrow \infty }U^n\), which is actually obtained by applying U at most r times. By Lemma 2 and Lemma 3 we get the following results. (See Example 3.)
Lemma 4
Let \(\lambda \in P_h\) and \(\mu \in Q_h\) with \(\lambda\,\succ\,\mu \).
-
(1)
\(D^\infty (\lambda )\,\succ\, \mu \) and \(D^\infty (\lambda )=\max \{\mu '\in Q_h:\lambda\,\succ\,\mu '\}\).
-
(2)
\(\lambda\,\succ\,U^\infty (\mu )\) and \(U^\infty (\mu )=\min \{\lambda '\in P_h:\lambda '\,\succ\, \mu \}\).
Lemma 5
It follows that
-
(i)
\((U^\infty \circ D^\infty )(\lambda )=\lambda \) for all \(\lambda \in U^\infty (Q_h)\), and
-
(ii)
\((D^\infty \circ U^\infty )(\mu )=\mu \) for all \(\mu \in D^\infty (P_h)\).
Proof
Let \(\mu \in Q_h\), and define \(\lambda :=U^\infty (\mu )\), \(\mu ':=D^\infty (\lambda )\), and \(\lambda ':=U^\infty (\mu ')\).
Since \(\lambda =U^\infty (\mu )\) we have \(\lambda\,\succ\,\mu \). Then by (1) of Lemma 4 we have
Thus \(\{\lambda ''\in P_h:\lambda ''\,\succ\, \mu '\}\subset \{\lambda ''\in P_h:\lambda ''\,\succ\, \mu \}\) and taking the minimum element of each set we get \(\lambda '\,\succ\, \lambda \). On the other hand \(\lambda\,\succ\,\mu '\) follows from \(\mu '=D^\infty (\lambda )\). Applying (2) of Lemma 4 we obtain \(\lambda\,\succ\,U^\infty (\mu ')=\lambda '\). Consequently \(\lambda =\lambda '\) and \((U^\infty \circ D^\infty )(\lambda )=U^\infty (\mu ')=\lambda '=\lambda \). This proves (i) of this lemma. One can show (ii) similarly. \(\square \)
Proof of Theorem 3
By Lemma 5 there exists \(k=k(h)\) such that \(U^\infty (Q_h)=\{\tilde{\lambda }_1,\ldots ,\tilde{\lambda }_k\}\) and \(D^\infty (P_h)=\{\tilde{\mu }_1,\ldots ,\tilde{\mu }_k\}\). For \(1\le i\le k\) let
Then \(P_h=A_1\sqcup \cdots \sqcup A_k\) and \(Q_h=B_1\sqcup \cdots \sqcup B_k\). Moreover, by Lemma 4, we have \(\tilde{\lambda }_i=\min A_i\) and \(\tilde{\mu }_i=\max B_i\). Finally we get the desired partitions \(\Lambda (\beta )=\bigsqcup _{h=0}^{r-3}\bigsqcup _{i=1}^{k(h)}A_i\) and \(\Lambda (\beta ')=\bigsqcup _{h=0}^{r-3}\bigsqcup _{i=1}^{k(h)}B_i\). \(\square \)
Using Theorem 3 we can further reduce the problem and decrease the computation sharply. This reduction is based on the following simple observation. By Theorem 3 we have \(\tilde{\lambda }_i=\min A_i\) implying
where \({\tilde{c}}_i:=\sum _{\lambda \in A_i}\beta _\lambda \), and similarly
where \({\tilde{c_i}}{'}_i:=\sum _{\mu \in B_i}\beta '_\mu \). So we define two coefficient vectors \(s,s'\in {\mathbb {R}}_{\ge 0}^{\Lambda }\) as follows: for each \(\lambda ,\mu \in \Lambda \) let
Then it follows from the definition that \({\bar{m}}(\beta )\ge {\bar{m}}(s)\), \({\bar{m}}(s')\ge {\bar{m}}(\beta ')\), and
Thus our aim is now reduced to showing \(s\,\succ\, s'\), which yields \({\bar{m}}(s-s')\ge 0\) and so \({\bar{m}}(\beta -\beta ')\ge 0\), as required. Finally, similarly as we changed from \(\alpha ,\alpha '\) to \(\beta ,\beta '\) we modify \(s,s'\) one last time to get \(t,t'\in {\mathbb {R}}_{\ge 0}^\Lambda \) by \(t:=p(s-s')\) and \(t':=p(s'-s)\). Since \(t-t'=s-s'\) it suffices to show \(t\,\succ\, t'\) to verify \(s\,\succ\, s'\), see Example 4 in the next section. In summary we have shown that
The point is that the number of non-zero entries in t and \(t'\) is much smaller than those in \(\alpha \) and \(\alpha '\), for example, if \(r=11\) and \(d=52\) then the former is only 367 while the latter is 6594. This is the reason why such reductions make the computation much faster. Consequently we can complete the proof of Conjecture 2 by showing \(t\,\succ\, t'\), and we have indeed verified this for \(3\le r\le 11\) (and all d) with aid of a computer.
Thus the inequality in Conjecture 2 holds for \(3\le r\le 11\). Moreover if \(a_1=\cdots =a_r\), then clearly \(F-G=0\). On the other hand if \(F-G=0\) then we need \({\bar{m}}(\alpha )-{\bar{m}}(\alpha ')=0\). This together with (ii) of Theorem 1 implies that \({\bar{m}}_\lambda ={\bar{m}}_{\lambda '}\) for all adjacent \(\lambda \) and \(\lambda '\) in the graph \({\mathcal{G}}\). Therefore all \(a_i\) are the same, which verifies the equality condition in Conjecture 2. This completes the proof of Theorem 2.
3 Some Examples
Example 1
We present an example showing that the converse of Theorem 1 does not hold, that is, an example satisfying \({\bar{m}}(\alpha )\ge {\bar{m}}(\alpha ')\) but \(\alpha \,\not\succ\,\alpha '\).
Let \(r=2\) and \(d=4\). Then \(\Lambda =\{(4,0),(3,1),(2,2)\}\), and
Let \(\alpha =(2,0,2)\) and \(\alpha '=(0,4,0)\). Then we have
A routine calculus shows that \({\bar{m}}(\alpha )\ge {\bar{m}}(\alpha ')\) for all \(x_1,x_2\ge 0\).
Let \({\mathcal{G}}\) be the corresponding bipartite graph. Then, by writing only vertices with positive capacities, we have \(V({\mathcal{G}})=\{(4,0),(2,2)\}\sqcup \{(3,1)\}\). There is only one edge joining (4, 0) and (3, 1), where the capacity of (4, 0) is 2 while the capacity of (3, 1) is 4. Thus there is no optimal flow, and \(\alpha \,\not\succ\, \alpha '\).
Example 2
Let \(r=8\), \(d=29\), and \(h=0\). Let \(\tilde{\lambda }= (7,7,5,4,4,2,0,0)\in P_0\) and \(\tilde{\mu }=(6,6,6,5,4,1,1,0)\in Q_0\). We get the following sequence by applying D:
This shows that \(D^\infty (\tilde{\lambda })=D^3(\tilde{\lambda })=\tilde{\mu }\). See Figure 1, where the map D sends a box with ‘\(\bullet \)’ to the position marked by ‘\(\times \).’
Example 3
(Continued from Example 2) Let
Then it follows that \(A=\{76554200,76555100,76644200,\tilde{\lambda }\}\), and \(B=\{\tilde{\mu }\}\), and moreover \(\min A=\tilde{\lambda }\) (but A does not have the maximum element) and \(\max B=\tilde{\mu }\), see Figure 2, where black arrows correspond to D and red arrows correspond to U.
Example 4
We describe the partitions in Theorem 3 for the case \(r=7\) and \(d=15\). Then \(\Lambda (\beta )=\{\lambda _1,\ldots ,\lambda _{19}\}\) and \(\Lambda (\beta ')=\{\mu _1,\ldots ,\mu _{19}\}\) listed below with their capacities, e.g., \(\lambda _1=(6,2,2,2,1,1,1)\) and \(c(\lambda _1)=140\).
Then we can view \(\Lambda (\beta )\) and \(\Lambda (\beta ')\) as posets, where the partial order is introduced by the majorization. Figure 3 shows the corresponding Hasse diagrams, where \(\lambda _i\) is denoted by and \(\mu _j\) is denoted by \(\boxed {j}\).
The partitions in Theorem 3 in this case are
where
The representatives \(\tilde{\lambda }_i=\min A_i\) and \(\tilde{\mu }_i=\max B_i\) are given by
Example 5
Here we present an example of the reduction after applying Theorem 3. Let \(r=8\), \(d=29\). In this case the number of partitions in Theorem 3 is \(k=12\), and the corresponding representatives are listed below.
The corresponding capacities are the following.
Then to construct t and \(t'\) we compute
Figure 4 shows the reduced graph for t and \(t'\), that is, the top five vertices are corresponding to \({\bar{m}}(t)\) and the bottom six vertices are corresponding to \({\bar{m}}(t')\), and \(\tilde{\lambda }\) and \(\tilde{\mu }\) are adjacent if \(\tilde{\lambda }\,\succ\, \tilde{\mu }\). In this picture we label the vertices by \({\tilde{c}}_i-{\tilde{c}}'_i\) instead of the capacity \(|{\tilde{c}}_i-{\tilde{c}}'_i|\). Then one of the optimal flows is shown in Fig. 5.
Change history
27 September 2021
The original version is updated due to the appearance of special characters in the HTML version.
References
Frankl, P., Tokushige, N.: Extremal problems for finite sets. Student Mathematical Library, 86. American Mathematical Society, Providence (2018)
Garling, D.: Inequalities: a journey into linear analysis. Cambridge University Press, Cambridge (2007)
Hardy, G., Littlewood, J., Pólya, G.: Inequalities. Reprint of the 1952 edition. Cambridge Mathematical Library, Cambridge University Press, Cambridge (1988)
Muirhead, R.F.: Some methods applicable to identities and inequalities of symmetric algebraic functions of \(n\) letters. Proc. Edinburgh Math. Soc. 21, 144–157 (1903)
Stanley, R.: Enumerative combinatorics. Vol. 2. Cambridge Studies in Advanced Mathematics, 62. Cambridge University Press, Cambridge (1999)
Acknowledgements
The authors thank the referees for their very careful reading and helpful suggestions. The second author is supported by JSPS KAKENHI 19K03398. The last author is supported by JSPS KAKENHI 18K03399.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Write \(a_i\) for \(\alpha _{p_i}\), and let \(q_i=1-p_i\).
Fact 1
Let \(t\ge 1\) be an integer, and let \(l_j\) be the line \(y=(r-1)x+j\). Then the walk \(W_{p_i}\) hits the line \(l_t\) with probability \(a_i^t\), and the walk \(W'\) hits the line \(l_{rt}\) with probability \(\beta ^t\).
Proof
Suppose that the probability that the walk \(W_{p_i}\) hits the line \(l_t\) is given by \(X^t\) for some \(X\in (0,1)\). After the first step, the walk is at (0, 1) with probability \(p_i\), and at (1, 0) with probability \(q_i\). From (0, 1) the probability for the walk hitting \(l_t\) is \(X^{t-1}\), and from (1, 0) the probability is \(X^{t-1+r}\). Then it follows
that is,
Thus \(X=a_i\), and the walk hits the line \(l_t\) with probability \(a_i^t\).
Next suppose that the probability that the walk \(W'\) hits the line \(l_{rt}\) is given by \(Y^t\) for some \(Y\in (0,1)\). After the first r steps, it is at \((x,r-x)\) with probability
where \([r]=\{1,2,\ldots ,r\}\). From \((x,r-x)\) the probability for the walk hitting \(l_{rt}\) is \(Y^{x+t-1}\). This yields
that is,
Thus \(Y=\beta \), and the walk hits the line \(l_{rt}\) with probability \(\beta ^t\). \(\square \)
Define a polynomial f(x) by
By definition \(f(\beta )=0\).
Fact 2
If \(0<y<1\) and \(f(y)\le 0\), then \(\beta \le y\).
Proof
This follows because \(f(0)>0\), \(f(1)=0\), \(f'(1)=-1+\sum _{i=1}^r q_i> -1+r\cdot \frac{1}{r}=0\) (here we used \(p_i<1-\frac{1}{r}\)), and \(f''(x)>0\) for \(x>0\). \(\square \)
Fact 3
The inequality \(a:=a_1\cdots a_r\le \beta \) follows from (1).
Proof
By Fact 7 it suffices to show \(f(a)\le 0\). Since \(a_i=p_i+q_ia_i^r\) we have
and
So we need to show
Solving \(a_i=p_i+(1-p_i)a_i^r\) for \(p_i\) gives
Then
Noting that \(0<a_i<1\) we see that (14) is equivalent to
and multiplying both sides by \(a=a_1\cdots a_r\) we get (1). \(\square \)
Rights and permissions
About this article
Cite this article
Kato, M., Kosuda, M. & Tokushige, N. Extending Muirhead’s Inequality. Graphs and Combinatorics 37, 1923–1941 (2021). https://doi.org/10.1007/s00373-021-02356-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00373-021-02356-z