1 Introduction

Given any group W, any generating set T for W, and any cost function

on the generators, one can extend the cost function to all of W by minimizing over all decompositions of w into products of elements of T:

$$\begin{aligned} \$(w) = \min _{\begin{array}{c} t_1,\ldots , t_k \in T: \\ t_1\cdots t_k = w \end{array}} \big \{\$(t_1) + \cdots + \$(t_k)\big \}. \end{aligned}$$

One family of common examples occurs in the case that \(\$(t) = 1\) for all \(t \in T\): then, \(\$(w)\) is the minimum length of an expression for w as a product of generators. For example, when \(W = S_n\) is the symmetric group and \(T = S:= \{(1~2), (2~3), \ldots , (n - 1~n)\}\) is the set of simple transpositions, we have that \(\$(w)\) is the length of the shortest possible expressions for w as a product of simple transpositions, which is also known to be the inversion number \(\ell _S(w)\) of w. If, instead, \(T = \{ (1~2), (2~3), (1~3), \ldots \}\) consists of all transpositions in \(S_n\) and \(\$(t) = 1\) for all \(t \in T\), then \(\$(w)\) is the reflection length \({\text {reflen}}(w)\) of w, which is also known as the absolute length. This is known to equal \(n - c(w)\), where c(w) is the number of cycles of the permutation w.

In [6], the authors considered the situation in which \(S_n\) is generated by the set \(T = \{(i~j)\}\) of all transpositions, and the cost function on transpositions is given by

$$\begin{aligned} \$((i~j)) = |j-i|, \end{aligned}$$
(1)

i.e., the cost of a transposition is equal to the distance between the points it transposes in the one-line notation of a permutation. They showed that for this function, the cost of a permutation is half of its total displacement:

$$\begin{aligned} \$(w) = \frac{1}{2} \sum _{i=1}^n |w(i)-i|. \end{aligned}$$
(2)

One theme of all of these examples is that a certain “extrinsic,” “extremal” quantity (the minimum of a cost function over the family of factorizations of a given element) can also be given by a simple “intrinsic” formula that can be computed directly from the element. The main thesis of this note is that formulas of this kind are beautiful and interesting, and that it would be nice to have more of them.

The symmetric groups are Coxeter group of finite type A, and the three preceding examples can all be rephrased at the level of generality of arbitrary Coxeter groups. In particular, in [6], it was shown that the cost \(\$((i~j)) = |j - i|\) can be described in terms of the associated root system: the transpositions in \(S_n\) are the reflections, and the cost \(|j - i|\) is the depth of the positive root associated with the reflection (i j) in the root system. This gives two perspectives to the work of [6]: the combinatorial view focuses on the definition of the cost function given in Eq. (1), while the algebraic view focuses on the definition of the cost function in terms of root depths.

In [1], the authors studied the depth-defined cost function for the Coxeter groups of types B and D (the signed and even-signed permutations). They were able to give formulas for the cost of arbitrary elements with this depth-defined cost function, but their results were, in a sense, less tidy than the results cited above. This raises the following question.

Question

Are there natural choices of cost functions on the reflections in types B and D, or on other related groups like the affine symmetric group, whose extension to the group is given by a simple, attractive formula?

Our main results are to answer this question in the affirmative, generalizing the combinatorial perspective on the cost function of Eq. (1). We work in the level of generality of the infinite families of Weyl and affine Weyl groups—finite types A, B, D and affine types \(\widetilde{A}\), \(\widetilde{B}\), \(\widetilde{C}\), \(\widetilde{D}\)—which Eriksson and Eriksson called George groups [4]. These groups share a common combinatorial description as permutation groups acting on \(\mathbb {Z}\) that commute with certain symmetries of the number line.

The details of these groups and of our cost function $, which is motivated directly by the cost given in Eq. (1), are presented in Sect. 2. The main results of our work are a pair of theorems—Theorem 3.1 and Theorem 3.7—giving the cost of arbitrary elements in the unbranched and finite George groups, respectively. Those results show that the cost of an arbitrary element can be computed directly from that element using a simple, intrinsic formula. Finally, in Sect. 4, we make some further remarks and state some open problems related to our work.

2 Background

2.1 Who Are the Groups?

The main objects of this paper are the classical (finite and affine) Coxeter groups, or, in the language of [4], the George groups. Each of these groups consists of bijections from (a subset of) \(\mathbb {Z}\) to itself that satisfy certain symmetry conditions. We describe them now, following [2, 4].

  • The symmetric group \(S_n\) is a Coxeter group of (finite) type A. It consists of all bijections from \([n] = \{1, \ldots , n\}\) to itself.

  • The hyperoctahedral group \(S^B_n\) is a Coxeter group of (finite) type B (and also finite type C). Its elements are the signed permutations, which are the bijections from \(\pm [n]:= \{\pm 1, \ldots , \pm n\}\) to itself satisfying the symmetry condition \(w(i) = -w(-i)\).

  • The group \(S^D_n\) is a Coxeter group of (finite) type D. It is the subgroup of \(S^B_n\) consisting of the even-signed permutations, those for which \(\#\{i \in [n]: w(i) < 0\}\) is even.

  • The affine symmetric group \(\widetilde{S}_n\) is a Coxeter group of affine type \(\widetilde{A}\). Its elements are the affine permutations. These are bijections \(w: \mathbb {Z}\rightarrow \mathbb {Z}\), such that

    • \(w(i + n) = w(i) + n\) for all \(i \in \mathbb {Z}\), and

    • \(w(1) + \cdots + w(n) = \left( {\begin{array}{c}n+1\\ 2\end{array}}\right) \).

  • The Coxeter group \(\widetilde{S}^C_n\) of affine type \(\widetilde{C}\) consists of the affine signed permutations. In the language of [4] (which differs from that in [2]—see Sect. 4.1), these are the bijections \(w: \mathbb {Z}\rightarrow \mathbb {Z}\) such that

    • \(w(-i) = -w(i)\) for all \(i \in \mathbb {Z}\), and

    • \(w(i + (2n + 2)) = w(i) + (2n + 2)\) for all \(i \in \mathbb {Z}\).

  • The Coxeter groups \(\widetilde{S}^B_n\) and \(\widetilde{S}^D_n\) of affine types \(\widetilde{B}\) and \(\widetilde{D}\) are subgroups of \(\widetilde{S}^C_n\) satisfying some evenness conditions analogous to the condition in finite type D.

We will refer to these groups collectively as the George groups of window size n. Due to the symmetry conditions in these groups, each element w is uniquely described by the window of data \([w(1), w(2), \ldots , w(n)]\). For example, for \(w = [-5, 6, 7] \in \widetilde{S}^C_3\) one has \(w(4) = 4\), \(w(5) = w(-3) + 8 = -w(3)+8 = 1\), \(w(6) = w(-2) + 8 = -w(2) + 8 = 2\), \(w(7) = w(-1) + 8 = -w(1) + 8 = 13\), \(w(8) = 8\), and so on.

Our work focuses on these five groups \(S_n\), \(S^B_n\), \(S^D_n\), \(\widetilde{S}_n\), and \(\widetilde{S}^C_n\), which we will think of as the union of two (overlapping) classes:

  • the unbranched George groups \(S_n\), \(S^B_n\), \(\widetilde{S}_n\), and \(\widetilde{S}^C_n\), and

  • the finite George groups \(S_n\), \(S^B_n\), and \(S^D_n\).

The “unbranched” terminology refers to the fact that the Dynkin diagrams for these groups have no vertices of degree greater than 2 (see [4, Table 1] or [2, §A1]). We will say more about the remaining two groups \(\widetilde{S}^B_n\) and \(\widetilde{S}^D_n\) in Sect. 4.2.

It is common to ease notation by writing \(\overline{i}:= -i\). The symmetry conditions satisfied by a George group W divide its domain into symmetry classes. For \(S_n\), with no symmetry conditions, these are simply the singleton classes \(\{1\}\), ..., \(\{n\}\). The single symmetry condition satisfied by \(S^B_n\) and \(S^D_n\) yields n symmetry classes \(\{1, \overline{1}\}\), ..., \(\{n, \overline{n}\}\). For the affine symmetric group \(\widetilde{S}_n\), the symmetry classes are precisely the congruence classes of integers modulo n. Affine signed permutations satisfy two symmetry conditions: elements of \(\widetilde{S}^C_n\) are the bijections on \(\mathbb {Z}\) that commute with reflection across 0 and translation by \(2(n + 1)\). It follows that \(w((2n + 2) - i) = (2n + 2) - w(i)\) for w in this group, so the affine signed permutations commute with reflection across \(n + 1\). By translation, affine signed permutations commute with reflection across \(k(n + 1)\) for all k, and so \(k(n + 1)\) is a fixed point of every affine signed permutation for all k. Thus, the symmetry classes for the remaining groups \(\widetilde{S}^C_n\), \(\widetilde{S}^B_n\), and \(\widetilde{S}^D_n\) consist of the nontrivial classes \(\{ m :m \equiv i \text { or } \overline{i} \pmod {2n + 2}\}\) for \(i = 1, \ldots , n\), as well as the trivial classes \(\{k(n + 1)\}\) for \(k \in \mathbb {Z}\).

2.2 What Are the Statistics?

We will study several permutation statistics in this article, and we highlight the key definitions here. Throughout this paper, n will be an arbitrary but fixed positive integer.

We begin with a definition from [4], giving a generic framework of transpositions that works for all of the groups we study, rather than naming the particular transpositions that apply in each case.

Definition 2.1

Given a George group W, a pair \(\{i,j\}\) of different positions in \(\mathbb {Z}\) is transposable if there exists at least one element w in W for which \(w(i) = j\) and \(w(j) = i\). If \(\{i,j\}\) is a transposable pair, then the transposition \(\langle (i~j ) \rangle \) is the extension of the transposition (i j) under the symmetry conditions of the group; this is always an element of W of multiplicative order 2.

The transpositions in a George group W are precisely the reflections (when W is viewed as a Coxeter group). In what follows, it will be useful to divide the transpositions into two flavors. First, all George groups contain transpositions of the form \(\langle (i~j)\rangle \) where i and j belong to different symmetry classes. Second, some groups contain transpositions that switch two integers in the same symmetry class: in particular, the groups \(S^B_n\) and \(\widetilde{S}^C_n\) contain transpositions of the form \(\langle (i~\overline{i})\rangle \), and the groups \(\widetilde{S}^B_n\) and \(\widetilde{S}^C_n\) contain transpositions of the form \(\langle (i~2n+2 - i) \rangle \).

In each of the George groups of window size n, there is a natural generating set of simple reflections that makes it a Coxeter group. For \(i \in [n]\), define \(s_i:= \langle (i~i+1) \rangle \). Additionally, we let \(s_0:= \langle (1~\overline{1}) \rangle \), \(s_1':= \langle (1~\overline{2}) \rangle \), and \(s_n':= \langle (n~n+2) \rangle \). Then

  • \(S^A_n:= S_n\) is generated by \(\{s_1, \ldots , s_{n-1}\}\),

  • \(S^B_n\) is generated by \(\{s_0, s_1, \ldots , s_{n-1}\}\),

  • \(S^D_n\) is generated by \(\{s_1', s_1, \ldots , s_{n-1}\}\),

  • \(\widetilde{S}_n\) is generated by \(\{s_1, \ldots , s_{n-1}, s_n\}\), and

  • \(\widetilde{S}^C_n\) is generated by \(\{s_0, s_1, \ldots , s_{n-1}, s_n'\}\).

In the symmetric group, one often considers inversions, which are pairs of elements of [n] that are out of order. The following definition extends this notion to all George groups.

Definition 2.2

Let w be an element in a George group. A (right) class inversion in w is a transposition \(\langle (i~j) \rangle \) such that \(i < j\) and \(w(i) > w(j)\).

The length \(\ell (w)\) of an element of a Coxeter group is the smallest number of simple reflections needed to multiply to give w. In the symmetric group, this is equal to the number of inversions of w. By [4, Thm. 15], the previous definition extends this to all George groups: for any element w in any George group, the length \(\ell (w)\) in that group is equal to the number of its class inversions.

Because an element of a George group of window size n is entirely determined by the data in its window \([w(1), \ldots , w(n)]\), we can define the following statistic on all such elements.

Definition 2.3

If w belongs to any of the George groups of window size n, then the total displacement [5] or Spearman’s disarray [3] of w is

$$\begin{aligned} {\text {dis}}(w):= \sum _{i = 1}^n |w(i) - i|. \end{aligned}$$

Remark 2.4

When W respects a symmetry group on a subset of \(\mathbb {Z}\) (made up of some set of reflections and translations), and if i and j are in the same symmetry class, then \(|w(i) - i| = |w(j) - j|\). Consequently, in Definition 2.3, the summation index set [n] may be replaced by any set that contains exactly one element in the same symmetry class as i for each i in [n].

As mentioned in the introduction, the statistic studied in this paper is the cost function \(\$(w)\), defined as follows: if t is a transposition in a George group, then

$$\begin{aligned} \$(t) = \frac{{\text {dis}}(w)}{2}, \end{aligned}$$

while for an arbitrary element w, we define

$$\begin{aligned} \$(w) = \min _{\begin{array}{c} t_1\cdots t_k = w \\ t_1,\ldots , t_k \in T \end{array}} \big \{\$(t_1) + \cdots + \$(t_k)\big \}. \end{aligned}$$
(3)

Concretely, the first condition means that \(\$(\langle (i~j)\rangle ) = |i - j|\) if i and j belong to different symmetry classes, while \(\$(\langle (i~j)\rangle ) = \frac{1}{2}|i - j|\) if i and j belong to the same symmetry class. For example, in the hyperoctahedral group \(S^B_n\), for ij distinct elements of [n], we have

  • \(\$( (i~j)(\overline{i}~\overline{j})) = |i - j|\),

  • \(\$( (i~\overline{i})) = i\), and

  • \(\$( (i~\overline{j})(\overline{i}~j)) = i + j\).

In [6] and subsequently [1], the authors consider a statistic on an arbitrary Coxeter group W that they call depth. For a reflection t, the depth is defined to beFootnote 1

$$\begin{aligned} {\text {dp}}(t) = \frac{1 + \ell (t)}{2}, \end{aligned}$$

and depth is extended to all of W by the same minimization as in Eq. (3). In the case of the symmetric group, one has that \(\ell ( (i~j)) = 2|j - i| - 1\). Thus, \({\text {dp}}( (i~j )) = \frac{1}{2} {\text {dis}}((i~j))\) for every transposition (i j) in \(S_n\), and consequently, \({\text {dp}}(w) = \$(w)\) for every permutation \(w \in S_n\); the main result of [6] establishes that \({\text {dp}}(w) = \$(w) = \frac{1}{2} {\text {dis}}(w)\) for \(w \in S_n\). In types B and D, we no longer have that \({\text {dp}}(t)\) and \(\$(t)\) are equal for all transpositions t; in particular, for a transposition \( \langle (i \ \overline{j}) \rangle = (i \ \overline{j})(\overline{i} \ j)\) with \(i, j > 0\), we have \(\$(\langle (i \ \overline{j}) \rangle ) = i + j\), while \({\text {dp}}^B(\langle (i \ \overline{j}) \rangle ) = i + j - 1\) and \({\text {dp}}^D(\langle (i \ \overline{j}) \rangle ) = i + j - 2\). Thus, the problems of computing, for arbitrary w in \(S^B_n\) or \(S^D_n\), the values of \({\text {dp}}(w)\) and \(\$(w)\) are different. The main result of [1] is to give formulas for \({\text {dp}}(w)\) for w in \(S^B_n\) or \(S^D_n\), in terms of another statistic they call the blocks of the signed permutation, which we will discuss in Sect. 3.2.

3 Main Theorems

The main results of this paper are about the cost of elements in the George groups. We will state and prove this cost first for the unbranched George groups, because arguments in those cases are susceptible to the same proof techniques. We will then state and prove analogous results for the finite George groups, again taking advantage of commonalities in the approaches for those cases. These two classes of groups overlap in \(S_n\) and \(S^B_n\), and, of course, the results are the same for those groups whether they are considered unbranched or finite.

3.1 First Main Theorem: Unbranched George Groups

In this section, we prove our first main theorem, which applies to the unbranched George groups \(S_n\), \(S^B_n\), \(\widetilde{S}_n\), and \(\widetilde{S}^C_n\).

Theorem 3.1

For any unbranched George group W and any element w in W, we have

$$\begin{aligned} \$(w) = \frac{{\text {dis}}(w)}{2}. \end{aligned}$$

In the case of \(S_n\), Theorem 3.1 recovers [6, Theorem 1.1]. Our proof here will be more akin to the proof in [1] than the one in [6]. In fact, the proof of Theorem 3.1 will follow the same general outline for each of the four unbranched George groups: first, we will show that the cost \(\$(w)\) is at least as large as \({\text {dis}}(w)/2\); then, we will show that a particular transposition can always be found in the unbranched George groups; finally, we will show that when that particular kind of transposition exists, equality can be achieved. The result will follow by induction.

Proposition 3.2

If u and w belong to any George group W, then

$$\begin{aligned} {\text {dis}}(uw) \le {\text {dis}}(u) + {\text {dis}}(w). \end{aligned}$$

Proof

Fix u and w in a George group W of window size n. Then, by definition and the triangle inequality, we have

$$\begin{aligned} {\text {dis}}(uw)&= \sum _{i = 1}^n |u(w(i)) - i| \\&= \sum _{i = 1}^n |u(w(i)) - w(i) + w(i) - i| \\&\le \sum _{i = 1}^n |u(w(i)) - w(i)| + \sum _{i = 1}^n|w(i) - i|. \end{aligned}$$

Since W is a George group, the set \(\{w(1), \ldots , w(n)\}\) contains exactly one element in the symmetry class of i for each i in [n]. Thus, by Remark 2.4, \( \sum _{i = 1}^n |u(w(i)) - w(i)| = {\text {dis}}(u)\), and so \({\text {dis}}(uw) \le {\text {dis}}(u) + {\text {dis}}(w)\), as claimed. \(\square \)

We can now prove one direction of Theorem 3.1.

Corollary 3.3

If w belongs to any George group, then

$$\begin{aligned} \$(w) \ge \frac{{\text {dis}}(w)}{2}. \end{aligned}$$

Proof

Given w, choose a minimum-cost transposition factorization \(w = t_1 \cdots t_k\) of w; that is, \(\$(w) = \$(t_1) + \cdots + \$(t_k)\). By definition, \(\$(t) = {\text {dis}}(t)/2\) for every transposition t. Thus, by Proposition 3.2, we have

$$\begin{aligned} \$(w) = \frac{{\text {dis}}(t_1) + \cdots + {\text {dis}}(t_k)}{2} \ge \frac{{\text {dis}}(t_1 \cdots t_k)}{2} = \frac{{\text {dis}}(w)}{2}, \end{aligned}$$

as claimed. \(\square \)

While Proposition 3.2 and Corollary 3.3 apply to elements of all George groups, the next result requires that the group under consideration be unbranched. Moreover, this is the step in our proof of Theorem 3.1 that involves a case analysis, by group. It is notable that the affine cases are somewhat delicate, and use explicit descriptions of which pairs are transposable in their groups.

Lemma 3.4

Let W be an unbranched George group (\(S_n\), \(S^B_n\), \(\widetilde{S}_n\), or \(\widetilde{S}^C_n\)). If w is a non-identity element of W, then there exists a transposable pair \(\{x,y\}\) for W, such that

$$\begin{aligned} w(x) \ge y > x \ge w(y). \end{aligned}$$

Proof

We give separate proofs for the different groups. All cases rely on the fact that w is not the identity, and so it has some non-fixed values.

Choose y so that w(y) is the smallest non-fixed value. The minimality means that, in particular, \(w(y) < y\). Thus, there is a value (namely, w(y)) less than y and in a position weakly to the right of y in the window of w (in fact, in position y). Therefore, there must be a value weakly larger than y and appearing strictly to the left of position y. That is, there exists \(x < y\) with \(w(x) \ge y\). Moreover, the definition of y means that \(x \in [w(y), y-1]\), so, in fact, we have

$$\begin{aligned} w(x) \ge y > x \ge w(y). \end{aligned}$$

The same proof works, with two adjustments: all references to positions in the window should refer to the “doubled window” \([w(-n), \ldots , w(-1), w(1), \ldots , w(n)]\), and w(y) should now be chosen as the smallest non-fixed value in the doubled window.

Divide the non-fixed values of w into two sets: the exceedances \(E:= \{i \in \mathbb {Z}: w(i) > i\}\) and the anti-exceedances \(A:= \{j \in \mathbb {Z}: w(j) < j\}\). Because w is not the identity, \(A \cup E\) is nonempty. Suppose, without loss of generality, that A is nonempty. (The argument in the case that E is nonempty is entirely analogous.) By the periodicity property of w, we have \(i \in A\) if and only if \(i + kn \in A\), so A has at least one element in [n]. But then, because \(\sum _{i=1}^n w(i) = \sum _{i=1}^n i\), we must have that E is nonempty, as well. Combining this with the periodicity, we have that for each element j in A, there is some element i in E with \(i < j\). Consequently, there exists a position \(y \in A\) for which the largest element of \(A \cup E\) less than y is an element of E; call the position of this exceedance x.

By the choice of x and y, we have \(y > x\), \(y > w(y)\), and \(w(x) > x\). Moreover, again by the choice of x and y, we have that \(w(z) = z\) for all z in \(\{x + 1, \ldots , y - 1\}\). Since w is a bijection, w(x) cannot be equal to any of \(w(x + 1) = x+ 1, \ldots , w(y - 1) = y - 1\), and therefore, \(w(x) \ge y\); similarly, \(w(y) \le x\). Thus

$$\begin{aligned} w(x) \ge y > x \ge w(y), \end{aligned}$$

as claimed. Finally, since \(y \in A\), every number of the form \(y + kn\) also belongs to A. Since x belongs to E (which is disjoint from A), x does not differ from y by a multiple of n, and therefore, \(\{x, y\}\) is a transposable pair.

Define the sets E and A as in Case 3. Again, because w is not the identity, we have \(A \cup E \ne \emptyset \). Suppose that \(i \in E\), so that \(w(i) > i\). Then, by the first symmetry property of w, we have that \(w(-i) = -w(i) < -i\), so \(-i \in A\). Moreover, since \(w(i + k(2n + 2)) = w(i) + k(2n + 2)\) for all k, we have \(i + k(2n + 2) \in E\) and \(-i + k(2n + 2) \in A\) for all k. Similarly, if \(j \in A\), then \(j + k(2n + 2) \in A\) and \(-j + k(2n + 2) \in E\) for all k. It follows that when \(E \cup A\) is nonempty, both E and A include arbitrarily large and arbitrarily small elements. Therefore, for each element j in A, there is some element i in E with \(i < j\). Consequently, there exists a position \(y \in A\) for which the largest element of \(A \cup E\) less than y is an element of E; call this position x.

By the choice of x and y, we have \(y > x\), \(y > w(y)\), and \(w(x) > x\). Moreover, again by the choice of x and y,  we have that \(w(z) = z\) for all z in \(\{x + 1, \ldots , y - 1\}\). Since w is a bijection, w(x) cannot be equal to any of \(w(x + 1) = x+ 1, \ldots , w(y - 1) = y - 1\), and therefore, \(w(x) \ge y\); similarly, \(w(y) \le x\). Thus

$$\begin{aligned} w(x) \ge y > x \ge w(y), \end{aligned}$$

as claimed. The sets E and A are disjoint, with \(x \in E\) and \(y \in A\). Thus, x does not differ from y by a multiple of \(2n + 2\), and therefore, \(\{x, y\}\) is a transposable pair. \(\square \)

The last step of our argument is to show that in the presence of a transposable pair such as that described in the statement of Lemma 3.4, we can peel off a transposition from w in an advantageous manner.

Lemma 3.5

Suppose that w is an element of a George group W and that \(\{x, y\}\) is a transposable pair for W such that \(w(x) \ge y > x \ge w(y)\). Then

$$\begin{aligned} {\text {dis}}(w \cdot \langle (x \ y)\rangle ) = {\text {dis}}(w) - {\text {dis}}(\langle (x \ y)\rangle ). \end{aligned}$$

Proof

Let wxy be as in the statement. Suppose first that x and y do not belong to the same symmetry class. In this case, it is possible to choose a set I of n integers that contains both x and y and that contains exactly one element from the symmetry class of i for each \(i \in [n]\) (as in Remark 2.4). With these choices, we have

$$\begin{aligned} {\text {dis}}(w)&= \sum _{i \in I}|w(i) - i| \\&= |w(x) - x| + |w(y) - y| + \sum _{i \in I \smallsetminus \{x, y\}}|w(i) - i| \\&= w(x) - x + y - w(y) + \sum _{i \in I \smallsetminus \{x, y\}}|w(i) - i|. \end{aligned}$$

Furthermore, because \(\langle (x \ y )\rangle \) is a transposition, \(\langle (x \ y )\rangle (i) = i\) for \(i \in I \smallsetminus \{x, y\}\), and therefore

$$\begin{aligned} {\text {dis}}(w\cdot \langle (x \ y )\rangle )&= \sum _{i \in I}|w(\langle (x \ y )\rangle (i)) - i| \\&= |w(y) - x| + |w(x) - y| + \sum _{i \in I \smallsetminus \{x, y\}}|w(i) - i| \\&= x - w(y) + w(x) - y+ \sum _{i \in I \smallsetminus \{x, y\}}|w(i) - i| \\&= {\text {dis}}(w) - 2(y - x). \end{aligned}$$

Since x and y belong to different symmetry classes, \(2(y - x) = {\text {dis}}(\langle (x \ y )\rangle )\), which completes the proof in this case.

On the other hand, if x and y belong to the same symmetry class, choose an index set I that contains y and one element from each other symmetry class. Because x and y are in the same symmetry class, we have \(|w(x) - x| = |w(y) - y|\). Thus

$$\begin{aligned} {\text {dis}}(w)&= |w(y) - y| + \sum _{i \in I \smallsetminus \{y\}}|w(i) - i| \\&= w(x) - x + \sum _{i \in I \smallsetminus \{y\}}|w(i) - i|. \end{aligned}$$

Furthermore,

$$\begin{aligned} {\text {dis}}(w\cdot \langle (x \ y )\rangle )&= |w(\langle (x \ y )\rangle (y)) - y| + \sum _{i \in I \smallsetminus \{y\}}|w(\langle (x \ y )\rangle (i)) - i| \\&= w(x) - y + \sum _{i \in I \smallsetminus \{y\}}|w(i) - i| \\&= {\text {dis}}(w) - (y - x). \end{aligned}$$

Since x and y belong to the same symmetry class, \(y - x = {\text {dis}}(\langle (x \ y )\rangle )\), as needed. \(\square \)

We are now prepared to give an inductive proof of Theorem 3.1.

Proof of Theorem 3.1

Thanks to Corollary 3.3, it remains to prove that \(\frac{\textrm{dis}(w)}{2}\) is at least \(\$(w)\) for all w. We will prove this by inducting on the length of w. Both the cost and the displacement of the identity are 0, establishing the base case. Now, suppose that \(\ell (w) > 0\), and assume that the result holds for all elements of length less than \(\ell (w)\).

Because w is an element of an unbranched George group W, Lemma 3.4 means that there is a transposable pair \(\{x,y\}\) for W such that

$$\begin{aligned} w(x) \ge y > x \ge w(y), \end{aligned}$$

and Lemma 3.5 implies that

$$\begin{aligned} {\text {dis}}(w \cdot \langle (x \ y )\rangle ) = {\text {dis}}(w) - {\text {dis}}(\langle (x \ y )\rangle ). \end{aligned}$$

Set \(v:= w \cdot \langle (x \ y )\rangle \). Since \(\langle (x \ y )\rangle \) is a (right) inversion of w, we have \(\ell (v) < \ell (w)\). Thus, the inductive hypothesis applies to v, and \(\$(v) = {\text {dis}}(v)/2\). Since \(\langle (x \ y )\rangle \) is a transposition, \(\$(\langle (x \ y )\rangle ) = {\text {dis}}(\langle (x \ y )\rangle )/2\) by definition, and therefore

$$\begin{aligned} {\text {dis}}(w)&= {\text {dis}}(v) + {\text {dis}}(\langle (x \ y )\rangle )\\&= 2\cdot \$(v) + 2 \cdot \$(\langle (x \ y )\rangle ). \end{aligned}$$

For any minimal-cost transposition factorization \(v = t_1 \cdots t_k\), we have \(w = t_1 \cdots t_k \cdot \langle (x \ y )\rangle \), and so

$$\begin{aligned} \$(w)&\le \$(t_1) + \cdots + \$(t_k) + \$(\langle (x \ y )\rangle )\\&= \$(v) + \$(\langle (x \ y )\rangle ). \end{aligned}$$

Combining these results yields

$$\begin{aligned} 2 \cdot \$(w) \le {\text {dis}}(w), \end{aligned}$$

completing the proof. \(\square \)

3.2 Second Main Theorem: Finite Type

In this section, we prove our second main theorem, for George groups of finite type (\(S^A_n:= S_n\), \(S^B_n\), and \(S^D_n\)). The statement of the theorem involves a statistic introduced in [1], which we recall now.

Every signed permutation can be expressed uniquely as a direct sum of indecomposable signed permutations (as in [1, §2]). For example,

$$\begin{aligned} {[}\overline{3},\overline{1},2,\overline{4},7,6,8,\overline{5}] = [\overline{3},\overline{1},2] \oplus [\overline{1}] \oplus [3,2,4,\overline{1}]. \end{aligned}$$

Definition 3.6

Let w be a signed permutation, with

$$\begin{aligned} w = w^1 \oplus \cdots \oplus w^k, \end{aligned}$$

where each \(w^i\) is an indecomposable signed permutation. These \(w^1, \ldots , w^k\) are the type B blocks of w, and

$$\begin{aligned} {\text {bl}}^B(w):= k. \end{aligned}$$

Of course, even-signed permutations can also be written as direct sums. If we require that the summands themselves be even-signed permutations, then those summands are the type D blocks of w, and \({\text {bl}}^D(w)\) is the number of type D blocks required.

For \(w = [\overline{3},\overline{1},2,\overline{4},7,6,8,\overline{5}] \in S_8^D \subset S_8^B\), the type B blocks and the type D blocks of w are given by the following decompositions, respectively:

$$\begin{aligned} w&= [\overline{3},\overline{1},2] \oplus [\overline{1}] \oplus [3,2,4,\overline{1}]\\&= [\overline{3},\overline{1},2] \oplus [\overline{1},4,3,5,\overline{2}]. \end{aligned}$$

From this, we see that \({\text {bl}}^B(w) = 3\), while \({\text {bl}}^D(w) = 2\).

For a (usual, unsigned) permutation \(w \in S_n\), the decomposition of w as a direct sum of indecomposables is the same as the decomposition if we think of w as belonging to \(S^B_n\) or \(S^D_n\). We say that these indecomposable summands are the type A blocks of w, and define \({\text {bl}}^A(w)\) to be the number of such blocks.

Theorem 3.7

For every (signed) permutation w in the finite George group \(S^X_n\), we have

$$\begin{aligned} \$(w) = \frac{{\text {dis}}(w)}{2} + {\text {bl}}^B(w) - {\text {bl}}^X(w). \end{aligned}$$

Proof

For w in \(S_n\) or \(S^B_n\) (so \(X = A\) or B), we have by Theorem 3.1 that \(\$(w) = {\text {dis}}(w)/2\), and we have by definition of blocks that the type-X blocks and type-B blocks of w are the same, so the result holds in these cases.

Let w be a permutation in \(S^D_n\), and let \(t_1 \cdots t_k\) be a $-minimizing factorization of w into transpositions. Recall from Sect. 2 that if \(i, j > 0\), then \(\$((i \ j)(\overline{i}\ \overline{j})) = {\text {dp}}^D((i \ j)(\overline{i}\ \overline{j}))\) and \(\$((i \ \overline{j})(\overline{i}\ j)) = {\text {dp}}^D((i \ \overline{j})(\overline{i}\ j)) + 2\). We refer to the transpositions in the second case as signed. Then, using the Iverson bracket,

$$\begin{aligned} \$(w) = \sum _i \$(t_i)&= \sum \limits _i \left( {\text {dp}}^D(t_i) + 2\llbracket t_i \text { signed}\rrbracket \right) \\&= \left( \sum \limits _i {\text {dp}}^D(t_i)\right) + 2\#\{t_i : t_i \text { is signed}\} \\&\ge {\text {dp}}^D(w) + {\text {neg}}(w). \end{aligned}$$

Next, we use [1, Corollary 2.10], which computes \({\text {dp}}^D(w)\) in terms of the statistics we have defined as well as \({\text {neg}}(w):= \{i \in [n]: w(i) < 0\}\):

$$\begin{aligned} \$(w)&\ge \left( \frac{{\text {dis}}(w)}{2} - {\text {neg}}(w) + {\text {bl}}^B(w) - {\text {bl}}^D(w)\right) + {\text {neg}}(w)\\&= \frac{{\text {dis}}(w)}{2} + {\text {bl}}^B(w) - {\text {bl}}^D(w). \end{aligned}$$

On the other hand, [1, §4.2] produces a factorization \(t'_1 \cdots t'_{k'}\) of w for which

$$\begin{aligned} \sum \limits _i \$(t'_i)&= \frac{{\text {dis}}(w)}{2} - {\text {neg}}(w) + {\text {bl}}^B(w) - {\text {bl}}^D(w) + {\text {neg}}(w)\\&= \frac{{\text {dis}}(w)}{2} + {\text {bl}}^B(w) - {\text {bl}}^D(w). \end{aligned}$$

And since \(\$(w) \le \sum \$(t'_i)\), we can conclude from these two inequalities that indeed

$$\begin{aligned} \$(w) = \frac{{\text {dis}}(w)}{2} + {\text {bl}}^B(w) - {\text {bl}}^D(w), \end{aligned}$$

as claimed. \(\square \)

4 Further Remarks and Open Questions

We conclude our work with commentary about our methods and a description of several possible directions for further research. Some of these possibilities involve specific conjectures, while others are more general questions or hopes for a deeper understanding.

4.1 Different Combinatorial Realizations in Affine Types

As originally observed in [4], in the definition of \(\widetilde{S}^C_n\), there is a choice about whether to have a mirror symmetry across the integer \(n + 1\) (corresponding to the translation by \(2n + 2\) in the definition) or to place the mirror between the integers n and \(n + 1\) (in which case the corresponding translation would be by \(2n + 1\) instead).Footnote 2 Indeed, in the standard reference [2] (and in the AffinePermutationGroup implementation on Sage [7]), the latter convention is chosen. This difference has no effect on the algebra of the group, but it changes the correspondence between the algebraic and combinatorial objects, and hence it changes fundamentally the answers to the questions we consider. For example, the window notation of the \(\widetilde{S}^C_n\)-simple reflection that we denoted \(s'_n\) in Sect. 2.2 is \([1, \ldots , n - 1, n + 2]\), with \({\text {dis}}(s'_n) = 2\), but in the alternate convention, it would be \([1, \ldots , n - 1, n + 1]\), with total displacement equal to 1. Our decision to follow the realization from [4] rather than the variation used in [2] is motivated by the fact that \({\text {dis}}(w)\) is even for every permutation, signed permutation, and affine permutation—and only in this realization is \({\text {dis}}(w)\) an even integer for every affine signed permutation w.

4.2 Conjectures in Affine Types \({\varvec{B}}\) and \({\varvec{D}}\)

There are two George groups of window size n that are neither unbranched nor finite: \(\widetilde{S}^B_n\) and \(\widetilde{S}^D_n\). They are defined relative to \(\widetilde{S}^C_n\) by the same sort of evenness conditions that define \(S^D_n\) as a subgroup of \(S^B_n\). In particular, \(\widetilde{S}^B_n\) is the group of affine signed permutations w such that \(\# \{ i > 0: w(i) < 0\}\) is even, and \(\widetilde{S}^D_n\) is the subgroup of \(\widetilde{S}^B_n\) consisting of those affine signed permutations w such that, in addition, \(\# \{i > n + 1: w(i) < n + 1\}\) is even.

Below, we state a precise conjecture for the value \(\$(w)\) when \(w \in \widetilde{S}^B_n\), and raise the question of computing \(\$(w)\) when \(w \in \widetilde{S}^D_n\). We begin by defining an analog of blocks for affine signed permutations.

Given an affine signed permutation w, say that an integer \(j \in [n - 1]\) is good if w restricts to a bijection on \(\pm [j]\) to itself. The number of good integers is at most \(n - 1\) (achieved, for example, on the identity, but also on the permutation \([-1, -2, -3, -4] \in \widetilde{S}^C_4\)) and can be as small as 0 (for example, in the permutation \([4, 5]\in \widetilde{S}^C_2\)). Define

$$\begin{aligned} {\text {bl}}^{\widetilde{C}}(w) = 1 + \# \{\text {good values for } w\}. \end{aligned}$$

Suppose that j is a good value for w. Say that j is (further) very good if the restriction of w to \(\pm [j]\) is an even-signed permutation. Define

$$\begin{aligned} {\text {bl}}^{\widetilde{B}}(w) = 1 + \# \{\text {very good values for } w\}. \end{aligned}$$

Example 4.1

Consider the affine signed permutation

$$\begin{aligned} w = [1, \overline{2}, 4, 3, 6, \overline{5}, 7, \overline{8}, 10 + 24, 9, 11] \in \widetilde{S}^B_{11}. \end{aligned}$$

Then, the good values of w are 1, 2, 4, 6, 7, 8, so that \({\text {bl}}^{\widetilde{C}}(w) = 7\), and the very good values of w are 1, 6, 7, so that \({\text {bl}}^{\widetilde{B}}(w) = 4\).

The values \({\text {bl}}^{\widetilde{C}}(w)\) and \({\text {bl}}^{\widetilde{B}}(w)\) are meant to be the analog of the numbers of blocks of w. The \(+1\) accounts for the “last” block stretching out to include the value n—unlike the others, that block can include values in the window outside of \(\pm [n]\).

The following conjecture extends Theorem 3.7 to affine type \(\widetilde{B}\).

Conjecture 4.2

If \(w \in \widetilde{S}^B_n\), then

$$\begin{aligned} \$(w) = \frac{1}{2} {\text {dis}}(w) + {\text {bl}}^{\widetilde{C}}(w) - {\text {bl}}^{\widetilde{B}}(w). \end{aligned}$$

If Conjecture 4.2 holds, then it would follow that:

$$\begin{aligned} \$(w) - \frac{{\text {dis}}(w)}{2} \le n \end{aligned}$$

for all \(w \in \widetilde{S}^B_n\). By Theorems 3.1 and 3.7, the same inequality holds for w in a finite or unbranched George group. However, in \(\widetilde{S}^D_n\), empirical evidence suggests that it is no longer true that the difference \( \$(w) - \frac{{\text {dis}}(w)}{2} \) is bounded as w varies in the group.

Conjecture 4.3

For all \(w \in \widetilde{S}^D_n\), we have

$$\begin{aligned} \frac{{\text {dis}}(w)}{2} \le \$(w) \le {\text {dis}}(w). \end{aligned}$$

Furthermore, the equality \(\$(w) = {\text {dis}}(w)\) holds if and only if w is of the form

$$\begin{aligned} w = [1, \ldots , i - 1, i + 2k \cdot (2n + 2), i + 1, \ldots , n] \end{aligned}$$

for some \(i \in [n]\) and \(k \in \mathbb {Z}\).

More generally, we can ask the following.

Question 4.4

Is there a formula for \(\$(w)\) for \(w \in \widetilde{S}^D_n\)?

4.3 An Aesthetically Pleasing Construction

As mentioned in Sect. 3.1, the proof of our first main theorem produces a factorization of an element w by successively adding factors on the right side. This differs from the elegant approach in [6], in which one considers some factors to have been added on the left and others to have been added on the right, thereby avoiding the need for technical lemmas akin to Lemmas 3.4 and 3.5. Is there a similarly elegant algorithm for producing/interpreting \(\$\)-minimizing factorizations in other types?

4.4 Ordering the Statistics

In [6], it is observed that various inequalities hold among the natural statistics considered, namely, that for every element w of a Coxeter group,

$$\begin{aligned} {\text {reflen}}(w) \le \frac{{\text {reflen}}(w) + \ell (w)}{2} \le {\text {dp}}(w) \le \ell (w) \end{aligned}$$
(4)

where \(\ell (w) = \ell _S(w)\) is the Coxeter length of w (the smallest number of simple reflections whose product equals w) and \({\text {reflen}}(w)\) is the reflection (absolute) length of w (the smallest number of arbitrary reflections whose product equals w). In [1, 6], the elements in which various equalities in (4) hold are classified and enumerated in finite types A, B, and D.

If W is an unbranched George group, so that \({\text {dis}}(s) = 1\) for every simple transposition s, it follows immediately from our work that (4) can be extended as follows:

$$\begin{aligned} {\text {reflen}}(w) \le \frac{{\text {reflen}}(w) + \ell (w)}{2} \le {\text {dp}}(w) \le \frac{1}{2}{\text {dis}}(w) = \$(w) \le \ell (w). \end{aligned}$$

Can one characterize and enumerate the elements for which \({\text {dp}}(w) = \$(w)\)? And, similarly, for which \(\$(w) = \ell (w)\)?

For the branched types, it is no longer true that either \({\text {dis}}(w)/2\) or \(\$(w)\) is bounded by \(\ell (w)\): indeed, this inequality is violated by the simple transposition \(s = \langle (1~\overline{2})\rangle \) in these types. Can anything interesting be said uniformly?

4.5 Depth in Affine Types

Recalling the algebraic perspective discussed in the introduction, we note that our work leaves open the question of computing the depth of affine (signed) permutations. We mention briefly the reason we believe the answer may not be as attractive as the formulas discussed above. In the affine symmetric group \(\widetilde{S}_n\), the depth of a transposition \(\langle (i~j)\rangle \) is given by

$$\begin{aligned} {\text {dp}}(\langle (i~j)\rangle ) = \frac{1 + \ell (\langle (i~j)\rangle )}{2} = |i - j| - \left\lfloor \frac{|i - j|}{n}\right\rfloor , \end{aligned}$$

and similar floor terms appear in formulas in other affine types (see, e.g., [2, (8.44)]). Thus, even in the simplest case of the depth of a single transposition, the formula is somewhat unattractive; it seems reasonable to expect the level of complication to grow for elements that require longer factorizations.