1 Introduction and Notation

An interest in averaging two or more equating functions can arise in various settings. As the motivation for the angle bisector method described later in this paper, Angoff (1971) mentioned situations with multiple estimates of the same linear equating function for which averaging the different estimates may be appropriate. In the nonequivalent groups with anchor test (NEAT) equating design, several possible linear and nonlinear equating methods are available. These are based on different assumptions about the missing data in that design (von Davier, Holland, & Thayer, 2004b). It might be useful to average the results of some of the options for a final compromise method. Other recent proposals include averaging an estimated equating function with the identity transformation to achieve more stability in small samples (Kim, von Davier, & Haberman, 2008) as well as creating hybrid equating functions that are averages of linear and equipercentile equating functions, putting more weight on one than on the other (von Davier, Fournier-Zajac, & Holland, 2006). In his discussion of the angle bisector, Angoff implicitly weighted the two linear functions equally. The idea of weighting the two functions differently is a natural and potentially useful added flexibility to the averaging process that we use throughout our discussion.

We denote by e 1(x) and e 2(x) two different equating functions for linking scores on test X to scores on test Y. We will assume that e 1(x) and e 2(x) are strictly increasing continuous functions of x over the entire real line. The use of the entire real line is appropriate for both linear equating functions and for the method of kernel equating (von Davier et al., 2004b). Our main discussion concerns averages of equating functions that are defined for all real x.

Suppose it is desired to average e 1(x) and e 2(x) in some way, putting weight w on e 1(x) and 1 – w on e 2(x). In order to have a general notation for this, we will let ⊕ denote an operator that forms a weighted average of two such functions, e 1 and e 2, and puts weight w on e 1 and 1 – w on e 2. At this point we do not define exactly what ⊕ is and let it stand for any method of averaging. Our notation for any such weighted average of e 1 and e 2 is

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}}$$
(6.1)

to denote the resulting equating function. We denote its value at some X-score, x, by

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right).$$
(6.2)

If there are three such functions, e 1, e 2, e 3, then their weighted average function is denoted as

$${w_{\rm 1}}{e_{\rm 1}} \oplus {w_{\rm 2}}{e_{\rm 2}} \oplus {w_{\rm 3}}{e_{\rm 3}},$$
(6.3)

where the weights, w i , sum to 1.

2 Some Desirable Properties of Averages of Equating Functions

Using our notation we can describe various properties that the operator, ⊕, should be expected to possess. The first five appear to be obvious requirements for any type of averaging process.

2.1 Property 1

Property 1: The order of averaging does not matter, so that

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}} = {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}} \oplus w{e_{\rm 1}}.$$
(6.4)

2.2 Property 2

Property 2: The weighted average should lie between the two functions being averaged, so that

$${\rm if}\ {e_1}(x) \leq {e_2}(x),{\rm then}\ {e_1}(x) \leq w{e_1} \oplus (1 - w){e_2}(x) \leq {e_2}(x).$$
(6.5)

Property 2 also implies the following natural property:

2.3 Property 3

Property 3: If the two equating functions are equal at a score, x, the weighted average has that same common value at x, so that

$${\rm if}\ {e_{\rm 1}}\left( x \right){\rm } = {e_{\rm 2}}\left( x \right),{\rm then}\,w{e_{\rm 1}}\,\oplus\, \left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right){\rm } = {e_{\rm 1}}\left( x \right).$$
(6.6)

2.4 Property 4

It also seems reasonable for the average of two equating functions (that are always strictly increasing and continuous) to have both of these conditions as well. Thus, our next condition is Property 4: For any w, we 1⊕(1 – w)e 2(x) is a continuous and strictly increasing function of x.

2.5 Property 5

When it is desired to average three equating functions, as in Equation 6.3, it also seems natural to require the averaging process to get the same result as first averaging of a pair of the functions and then averaging that average with the remaining function, that is, Property 5: If w 1, w 2, w 3 are positive and sum to 1.0, then

$${w_{\rm 1}}{e_{\rm 1}} \oplus {w_{\rm 2}}{e_{\rm 2}} \oplus {w_{\rm 3}}{e_{\rm 3}} = {w_{\rm 1}}{e_{\rm 1}} \oplus \left( {1 - {w_1}} \right)[\frac{{{w_2}}}{{1 - {w_1}}}{e_2} \oplus \frac{{{w_3}}}{{1 - {w_1}}}{e_3}].$$
(6.7)

Again, without dwelling on notational issues in Equation 6.7, the order of the pair-wise averaging should not matter, either.

2.6 Property 6

There are other, less obvious assumptions that one might expect of an averaging operator for equating functions. One of them is Property 6: If e 1 and e 2 are linear functions then so is we 1 ⊕ (1 – w)e 2, for any w. We think that Property 6 is a reasonable restriction to add to the list, because one justification for the linear equating function is its simplicity. An averaging process that changed linear functions to a nonlinear one seems to us to add a complication where there was none before.

2.7 Property 7

In addition to Properties 1–6 for ⊕, there is one very special property that has long been regarded as important for any equating function—the property of symmetry. This means that linking X to Y via the function y = e(x) is assumed to imply that the link from Y to X is given by the inverse function, x = e −1(y), as noted by Dorans and Holland (2000). The traditional interpretation of the symmetry condition when applied to averaging equating functions is that averaging the inverse functions, \(e_1^{ - 1}\) and \(e_2^{ - 1}\), results in the inverse function of the average of e 1 and e 2.

Using our notation for ⊕, the condition of symmetry may be expressed as Property 7:

$${\rm For\,any}\,w,\,{\left( {w\,{e_1} \oplus (1 - w){e_2}} \right)^{ - 1}} = we_1^{ - 1} \oplus (1 - w)e_2^{ - 1}.$$
(6.8)

From Equation 6.8 we can see that the symmetry property requires that the averaging operator, ⊕, be formally distributive relative to the inverse operator.

3 The Point-Wise Weighted Average

The simplest type of weighted average that comes to mind is the simple point-wise weighted average of e 1 and e 2. It is defined as

$$m\left( x \right){\rm } = w{e_{\rm 1}}\left( x \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right),$$
(6.9)

where w is a fixed value, such as w = 1/2.

Geometrically, m is found by averaging the values of e 1 and e 2 along the vertical line located at x. For its heuristic value, our notation in Equations 6.1 and 6.2 was chosen to mimic Equation 6.9 as much as possible. In general, m(x) in Equation 6.9 will satisfy Properties 1–6, for any choice of w. However, m(x) will not always satisfy the symmetry property, Property 7. That is, if the inverses, \(e_1^{ - 1}(\,y)\) and \(e_2^{ - 1}(y)\), are averaged to obtain

$$m^*(y) = we_1^{ - 1}(\,y) + (1 - {\rm w})e_2^{ - 1}(y),$$
(6.10)

then only in special circumstances will m*(y) be the inverse of m(x) in Equation 6.9.

This is easiest to see when e 1 and e 2 are linear. For example, suppose e 1 and e 2 have the form

$${e_{\rm 1}}\left( x \right){\rm } = {a_{\rm 1}} + {b_{\rm 1}}x,{\rm \,and\,}\,{e_{\rm 2}}\left( x \right){\rm } = {\rm } = {a_{\rm 2}} + {b_{\rm 2}}x.$$
(6.11)

The point-wise weighted average of Equation 6.11 becomes

$${m}\left( {x} \right) = \bar a + \bar b{x},$$
(6.12)

where

$$\bar b = {w}{{\,b}_1} + (1 - {w}){{\,b}_2}\,\ {\rm and}\,\ \bar a = {w}{{\,a}_1} + (1 - {w}){{a}_2}.$$
(6.13)

However, the inverse functions for e 1 and e 2 are also linear with slopes 1/b 1 and 1/b 2, respectively. Thus, the point-wise average of the inverse functions, m*(x), has a slope that is the average of the reciprocals of the b i s:

$$b^* = w\left( {{\rm 1}/{b_{\rm 1}}} \right) + \left( {{\rm 1} - w} \right)(1/{b_2}).$$
(6.14)

The inverse function of m*(x) is also linear and has slope 1/b*, where b* is given in Equation 6.14. Thus, the slope of the inverse of m*(x) is the harmonic mean of b 1 and b 2. So, in order for the slope of the inverse of m*(x) to be the point-wise weighted average of the slopes of e 1 and e 2, the mean and the harmonic means of b 1 and b 2 must be equal. It is well known that this is only true if b 1 and b 2 are equal, in which case the equating functions are parallel. It is also easy to show that the intercepts do not add any new conditions. Thus we have Result 1 below.

3.1 Result 1

Result 1: The point-wise weighted average in Equation 6.9 satisfies the symmetry property for two linear equating functions if and only if the slopes, b 1 and b 2, are equal.

When e 1(x) and e 2(x) are non-linear they may still be parallel with a constant difference between them, that is

$${e_{\rm 1}}\left( x \right){\rm } = {e_{\rm 2}}\left( x \right){\rm } + c\ {\rm for\,\ all\ }\,x.$$
(6.15)

When Equation 6.15 holds, it is easy to establish Result 2.

3.2 Result 2

Result 2: If e 1 (x) and e 2 (x) are nonlinear but parallel so that Equation 6.15 holds, then the point-wise weighted average also will satisfy the symmetry property, Property 7. In this case, the point-wise average is simply a constant added (or subtracted) to either e 1 or e 2, for example,

$$m\left( x \right){\rm } = {e_{\rm 2}}\left( x \right){\rm } + wc = {e_{\rm 1}}\left( x \right){\rm }-{\rm }\left( {{\rm 1}-w} \right)c.$$
(6.16)

Thus, although the point-wise weighted average does not always satisfy the symmetry property, it does satisfy it if e 1(x) and e 2(x) are parallel curves or lines.

4 The Angle Bisector Method of Averaging Two Linear Functions

Angoff (1971) made passing reference to the angle bisector method of averaging two linear equating functions. In discussions with Angoff, Holland was informed that this method was explicitly proposed as a way of preserving the symmetry property, Property 7. Figure 6.1 illustrates the angle bisector, denoted by e AB .

Fig. 6.1
figure 1

The angle bisector is also the chord bisector

While the geometry of the angle bisector is easy to understand, for computations a formula is more useful. Holland and Strawderman (1989) give such a formula. We state their result next, and outline its proof in Section 6.5.

4.1 Result 3: Computation of the Unweighted Angle Bisector

Result 3: If e 1(x) and e 2(x) are two linear equating functions as in Equation 6.9 that intersect at a point, then the linear function that bisects the angle between them is the point-wise weighted average

$${e_{AB}} = W{e_{\rm 1}} + {\rm }\left( {{\rm 1}-W} \right){e_{\rm 2}},$$
(6.17)

with W given by

$$W = \frac{{{{(1 + b_1^2)}^{ - 1/2}}}}{{{{(1 + b_1^2)}^{ - 1/2}} + {{(1 + b_2^2)}^{ - 1/2}}}}.$$
(6.18)

Note that in Result 3, if the two slopes are the same then W = ½ and the formula for the angle bisector reduces to the equally weighted point-wise average of the two parallel lines. It may be shown directly that the angle bisector given by Equations 6.17 and 6.18 satisfies the symmetry property, Property 7, for any two linear equating functions. Thus, in order for the point-wise weighted average Equation 6.9 to satisfy Property 7 for any pair of linear equating functions, it is necessary for the weight, w, to depend on the functions being averaged. It cannot be the same value for all pairs of functions.

5 Some Generalizations of the Angle Bisector Method

One way to understand the angle bisector for two linear functions is to imagine a circle of radius 1 centered at the point of intersection of the two lines. For simplicity, and without loss of generality, assume that the intersection point is at the origin, (x, y) = (0, 0). This is also illustrated in Figure 6.1.

The linear function, e i , intersects the circle at the point (x i , y i ) = (x i , b i x i ), and because the circle has radius 1 we have

$$\begin{array}{ll}&{({x_i})^2} + {({b_i}{x_i})^2} = 1,\,{\rm or}\\&{x_{i}} = (1 + {({b_i})^2}){ - ^{1/2}}.\end{array}$$
(6.19)

Thus, the linear function, e i , intersects the circle at the point

$$\left( {{x_i},{y_i}} \right) = ((1 + {({b_i})^2}){ ^-{^1}{^/}{^2}},\ {b_i}(1 + {({b_i})^2}){ ^- {^1}{^/}{^2}}).$$
(6.20)

The line that bisects the angle between e 1 and e 2 also bisects the chord that connects the intersection points, (x 1, y 1) and (x 2, y 2) given in Equation 6.20. The point of bisection of the chord is ((x 1 + x 2)/2, (y 1 + y 2)/2). From this it follows that the line through the origin that goes through the point of bisection of the chord has the slope,

$$b = \frac{{{y_1} + {y_2}}}{{{x_1} + {x_2}}} = W{b_1} + (1 - W){b_2},$$
(6.21)

where W is given by Equation 6.18. This shows that the angle bisector is the point-wise weighted average given in Result 3.

One way to generalize the angle bisector to include weights, as in Equation 6.1, is to divide the chord between (x 1, y 1) and (x 2, y 2) given in Equation 6.20 proportionally to w and 1 – w instead of bisecting it. If we do this, the point on the chord that is w of the way from (x 2, y 2) to (x 1, y 1) is (wx 1 + (1 – w)x 2, wy 1 + (1 – w)y 2). It follows that the line through the origin that goes through this w-point on the chord has the slope

$$b = \frac{{w{y_1} + (1 - w){y_2}}}{{w{x_1} + (1 - w){x_2}}} = W{b_1} + (1 - W){b_2},$$
(6.22)

where W is now given by

$$W = \frac{{w{{(1 + b_1^2)}^{ - 1/2}}}}{{w{{(1 + b\,_1^2)}^{ - 1/2}} + (1 - w){{(1 + b\,_2^2)}^{ - 1/2}}}}.$$
(6.23)

Hence, a weighted generalization of the angle bisector of two linear equating functions is given by Equation 6.17, with W specified by Equation 6.23.

This generalization of the angle bisector will divide the angle between e 1 and e 2 proportionally to w and 1 – w only when w = ½. Otherwise, this generalization only approximately divides the angle proportionately. In addition, direct calculations show that this generalization of the angle bisector will satisfy all of the properties, Properties 1–7.

However, the angle bisector may be generalized in other ways as well. For example, instead of a circle centered at the point of intersection, suppose we place an L p -circle there instead. An L p -circle is defined by Equation 6.24:

$${\left| x \right|^p} + |y{|^p} = {\rm 1},$$
(6.24)

where p > 0. Examples of L p -circles for various choices of p = 1 and 3 are given in Figures 6.2 and 6.3.

Fig. 6.2
figure 2

Plot of the unit L p -circle, p = 1

Fig. 6.3
figure 3

Plot of the unit L p -circle, p = 3

If we now use the chord that connects the intersection points of the two lines with a given L p -circle, as we did above for the ordinary circle, we find the following generalization of the angle bisector. We form the point-wise weighted average in Equation 6.17, but we use as W the following weight:

$$W = \frac{{w{{(1 + b_1^p)}^{ - 1/p}}}}{{w{{(1 + b\hskip1pt_1^p)}^{ - 1/p}} + (1 - w){{(1 + b\hskip1pt_2^p)}^{ - 1/p}}}},$$
(6.25)

for some p > 0, and 0 < w < 1. It is a simple exercise to show that the use of W from Equation 6.25 as the weight in Equation 6.17 also will satisfy Properties 1–7 for any choice of p > 0, and 0 < w < 1. We will find the case of p = 1 of special interest later. In that case W has the form

$$W = \frac{{w{{(1 + b_1^{})}^{ - 1}}}}{{w{{(1 + b_1^{})}^{ - 1}} + (1 - w){{(1 + b_2^{})}^{ - 1}}}}.$$
(6.26)

Thus, the system of weighted averages (Equation 6.17) with weights that depend on the two slopes, as in Equation 6.25, produces a variety of ways to average linear equating functions that all satisfy Properties 1–7. Thus, the angle bisector is seen to be only one of an infinite family of possibilities. It is worth mentioning here that when w = ½, all of these averages of two linear equating functions using Equation 6.25 have the property of putting more weight on the line with the smaller slope. As a simple example, if b 1 = 1 and b 2 = 2, then for w = ½, Equation 6.26 gives the value W = 0.6 for the case of p = 1.

An apparent limitation of all of these circle methods of averaging two linear functions is that they do not immediately generalize to the case of three or more such functions. When there are three functions, they do not necessarily meet at a point; there could be three intersection points. In such a case, the idea of using an L p -circle centered at the “point of intersection” makes little sense. However, the condition Property 5 gives us a way out of this narrow consideration. Applying it to the point-wise weighted average results obtained so far, it is tedious but straight-forward to show that the multiple function generalization of Equation 6.17 coupled with Equation 6.25 is given by Result 4.

5.1 Result 4

Result 4: If {w i } are positive and sum to 1.0 and if {e i } are linear equating functions, then Property 5 requires that the pair-wise averages based on Equations 6.17 and 6.25 lead to

$${w_1}{e_1} \oplus {w_2}{e_2} \oplus {w_3}{e_3} \oplus \ldots = \sum\limits_i {{W_i}{e_i}}$$
(6.27)

where

$${W_i}\frac{{{w_i}{{(1 + b\,_i^p)}^{ - 1/p}}}}{{\sum\limits_j {{w_j}{{(1 + b\,_j^p)}^{ - 1/p}}} }}.$$
(6.28)

Result 4 gives a solution to the problem of averaging several different linear equating functions that is easily applied in practice, once choices for p and w are made.

Holland and Strawderman (1989) introduced the idea of the symmetric weighted average (swave) of two equating functions that satisfies conditions of Properties 1–7 for any pair of linear or nonlinear equating functions. In the next two sections we develop a generalization of the symmetric average.

6 The Geometry of Inverse Functions and Related Matters

To begin, it is useful to illustrate the geometry of a strictly increasing continuous function, y = e(x), and its inverse, x = e −1(y). First, fix a value of x in the domain of e(·), and let y = e(x). Then the four points, (x, y), (x, x), (y, y) and (y, x), form the four corners of a square in the (x, y) plane, where the length of each side is |xy|. The two points, (x, x) and (y, y), both lie on the 45-degree line; the other two points lie on opposite sides of the 45-degree line on a line that is at right angles, or orthogonal, to it. In addition, (x, y) and (y, x) are equidistant from the 45-degree line. However, by definition of the inverse function, when y = e(x), it is also the case that x = e -1(y). Hence, the four points mentioned above can be re-expressed as (x, e(x)), (x, x), (e(x), e(x)), and (y, e -1(y)), respectively.

The points (x, e(x)) and (y, e -1(y)) are equidistant from the 45-degree line and on opposite sides of it. Furthermore, the line connecting them is orthogonal to the 45-degree line and is bisected by it. These simple facts are important for the rest of this discussion. For example, from them we immediately can conclude that the graphs of e(·) and e -1(·) are reflections of each other about the 45-degree line in the (x, y) plane. This observation is the basis for the swave defined in Section 6.7.

Another simple fact that we will make repeated use of is that a strictly increasing continuous function of x, e(x), crosses any line that is orthogonal to the 45-degree line in exactly one place. This is illustrated in Figure 6.4 for the graphs of two functions. In order to have a shorthand term for lines that are orthogonal to the 45-degree line, we will call them the orthogonal lines when this is unambiguous.

Fig. 6.4
figure 4

An illustration of the intersections of e 1(x) and e 2(x) with an orthogonal line

We recall the elementary fact that the equation for what we are calling an orthogonal line is

$$y = - x + c,{\rm or}\,y + x = c,{\rm for\,some\,constant},c.$$
(6.29)

Thus, we have the relationship

$$e\left( {{x_{\rm 1}}} \right){\rm } + {x_{\rm 1}} = c = y + x$$
(6.30)

for any other point, (x, y), that is on the orthogonal line. Equation 6.30 plays an important role in the definition of the swave in Section 6.7. Finally, we note that if e is a strictly increasing continuous function, then its inverse, e −1, is one as well.

7 The Swave: The Symmetric w-Average of Two Equating Functions

With this preparation, we are ready to define the symmetric w-average or swave of two linear or nonlinear equating functions, e 1(x) and e 2(x). Note that from the above discussion, any orthogonal line, of the form given by Equation 6.29, will intersect e 1(x) at a point, x 1, and e 2(x) at another point, x 2. This is also illustrated in Figure 6.4.

The idea is that the value of the swave, e w (·), is given by the point on the orthogonal line that corresponds to the weighted average of the two points, (x 1, e 1(x 1)) and (x 2, e 2(x 2)):

$$\left( {\bar x,{{e}_w}\left( {\bar x} \right)} \right) = {\rm w}\left( {{{x}_{1,}}{{e}_1}\left( {{{x}_1}} \right)} \right) + \left( {1 - {\rm w}} \right)\left( {{{x}_2},{{e}_2}\left( {{{x}_2}} \right)} \right).$$
(6.31)

This is illustrated in Figure 6.5.

Fig. 6.5
figure 5

An illustration of the swave of e 1(x) and e 2(x) at x = w x 1 + (1 – w ) x 2 for a w greater than ½

The point, (\(\left( {\bar x,{e_w}\left( {\bar x} \right)} \right)\)), is the weighted average of the two points, (x 1, e 1(x 1)) and (x 2, e 2(x 2)). Thus,

$$\bar x = w{x_1} + \left( {1 - w} \right){x_2},$$
(6.32)

and

$${e_w}\left( {\bar x} \right) = w{e_1}({x_1}) + (1 - w){e_2}({x_2}).$$
(6.33)

In Equation 6.31, X is given by Equation 6.32. In order to define e w (x) for an arbitrary point, x, we start with x and define x 1 = x – (1 – w)t and x 2 = x + wt for some, as yet unknown, positive or negative value, t. Note that from the definitions of x 1 and x 2, their weighted average, wx 1 + (1 – w)x 2, equals x, so the given x can play the role X of Equation 6.32.

Next, we find a value of t such that (x 1, e 1(x 1)) and (x 2, e 2(x 2)) lie on the same orthogonal line, as in Figure 6.5. From Equation 6.30, this condition on t requires that Equation 6.34 is satisfied:

$${e_{\rm 1}}\left( {{x_{\rm 1}}} \right){\rm } + {x_{\rm 1}} = {e_{\rm 2}}\left( {{x_{\rm 2}}} \right){\rm } + {x_{\rm 2}}.$$
(6.34)

Equation 6.34 may be expressed in terms of x and t as

$${e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm } + x-{\rm }\left( {{\rm 1}-w} \right)t = {e_{\rm 2}}\left( {x + wt} \right){\rm } + x + wt$$

or

$$t = {e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm }-{e_{\rm 2}}\left( {x + wt} \right)$$
(6.35)

Equation 6.35 plays an important role in what follows.

In general, for each value of x, Equation 6.35 is a nonlinear equation in t. As we show in the Appendix, for any choice of x and w and for any strictly increasing continuous equating functions, e 1 and e 2, Equation 6.35 always has a unique solution for t. The solution of Equation 6.35 for t implicitly defines t as a function of x, which we denote by t(x). Once t(x) is in hand, the value of the swave at x, e w (x), is computed from the expression,

$${e_w}\left( x \right){\rm } = w{e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t\left( x \right)} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {x + wt\left( x \right)} \right)$$
(6.36)

The definition of e w in Equation 6.36 is an example of the operator ⊕ in Equation 6.1. In Equation 6.36, there is a clear sense in which the weight w is applied to e 1 and 1 – w is applied to e 2. We show later that the swave differs from the point-wise weighted average in Equation 6.9, except when the two equating functions are parallel, as discussed above. Moreover, the definition of the swave is a process that requires the whole functions, e 1 and e 2, rather than just their evaluation at the selected x-value. In the Appendix we show that the solution for t in Equations 6.35 and 6.36 is unique.

In the Appendix we show that the swave satisfies conditions in Properties 2 and 4. We discuss below the application of the swave to linear equating functions and show that it satisfies Property 6. That the swave satisfies Property 7, the symmetry property, is given in Result 5, next.

7.1 Result 5

Result 5: The swave, e w (x), defined by Equations 6.35 and 6.36, satisfies the symmetry property, Property 7.

Proof

Suppose we start with the inverse functions, \(e_1^{ - 1}\) and \(e_2^{ - 1}\), and form their swave, denoted e* w (y), for a given y-value. Then Equations 6.35 and 6.36 imply that for any choice of y there is a value t* that satisfies

$$t^* = e_1^{ - 1}\left( {y - \left( {1 - w} \right)t^*} \right) - e_2^{ - 1}\left( {x + wt^*} \right)$$
(6.37)

and

$$e{*_w}\left( y \right) = we_1^{ - 1}\left( {y - \left( {1 - w} \right)t*} \right) + \left( {1 - w} \right)e_2^{ - 1}\left( {y + wt*} \right).$$
(6.38)

Now let

$${y_{\rm 1}} = y-{\rm }\left( {{\rm 1}-w} \right)t^*,{\rm and}\,{y_{\rm 2}} = y + wt^*.$$

Also, define x, x 1, and x 2 by

$$x = e{*_w}\left( y \right),{x_1} = e_1^{ - 1}\left( {{y_1}} \right),\,{\rm and}\,{x_2} = e_2^{ - 1}\left( {{y_2}} \right).$$
(6.39)

Hence, by definition of the inverse,

$${y_{\rm 1}} = {e_{\rm 1}}\left( {{x_{\rm 1}}} \right),{y_{\rm 2}} = {e_{\rm 2}}\left( {{x_{\rm 2}}} \right),{\rm and}\,y = e{*_w}^{ - {\rm 1}}\left( x \right).$$
(6.40)

From the definition of the swave, the following three points are all on the same orthogonal line:

$$\left( {y,e{*_w}\left( y \right)} \right),{\rm }({y_{\rm 1}}, e^{-1}_{1} \left( {{y_{\rm 1}}} \right)),{\rm and}({y_{\rm 2}}, e^{-1}_{2} \left( {{y_{\rm 2}}} \right)).$$

However, using the relationships in Equations 6.38 and 6.39, these three points are the same as the following three points, which are also on that orthogonal line:

$$\left( {e{*_w}^{ - 1}\left( x \right),x} \right),\left( {{e_1}\left( {{x_1}} \right),{x_1}} \right),\,{\rm and}\,\left( {{{\rm e}_{\rm 2}}\left( {{x_2}} \right),{x_2}} \right).$$

Furthermore, the following three points are on that same orthogonal line:

$$\left( {x,e{*_w}^{ - {\rm 1}}\left( x \right)} \right),{\rm }\left( {{x_{\rm 1}},{e_{\rm 1}}\left( {{x_{\rm 1}}} \right)} \right),{\rm and}\left( {{x_{\rm 2}},{e_{\rm 2}}\left( {{x_{\rm 1}}} \right)} \right).$$

Yet, from Equation 6.38 it follows that

$$x = w{x_{\rm 1}} + {\rm }\left( {{\rm 1}-w} \right){x_{\rm 2}},$$
(6.41)

so we let t = x 2x 1, and therefore, x 1 = x – (1 – w)t, and x 2 = x + wt.

Furthermore, from the definitions of y 1 and y 2, we have

$$y = w{y_{\rm 1}} + {\rm }\left( {{\rm 1}-w} \right){y_{\rm 2}}$$

and therefore

$$y = w{e_{\rm 1}}\left( {{x_{\rm 1}}} \right) + \left( {{\rm 1} - w} \right){e_{\rm 2}}\left( {{x_2}} \right),$$

so that

$$y = e{*_w}^{ - {\rm 1}}\left( x \right){\rm } = w{e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {x + wt} \right).$$
(6.42)

Thus, the inverse function, e* w −1, satisfies Equation 6.36 for the swave. The only question remaining is whether the value of t in Equation 6.42 satisfies Equation 6.35.

However, the points (x 1, e 1(x 1)) and (x 2, e 2(x 2)) are on the same orthogonal line. Therefore, they satisfy Equation 6.34, from which Equation 6.35 for t follows.

This shows that the inverse function, e* w −1, satisfies the condition of the symmetric w-average, e w , so that from the uniqueness of the solution to Equation 6.35 we have e* w −1 = e w , which proves Result 5. From the definitions of y 1, y 2, x 1, and x 2 in the proof of Result 5, it is easy to see that the t* that solves Equation 6.37 for the inverse functions and the t that solves Equation 6.35 for the original functions are related by t* = x 1x 2 = – t, so that t and t* have the same magnitude but the opposite sign.

8 The Swave of Two Linear Equating Functions

In this section we examine the form of the swave in the linear case. The equation for t, Equation 6.35, now becomes linear in t and can be solved explicitly. So assume that

$${e_{\rm 1}}\left( x \right){\rm } = {a_{\rm 1}} + {b_{\rm 1}}x,{\rm and}\,{e_{\rm 2}}\left( x \right){\rm } = {a_{\rm 2}} + {b_{\rm 2}}x.$$
(6.43)

Then, Equation 6.35 is

$$t = {a_{\rm 1}} + {\rm }{b_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm }-{a_{\rm 2}}-{b_{\rm 2}}\left( {x + wt} \right).$$

Hence,

$$t\left( {{\rm 1} + {\rm }\left( {{\rm 1}-w} \right){b_{\rm 1}} + w{b_{\rm 2}}} \right){\rm } = {\rm }\left( {{a_{\rm 1}}-{a_{\rm 2}}} \right){\rm } + {\rm }\left( {{b_{\rm 1}}-{b_{\rm 2}}} \right)x$$

so that

$$t(x) = \frac{{({a_1} - {a_2}) + ({b_1} - {b_2})x}}{{1 + (1 - w){b_1} + w{b_2}}}.$$
(6.44)

Substituting the value of t(x) from Equation 6.44 into the equation for e w (x) in Equation 6.36 results in

$${e_w}(x) = \bar a + \bar bx - w(1 - w)({b_1} - {b_2})\frac{{{a_1} - {a_2} + ({b_1} - {b_2})x}}{{1 + (1 - w){b_1} + w{b_2}}},$$
(6.45)

where \(\bar a = w{a_1} + (1 - w){a_2}\) and \(\bar b = w{b_1} + (1 - w){b_2}\) denote the weighted averages of the intercepts and slopes of e 1 and e 2, respectively.

From Equation 6.45 we immediately see that, in the linear case, the swave, e w (x), is identical to the point-wise weighted average in Equation 6.9 if and only if the two slopes, b 1 and b 2, are identical, and the two linear functions are parallel. Simplifying Equation 6.45 further we obtain

$${e_w}\left( x \right){\rm } = W{e_{\rm 1}}\left( x \right){\rm } + {\rm }\left( {{\rm 1}-W} \right){e_{\rm 2}}\left( x \right),$$
(6.46)

where

$$W = \frac{{w{{(1 + {b_1})}^{ - 1}}}}{{w{{(1 + {b_1})}^{ - 1}} + (1 - w){{(1 + {b_2})}^{ - 1}}}}.$$
(6.47)

Thus, in the linear case, the swave is exactly the point-wise weighted average that arises for an L p -circle with p = 1, in other words, Equation 6.26, discussed in Section 6.5. From Result 5, we know that the swave always satisfies the symmetry condition, Property 7, but this is also easily shown directly. We see that, in the linear case, the swave also satisfies Property 6.