How to Average Equating Functions, If You Must

Holland, Paul W.; Strawderman, William E.

doi:10.1007/978-0-387-98138-3_6

Paul W. Holland² &
William E. Strawderman³

Part of the book series: Statistics for Social and Behavioral Sciences ((SSBS))

2395 Accesses
2 Citations

Abstract

An interest in averaging two or more equating functions can arise in various settings. As the motivation for the angle bisector method described later in this paper, Angoff (1971) mentioned situations with multiple estimates of the same linear equating function for which averaging the different estimates may be appropriate. In the nonequivalent groups with anchor test (NEAT) equating design, several possible linear and nonlinear equating methods are available. These are based on different assumptions about the missing data in that design (von Davier, Holland, & Thayer, 2004b). It might be useful to average the results of some of the options for a final compromise method. Other recent proposals include averaging an estimated equating function with the identity transformation to achieve more stability in small samples (Kim, von Davier, & Haberman, 2008) as well as creating hybrid equating functions that are averages of linear and equipercentile equating functions, putting more weight on one than on the other (von Davier, Fournier-Zajac, & Holland, 2006). In his discussion of the angle bisector, Angoff implicitly weighted the two linear functions equally. The idea of weighting the two functions differently is a natural and potentially useful added flexibility to the averaging process that we use throughout our discussion.

Access provided by Autonomous University of Puebla. Download chapter PDF

The Extended Anderson and Hauck Tests and Sample Size Procedures for Equivalence Assessment in Simple Linear Regressions

Article Open access 26 June 2024

Adventitious Error and Its Implications for Testing Relations Between Variables and for Composite Measurement Outcomes

Article Open access 04 July 2024

Optimizing Detection of True Within-Person Effects for Intensive Measurement Designs: A Comparison of Multilevel SEM and Unit-Weighted Scale Scores

Article 18 February 2020

1 Introduction and Notation

An interest in averaging two or more equating functions can arise in various settings. As the motivation for the angle bisector method described later in this paper, Angoff (1971) mentioned situations with multiple estimates of the same linear equating function for which averaging the different estimates may be appropriate. In the nonequivalent groups with anchor test (NEAT) equating design, several possible linear and nonlinear equating methods are available. These are based on different assumptions about the missing data in that design (von Davier, Holland, & Thayer, 2004b). It might be useful to average the results of some of the options for a final compromise method. Other recent proposals include averaging an estimated equating function with the identity transformation to achieve more stability in small samples (Kim, von Davier, & Haberman, 2008) as well as creating hybrid equating functions that are averages of linear and equipercentile equating functions, putting more weight on one than on the other (von Davier, Fournier-Zajac, & Holland, 2006). In his discussion of the angle bisector, Angoff implicitly weighted the two linear functions equally. The idea of weighting the two functions differently is a natural and potentially useful added flexibility to the averaging process that we use throughout our discussion.

We denote by e ₁(x) and e ₂(x) two different equating functions for linking scores on test X to scores on test Y. We will assume that e ₁(x) and e ₂(x) are strictly increasing continuous functions of x over the entire real line. The use of the entire real line is appropriate for both linear equating functions and for the method of kernel equating (von Davier et al., 2004b). Our main discussion concerns averages of equating functions that are defined for all real x.

Suppose it is desired to average e ₁(x) and e ₂(x) in some way, putting weight w on e ₁(x) and 1 – w on e ₂(x). In order to have a general notation for this, we will let ⊕ denote an operator that forms a weighted average of two such functions, e ₁ and e ₂, and puts weight w on e ₁ and 1 – w on e ₂. At this point we do not define exactly what ⊕ is and let it stand for any method of averaging. Our notation for any such weighted average of e ₁ and e ₂ is

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}}$$

(6.1)

to denote the resulting equating function. We denote its value at some X-score, x, by

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right).$$

(6.2)

If there are three such functions, e ₁, e ₂, e ₃, then their weighted average function is denoted as

$${w_{\rm 1}}{e_{\rm 1}} \oplus {w_{\rm 2}}{e_{\rm 2}} \oplus {w_{\rm 3}}{e_{\rm 3}},$$

(6.3)

where the weights, w _i, sum to 1.

2 Some Desirable Properties of Averages of Equating Functions

Using our notation we can describe various properties that the operator, ⊕, should be expected to possess. The first five appear to be obvious requirements for any type of averaging process.

2.1 Property 1

Property 1: The order of averaging does not matter, so that

$$w{e_{\rm 1}} \oplus \left( {{\rm 1}-w} \right){e_{\rm 2}} = {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}} \oplus w{e_{\rm 1}}.$$

(6.4)

2.2 Property 2

Property 2: The weighted average should lie between the two functions being averaged, so that

$${\rm if}\ {e_1}(x) \leq {e_2}(x),{\rm then}\ {e_1}(x) \leq w{e_1} \oplus (1 - w){e_2}(x) \leq {e_2}(x).$$

(6.5)

Property 2 also implies the following natural property:

2.3 Property 3

Property 3: If the two equating functions are equal at a score, x, the weighted average has that same common value at x, so that

$${\rm if}\ {e_{\rm 1}}\left( x \right){\rm } = {e_{\rm 2}}\left( x \right),{\rm then}\,w{e_{\rm 1}}\,\oplus\, \left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right){\rm } = {e_{\rm 1}}\left( x \right).$$

(6.6)

2.4 Property 4

It also seems reasonable for the average of two equating functions (that are always strictly increasing and continuous) to have both of these conditions as well. Thus, our next condition is Property 4: For any w, we ₁⊕(1 – w)e ₂(x) is a continuous and strictly increasing function of x.

2.5 Property 5

When it is desired to average three equating functions, as in Equation 6.3, it also seems natural to require the averaging process to get the same result as first averaging of a pair of the functions and then averaging that average with the remaining function, that is, Property 5: If w ₁, w ₂, w ₃ are positive and sum to 1.0, then

$${w_{\rm 1}}{e_{\rm 1}} \oplus {w_{\rm 2}}{e_{\rm 2}} \oplus {w_{\rm 3}}{e_{\rm 3}} = {w_{\rm 1}}{e_{\rm 1}} \oplus \left( {1 - {w_1}} \right)[\frac{{{w_2}}}{{1 - {w_1}}}{e_2} \oplus \frac{{{w_3}}}{{1 - {w_1}}}{e_3}].$$

(6.7)

Again, without dwelling on notational issues in Equation 6.7, the order of the pair-wise averaging should not matter, either.

2.6 Property 6

There are other, less obvious assumptions that one might expect of an averaging operator for equating functions. One of them is Property 6: If e ₁ and e ₂ are linear functions then so is we ₁ ⊕ (1 – w)e ₂, for any w. We think that Property 6 is a reasonable restriction to add to the list, because one justification for the linear equating function is its simplicity. An averaging process that changed linear functions to a nonlinear one seems to us to add a complication where there was none before.

2.7 Property 7

In addition to Properties 1–6 for ⊕, there is one very special property that has long been regarded as important for any equating function—the property of symmetry. This means that linking X to Y via the function y = e(x) is assumed to imply that the link from Y to X is given by the inverse function, x = e ⁻¹(y), as noted by Dorans and Holland (2000). The traditional interpretation of the symmetry condition when applied to averaging equating functions is that averaging the inverse functions, $e_1^{ - 1}$ and $e_2^{ - 1}$, results in the inverse function of the average of e ₁ and e ₂.

Using our notation for ⊕, the condition of symmetry may be expressed as Property 7:

$${\rm For\,any}\,w,\,{\left( {w\,{e_1} \oplus (1 - w){e_2}} \right)^{ - 1}} = we_1^{ - 1} \oplus (1 - w)e_2^{ - 1}.$$

(6.8)

From Equation 6.8 we can see that the symmetry property requires that the averaging operator, ⊕, be formally distributive relative to the inverse operator.

3 The Point-Wise Weighted Average

The simplest type of weighted average that comes to mind is the simple point-wise weighted average of e ₁ and e ₂. It is defined as

$$m\left( x \right){\rm } = w{e_{\rm 1}}\left( x \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( x \right),$$

(6.9)

where w is a fixed value, such as w = 1/2.

Geometrically, m is found by averaging the values of e ₁ and e ₂ along the vertical line located at x. For its heuristic value, our notation in Equations 6.1 and 6.2 was chosen to mimic Equation 6.9 as much as possible. In general, m(x) in Equation 6.9 will satisfy Properties 1–6, for any choice of w. However, m(x) will not always satisfy the symmetry property, Property 7. That is, if the inverses, $e_1^{ - 1}(\,y)$ and $e_2^{ - 1}(y)$, are averaged to obtain

$$m^*(y) = we_1^{ - 1}(\,y) + (1 - {\rm w})e_2^{ - 1}(y),$$

(6.10)

then only in special circumstances will m*(y) be the inverse of m(x) in Equation 6.9.

This is easiest to see when e ₁ and e ₂ are linear. For example, suppose e ₁ and e ₂ have the form

$${e_{\rm 1}}\left( x \right){\rm } = {a_{\rm 1}} + {b_{\rm 1}}x,{\rm \,and\,}\,{e_{\rm 2}}\left( x \right){\rm } = {\rm } = {a_{\rm 2}} + {b_{\rm 2}}x.$$

(6.11)

The point-wise weighted average of Equation 6.11 becomes

$${m}\left( {x} \right) = \bar a + \bar b{x},$$

(6.12)

where

$$\bar b = {w}{{\,b}_1} + (1 - {w}){{\,b}_2}\,\ {\rm and}\,\ \bar a = {w}{{\,a}_1} + (1 - {w}){{a}_2}.$$

(6.13)

However, the inverse functions for e ₁ and e ₂ are also linear with slopes 1/b ₁ and 1/b ₂, respectively. Thus, the point-wise average of the inverse functions, m*(x), has a slope that is the average of the reciprocals of the b _is:

$$b^* = w\left( {{\rm 1}/{b_{\rm 1}}} \right) + \left( {{\rm 1} - w} \right)(1/{b_2}).$$

(6.14)

The inverse function of m*(x) is also linear and has slope 1/b*, where b* is given in Equation 6.14. Thus, the slope of the inverse of m*(x) is the harmonic mean of b ₁ and b ₂. So, in order for the slope of the inverse of m*(x) to be the point-wise weighted average of the slopes of e ₁ and e ₂, the mean and the harmonic means of b ₁ and b ₂ must be equal. It is well known that this is only true if b ₁ and b ₂ are equal, in which case the equating functions are parallel. It is also easy to show that the intercepts do not add any new conditions. Thus we have Result 1 below.

3.1 Result 1

Result 1: The point-wise weighted average in Equation 6.9 satisfies the symmetry property for two linear equating functions if and only if the slopes, b ₁ and b ₂, are equal.

When e ₁(x) and e ₂(x) are non-linear they may still be parallel with a constant difference between them, that is

$${e_{\rm 1}}\left( x \right){\rm } = {e_{\rm 2}}\left( x \right){\rm } + c\ {\rm for\,\ all\ }\,x.$$

(6.15)

When Equation 6.15 holds, it is easy to establish Result 2.

3.2 Result 2

Result 2: If e ₁ (x) and e ₂ (x) are nonlinear but parallel so that Equation 6.15 holds, then the point-wise weighted average also will satisfy the symmetry property, Property 7. In this case, the point-wise average is simply a constant added (or subtracted) to either e ₁ or e ₂, for example,

$$m\left( x \right){\rm } = {e_{\rm 2}}\left( x \right){\rm } + wc = {e_{\rm 1}}\left( x \right){\rm }-{\rm }\left( {{\rm 1}-w} \right)c.$$

(6.16)

Thus, although the point-wise weighted average does not always satisfy the symmetry property, it does satisfy it if e ₁(x) and e ₂(x) are parallel curves or lines.

4 The Angle Bisector Method of Averaging Two Linear Functions

Angoff (1971) made passing reference to the angle bisector method of averaging two linear equating functions. In discussions with Angoff, Holland was informed that this method was explicitly proposed as a way of preserving the symmetry property, Property 7. Figure 6.1 illustrates the angle bisector, denoted by e _AB.

While the geometry of the angle bisector is easy to understand, for computations a formula is more useful. Holland and Strawderman (1989) give such a formula. We state their result next, and outline its proof in Section 6.5.

4.1 Result 3: Computation of the Unweighted Angle Bisector

Result 3: If e ₁(x) and e ₂(x) are two linear equating functions as in Equation 6.9 that intersect at a point, then the linear function that bisects the angle between them is the point-wise weighted average

$${e_{AB}} = W{e_{\rm 1}} + {\rm }\left( {{\rm 1}-W} \right){e_{\rm 2}},$$

(6.17)

with W given by

$$W = \frac{{{{(1 + b_1^2)}^{ - 1/2}}}}{{{{(1 + b_1^2)}^{ - 1/2}} + {{(1 + b_2^2)}^{ - 1/2}}}}.$$

(6.18)

Note that in Result 3, if the two slopes are the same then W = ½ and the formula for the angle bisector reduces to the equally weighted point-wise average of the two parallel lines. It may be shown directly that the angle bisector given by Equations 6.17 and 6.18 satisfies the symmetry property, Property 7, for any two linear equating functions. Thus, in order for the point-wise weighted average Equation 6.9 to satisfy Property 7 for any pair of linear equating functions, it is necessary for the weight, w, to depend on the functions being averaged. It cannot be the same value for all pairs of functions.

5 Some Generalizations of the Angle Bisector Method

One way to understand the angle bisector for two linear functions is to imagine a circle of radius 1 centered at the point of intersection of the two lines. For simplicity, and without loss of generality, assume that the intersection point is at the origin, (x, y) = (0, 0). This is also illustrated in Figure 6.1.

The linear function, e _i, intersects the circle at the point (x _i, y _i) = (x _i, b _i x _i), and because the circle has radius 1 we have

$$\begin{array}{ll}&{({x_i})^2} + {({b_i}{x_i})^2} = 1,\,{\rm or}\\&{x_{i}} = (1 + {({b_i})^2}){ - ^{1/2}}.\end{array}$$

(6.19)

Thus, the linear function, e _i, intersects the circle at the point

$$\left( {{x_i},{y_i}} \right) = ((1 + {({b_i})^2}){ ^-{^1}{^/}{^2}},\ {b_i}(1 + {({b_i})^2}){ ^- {^1}{^/}{^2}}).$$

(6.20)

The line that bisects the angle between e ₁ and e ₂ also bisects the chord that connects the intersection points, (x ₁, y ₁) and (x ₂, y ₂) given in Equation 6.20. The point of bisection of the chord is ((x ₁ + x ₂)/2, (y ₁ + y ₂)/2). From this it follows that the line through the origin that goes through the point of bisection of the chord has the slope,

$$b = \frac{{{y_1} + {y_2}}}{{{x_1} + {x_2}}} = W{b_1} + (1 - W){b_2},$$

(6.21)

where W is given by Equation 6.18. This shows that the angle bisector is the point-wise weighted average given in Result 3.

One way to generalize the angle bisector to include weights, as in Equation 6.1, is to divide the chord between (x ₁, y ₁) and (x ₂, y ₂) given in Equation 6.20 proportionally to w and 1 – w instead of bisecting it. If we do this, the point on the chord that is w of the way from (x ₂, y ₂) to (x ₁, y ₁) is (wx ₁ + (1 – w)x ₂, wy ₁ + (1 – w)y ₂). It follows that the line through the origin that goes through this w-point on the chord has the slope

$$b = \frac{{w{y_1} + (1 - w){y_2}}}{{w{x_1} + (1 - w){x_2}}} = W{b_1} + (1 - W){b_2},$$

(6.22)

where W is now given by

$$W = \frac{{w{{(1 + b_1^2)}^{ - 1/2}}}}{{w{{(1 + b\,_1^2)}^{ - 1/2}} + (1 - w){{(1 + b\,_2^2)}^{ - 1/2}}}}.$$

(6.23)

Hence, a weighted generalization of the angle bisector of two linear equating functions is given by Equation 6.17, with W specified by Equation 6.23.

This generalization of the angle bisector will divide the angle between e ₁ and e ₂ proportionally to w and 1 – w only when w = ½. Otherwise, this generalization only approximately divides the angle proportionately. In addition, direct calculations show that this generalization of the angle bisector will satisfy all of the properties, Properties 1–7.

However, the angle bisector may be generalized in other ways as well. For example, instead of a circle centered at the point of intersection, suppose we place an L _p-circle there instead. An L _p-circle is defined by Equation 6.24:

$${\left| x \right|^p} + |y{|^p} = {\rm 1},$$

(6.24)

where p > 0. Examples of L _p-circles for various choices of p = 1 and 3 are given in Figures 6.2 and 6.3.

If we now use the chord that connects the intersection points of the two lines with a given L _p-circle, as we did above for the ordinary circle, we find the following generalization of the angle bisector. We form the point-wise weighted average in Equation 6.17, but we use as W the following weight:

$$W = \frac{{w{{(1 + b_1^p)}^{ - 1/p}}}}{{w{{(1 + b\hskip1pt_1^p)}^{ - 1/p}} + (1 - w){{(1 + b\hskip1pt_2^p)}^{ - 1/p}}}},$$

(6.25)

for some p > 0, and 0 < w < 1. It is a simple exercise to show that the use of W from Equation 6.25 as the weight in Equation 6.17 also will satisfy Properties 1–7 for any choice of p > 0, and 0 < w < 1. We will find the case of p = 1 of special interest later. In that case W has the form

$$W = \frac{{w{{(1 + b_1^{})}^{ - 1}}}}{{w{{(1 + b_1^{})}^{ - 1}} + (1 - w){{(1 + b_2^{})}^{ - 1}}}}.$$

(6.26)

Thus, the system of weighted averages (Equation 6.17) with weights that depend on the two slopes, as in Equation 6.25, produces a variety of ways to average linear equating functions that all satisfy Properties 1–7. Thus, the angle bisector is seen to be only one of an infinite family of possibilities. It is worth mentioning here that when w = ½, all of these averages of two linear equating functions using Equation 6.25 have the property of putting more weight on the line with the smaller slope. As a simple example, if b ₁ = 1 and b ₂ = 2, then for w = ½, Equation 6.26 gives the value W = 0.6 for the case of p = 1.

An apparent limitation of all of these circle methods of averaging two linear functions is that they do not immediately generalize to the case of three or more such functions. When there are three functions, they do not necessarily meet at a point; there could be three intersection points. In such a case, the idea of using an L _p-circle centered at the “point of intersection” makes little sense. However, the condition Property 5 gives us a way out of this narrow consideration. Applying it to the point-wise weighted average results obtained so far, it is tedious but straight-forward to show that the multiple function generalization of Equation 6.17 coupled with Equation 6.25 is given by Result 4.

5.1 Result 4

Result 4: If {w _i} are positive and sum to 1.0 and if {e _i} are linear equating functions, then Property 5 requires that the pair-wise averages based on Equations 6.17 and 6.25 lead to

$${w_1}{e_1} \oplus {w_2}{e_2} \oplus {w_3}{e_3} \oplus \ldots = \sum\limits_i {{W_i}{e_i}}$$

(6.27)

where

$${W_i}\frac{{{w_i}{{(1 + b\,_i^p)}^{ - 1/p}}}}{{\sum\limits_j {{w_j}{{(1 + b\,_j^p)}^{ - 1/p}}} }}.$$

(6.28)

Result 4 gives a solution to the problem of averaging several different linear equating functions that is easily applied in practice, once choices for p and w are made.

Holland and Strawderman (1989) introduced the idea of the symmetric weighted average (swave) of two equating functions that satisfies conditions of Properties 1–7 for any pair of linear or nonlinear equating functions. In the next two sections we develop a generalization of the symmetric average.

6 The Geometry of Inverse Functions and Related Matters

To begin, it is useful to illustrate the geometry of a strictly increasing continuous function, y = e(x), and its inverse, x = e ⁻¹(y). First, fix a value of x in the domain of e(·), and let y = e(x). Then the four points, (x, y), (x, x), (y, y) and (y, x), form the four corners of a square in the (x, y) plane, where the length of each side is |x – y|. The two points, (x, x) and (y, y), both lie on the 45-degree line; the other two points lie on opposite sides of the 45-degree line on a line that is at right angles, or orthogonal, to it. In addition, (x, y) and (y, x) are equidistant from the 45-degree line. However, by definition of the inverse function, when y = e(x), it is also the case that x = e ^-1(y). Hence, the four points mentioned above can be re-expressed as (x, e(x)), (x, x), (e(x), e(x)), and (y, e ^-1(y)), respectively.

The points (x, e(x)) and (y, e ^-1(y)) are equidistant from the 45-degree line and on opposite sides of it. Furthermore, the line connecting them is orthogonal to the 45-degree line and is bisected by it. These simple facts are important for the rest of this discussion. For example, from them we immediately can conclude that the graphs of e(·) and e ^-1(·) are reflections of each other about the 45-degree line in the (x, y) plane. This observation is the basis for the swave defined in Section 6.7.

Another simple fact that we will make repeated use of is that a strictly increasing continuous function of x, e(x), crosses any line that is orthogonal to the 45-degree line in exactly one place. This is illustrated in Figure 6.4 for the graphs of two functions. In order to have a shorthand term for lines that are orthogonal to the 45-degree line, we will call them the orthogonal lines when this is unambiguous.

We recall the elementary fact that the equation for what we are calling an orthogonal line is

$$y = - x + c,{\rm or}\,y + x = c,{\rm for\,some\,constant},c.$$

(6.29)

Thus, we have the relationship

$$e\left( {{x_{\rm 1}}} \right){\rm } + {x_{\rm 1}} = c = y + x$$

(6.30)

for any other point, (x, y), that is on the orthogonal line. Equation 6.30 plays an important role in the definition of the swave in Section 6.7. Finally, we note that if e is a strictly increasing continuous function, then its inverse, e ⁻¹, is one as well.

7 The Swave: The Symmetric w-Average of Two Equating Functions

With this preparation, we are ready to define the symmetric w-average or swave of two linear or nonlinear equating functions, e ₁(x) and e ₂(x). Note that from the above discussion, any orthogonal line, of the form given by Equation 6.29, will intersect e ₁(x) at a point, x ₁, and e ₂(x) at another point, x ₂. This is also illustrated in Figure 6.4.

The idea is that the value of the swave, e _w(·), is given by the point on the orthogonal line that corresponds to the weighted average of the two points, (x ₁, e ₁(x ₁)) and (x ₂, e ₂(x ₂)):

$$\left( {\bar x,{{e}_w}\left( {\bar x} \right)} \right) = {\rm w}\left( {{{x}_{1,}}{{e}_1}\left( {{{x}_1}} \right)} \right) + \left( {1 - {\rm w}} \right)\left( {{{x}_2},{{e}_2}\left( {{{x}_2}} \right)} \right).$$

(6.31)

This is illustrated in Figure 6.5.

The point, ($\left( {\bar x,{e_w}\left( {\bar x} \right)} \right)$), is the weighted average of the two points, (x ₁, e ₁(x ₁)) and (x ₂, e ₂(x ₂)). Thus,

$$\bar x = w{x_1} + \left( {1 - w} \right){x_2},$$

(6.32)

and

$${e_w}\left( {\bar x} \right) = w{e_1}({x_1}) + (1 - w){e_2}({x_2}).$$

(6.33)

In Equation 6.31, X is given by Equation 6.32. In order to define e _w(x) for an arbitrary point, x, we start with x and define x ₁ = x – (1 – w)t and x ₂ = x + wt for some, as yet unknown, positive or negative value, t. Note that from the definitions of x ₁ and x ₂, their weighted average, wx ₁ + (1 – w)x ₂, equals x, so the given x can play the role X of Equation 6.32.

Next, we find a value of t such that (x ₁, e ₁(x ₁)) and (x ₂, e ₂(x ₂)) lie on the same orthogonal line, as in Figure 6.5. From Equation 6.30, this condition on t requires that Equation 6.34 is satisfied:

$${e_{\rm 1}}\left( {{x_{\rm 1}}} \right){\rm } + {x_{\rm 1}} = {e_{\rm 2}}\left( {{x_{\rm 2}}} \right){\rm } + {x_{\rm 2}}.$$

(6.34)

Equation 6.34 may be expressed in terms of x and t as

$${e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm } + x-{\rm }\left( {{\rm 1}-w} \right)t = {e_{\rm 2}}\left( {x + wt} \right){\rm } + x + wt$$

or

$$t = {e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm }-{e_{\rm 2}}\left( {x + wt} \right)$$

(6.35)

Equation 6.35 plays an important role in what follows.

In general, for each value of x, Equation 6.35 is a nonlinear equation in t. As we show in the Appendix, for any choice of x and w and for any strictly increasing continuous equating functions, e ₁ and e ₂, Equation 6.35 always has a unique solution for t. The solution of Equation 6.35 for t implicitly defines t as a function of x, which we denote by t(x). Once t(x) is in hand, the value of the swave at x, e _w(x), is computed from the expression,

$${e_w}\left( x \right){\rm } = w{e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t\left( x \right)} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {x + wt\left( x \right)} \right)$$

(6.36)

The definition of e _w in Equation 6.36 is an example of the operator ⊕ in Equation 6.1. In Equation 6.36, there is a clear sense in which the weight w is applied to e ₁ and 1 – w is applied to e ₂. We show later that the swave differs from the point-wise weighted average in Equation 6.9, except when the two equating functions are parallel, as discussed above. Moreover, the definition of the swave is a process that requires the whole functions, e ₁ and e ₂, rather than just their evaluation at the selected x-value. In the Appendix we show that the solution for t in Equations 6.35 and 6.36 is unique.

In the Appendix we show that the swave satisfies conditions in Properties 2 and 4. We discuss below the application of the swave to linear equating functions and show that it satisfies Property 6. That the swave satisfies Property 7, the symmetry property, is given in Result 5, next.

7.1 Result 5

Result 5: The swave, e _w(x), defined by Equations 6.35 and 6.36, satisfies the symmetry property, Property 7.

Proof

Suppose we start with the inverse functions, $e_1^{ - 1}$ and $e_2^{ - 1}$, and form their swave, denoted e*_w(y), for a given y-value. Then Equations 6.35 and 6.36 imply that for any choice of y there is a value t* that satisfies

$$t^* = e_1^{ - 1}\left( {y - \left( {1 - w} \right)t^*} \right) - e_2^{ - 1}\left( {x + wt^*} \right)$$

(6.37)

and

$$e{*_w}\left( y \right) = we_1^{ - 1}\left( {y - \left( {1 - w} \right)t*} \right) + \left( {1 - w} \right)e_2^{ - 1}\left( {y + wt*} \right).$$

(6.38)

Now let

$${y_{\rm 1}} = y-{\rm }\left( {{\rm 1}-w} \right)t^*,{\rm and}\,{y_{\rm 2}} = y + wt^*.$$

Also, define x, x ₁, and x ₂ by

$$x = e{*_w}\left( y \right),{x_1} = e_1^{ - 1}\left( {{y_1}} \right),\,{\rm and}\,{x_2} = e_2^{ - 1}\left( {{y_2}} \right).$$

(6.39)

Hence, by definition of the inverse,

$${y_{\rm 1}} = {e_{\rm 1}}\left( {{x_{\rm 1}}} \right),{y_{\rm 2}} = {e_{\rm 2}}\left( {{x_{\rm 2}}} \right),{\rm and}\,y = e{*_w}^{ - {\rm 1}}\left( x \right).$$

(6.40)

From the definition of the swave, the following three points are all on the same orthogonal line:

$$\left( {y,e{*_w}\left( y \right)} \right),{\rm }({y_{\rm 1}}, e^{-1}_{1} \left( {{y_{\rm 1}}} \right)),{\rm and}({y_{\rm 2}}, e^{-1}_{2} \left( {{y_{\rm 2}}} \right)).$$

However, using the relationships in Equations 6.38 and 6.39, these three points are the same as the following three points, which are also on that orthogonal line:

$$\left( {e{*_w}^{ - 1}\left( x \right),x} \right),\left( {{e_1}\left( {{x_1}} \right),{x_1}} \right),\,{\rm and}\,\left( {{{\rm e}_{\rm 2}}\left( {{x_2}} \right),{x_2}} \right).$$

Furthermore, the following three points are on that same orthogonal line:

$$\left( {x,e{*_w}^{ - {\rm 1}}\left( x \right)} \right),{\rm }\left( {{x_{\rm 1}},{e_{\rm 1}}\left( {{x_{\rm 1}}} \right)} \right),{\rm and}\left( {{x_{\rm 2}},{e_{\rm 2}}\left( {{x_{\rm 1}}} \right)} \right).$$

Yet, from Equation 6.38 it follows that

$$x = w{x_{\rm 1}} + {\rm }\left( {{\rm 1}-w} \right){x_{\rm 2}},$$

(6.41)

so we let t = x ₂ – x ₁, and therefore, x ₁ = x – (1 – w)t, and x ₂ = x + wt.

Furthermore, from the definitions of y ₁ and y ₂, we have

$$y = w{y_{\rm 1}} + {\rm }\left( {{\rm 1}-w} \right){y_{\rm 2}}$$

and therefore

$$y = w{e_{\rm 1}}\left( {{x_{\rm 1}}} \right) + \left( {{\rm 1} - w} \right){e_{\rm 2}}\left( {{x_2}} \right),$$

so that

$$y = e{*_w}^{ - {\rm 1}}\left( x \right){\rm } = w{e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {x + wt} \right).$$

(6.42)

Thus, the inverse function, e*_w ⁻¹, satisfies Equation 6.36 for the swave. The only question remaining is whether the value of t in Equation 6.42 satisfies Equation 6.35.

However, the points (x ₁, e ₁(x ₁)) and (x ₂, e ₂(x ₂)) are on the same orthogonal line. Therefore, they satisfy Equation 6.34, from which Equation 6.35 for t follows.

This shows that the inverse function, e*_w ⁻¹, satisfies the condition of the symmetric w-average, e _w, so that from the uniqueness of the solution to Equation 6.35 we have e*_w ⁻¹ = e _w, which proves Result 5. From the definitions of y ₁, y ₂, x ₁, and x ₂ in the proof of Result 5, it is easy to see that the t* that solves Equation 6.37 for the inverse functions and the t that solves Equation 6.35 for the original functions are related by t* = x ₁ – x ₂ = – t, so that t and t* have the same magnitude but the opposite sign.

8 The Swave of Two Linear Equating Functions

In this section we examine the form of the swave in the linear case. The equation for t, Equation 6.35, now becomes linear in t and can be solved explicitly. So assume that

$${e_{\rm 1}}\left( x \right){\rm } = {a_{\rm 1}} + {b_{\rm 1}}x,{\rm and}\,{e_{\rm 2}}\left( x \right){\rm } = {a_{\rm 2}} + {b_{\rm 2}}x.$$

(6.43)

Then, Equation 6.35 is

$$t = {a_{\rm 1}} + {\rm }{b_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)t} \right){\rm }-{a_{\rm 2}}-{b_{\rm 2}}\left( {x + wt} \right).$$

Hence,

$$t\left( {{\rm 1} + {\rm }\left( {{\rm 1}-w} \right){b_{\rm 1}} + w{b_{\rm 2}}} \right){\rm } = {\rm }\left( {{a_{\rm 1}}-{a_{\rm 2}}} \right){\rm } + {\rm }\left( {{b_{\rm 1}}-{b_{\rm 2}}} \right)x$$

so that

$$t(x) = \frac{{({a_1} - {a_2}) + ({b_1} - {b_2})x}}{{1 + (1 - w){b_1} + w{b_2}}}.$$

(6.44)

Substituting the value of t(x) from Equation 6.44 into the equation for e _w(x) in Equation 6.36 results in

$${e_w}(x) = \bar a + \bar bx - w(1 - w)({b_1} - {b_2})\frac{{{a_1} - {a_2} + ({b_1} - {b_2})x}}{{1 + (1 - w){b_1} + w{b_2}}},$$

(6.45)

where $\bar a = w{a_1} + (1 - w){a_2}$ and $\bar b = w{b_1} + (1 - w){b_2}$ denote the weighted averages of the intercepts and slopes of e ₁ and e ₂, respectively.

From Equation 6.45 we immediately see that, in the linear case, the swave, e _w(x), is identical to the point-wise weighted average in Equation 6.9 if and only if the two slopes, b ₁ and b ₂, are identical, and the two linear functions are parallel. Simplifying Equation 6.45 further we obtain

$${e_w}\left( x \right){\rm } = W{e_{\rm 1}}\left( x \right){\rm } + {\rm }\left( {{\rm 1}-W} \right){e_{\rm 2}}\left( x \right),$$

(6.46)

where

$$W = \frac{{w{{(1 + {b_1})}^{ - 1}}}}{{w{{(1 + {b_1})}^{ - 1}} + (1 - w){{(1 + {b_2})}^{ - 1}}}}.$$

(6.47)

Thus, in the linear case, the swave is exactly the point-wise weighted average that arises for an L _p-circle with p = 1, in other words, Equation 6.26, discussed in Section 6.5. From Result 5, we know that the swave always satisfies the symmetry condition, Property 7, but this is also easily shown directly. We see that, in the linear case, the swave also satisfies Property 6.

Author information

Authors and Affiliations

Paul Holland Consulting Corporation, 200 4th Ave South, Apt 100, St Petersburg, FL, 33701, USA
Paul W. Holland
Department of Statistics, Rutgers University, 110 Frelinghuysen Rd., Room 561 Hill Center Building for the Mathematical Sciences, Busch Campus, Piscataway, NJ, N08854, USA
William E. Strawderman

Authors

Paul W. Holland
View author publications
You can also search for this author in PubMed Google Scholar
William E. Strawderman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Paul W. Holland .

Editor information

Editors and Affiliations

Educational Testing Service, Rosedale Road MS 06 P, Princeton, 08541, New Jersey, USA
Alina A. von Davier

Chapter 6 Appendix

1.1 A.1 Computing the Swave for Two Equating Functions

The key to computing e _w is Equation 6.35. This equation for t(x) is nonlinear in general, so computing t(x) requires numerical methods. A derivative-free approach that is useful in this situation is Brent’s method. To use this method to solve Equation 6.35 for t we first define g(t) as follows:

$$g\left( t \right){\rm } = t-{e_{\rm 1}}\left( {x-\left( {{\rm 1}-w} \right)t} \right) + {e_{\rm 2}}\left( {x + w\,t} \right).$$

(6.A.1)

If t ₀ solves Equation 6.35, then t ₀ is a zero of g(t) in Equation 6.A.1. Brent’s method is a way of finding the zeros of functions. It requires that two values of t are known, one for which g(t) is positive and one for which g(t) is negative. Theorem 1 summarizes several useful facts about g(t) and provides the two needed values of t for use in Brent’s method.

Theorem 1

If e₁ and e₂ are strictly increasing continuous functions, then g(t) defined in Equation 6.A.1 is a strictly increasing continuous function that has a unique zero at t ₀. Furthermore, t ₀ is positive if and only if e₁(x) – e₂(x) is positive. Consequently, if e₁(x) – e₂(x) is positive, then g(0) is negative and g(e₁(x) – e₂(x)) is positive; furthermore, if e₁(x) – e₂(x) is negative, then g(0) is positive and g(e₁(x) – e₂(x)) is negative.

Proof

The functions t, – e ₁(x – (1 – w) t), and e ₂(x + w t) are all strictly increasing continuous functions of t so that their sum, g(t), is also a strictly increasing continuous function of t. Hence, if g(t) has a zero at t ₀, this is its only zero. In order to show that g(t) does have a zero at some t ₀ it suffices to show that, for large enough t, g(t) > 0 and, for small enough t, g(t) < 0. But if t > 0, it follows from the strictly increasing (in t) nature of – e ₁(x – (1 – w) t) and of e ₂(x + w t) that

$$g\left( t \right){\rm } \,>\, t-{\rm }\left[ {{e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right)} \right].$$

(6.A.2)

The right side of Equation 6.A.2 is greater than 0 if t is larger than e ₁(x) – e ₂(x). Similarly, if t < 0, it also follows that

$$g\left( t \right){\rm } \,<\, t-{\rm }\left[ {{e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right)} \right].$$

(6.A.3)

The right side of Equation 6.A.3 is less than 0 if t is less than e ₁(x) – e ₂(x). Hence, these two inequalities show that g(t) always has a single zero at a value we denote by t ₀.

Now, suppose that t ₀ > 0. Then g(t ₀) = 0 by definition so that

$$0{\rm } \,<\, {t_0} = {e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)\,{t_0}} \right){\rm }-{e_{\rm 2}}\left( {x + w\,{t_0}} \right)$$

(6.A.4)

But by the strict monotonicity of e ₁ and e ₂, we have

$${e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)\,{t_0}} \right){\rm } <\, {e_{\rm 1}}\left( x \right),{\rm and}-{e_{\rm 2}}\left( {x + w{\,t_0}} \right){\rm } < {\rm }-{e_{\rm 2}}\left( x \right)$$

so that

$${e_{\rm 1}}\left( {x-{\rm }\left( {{\rm 1}-w} \right)\,{t_0}} \right){\rm }-{e_{\rm 2}}\left( {x + w{t_0}} \right)\,{\rm } < \,{e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right).$$

(6.A.5)

Combining Equations 6.A.4 and 6.A.5 shows that if t ₀ > 0, then e ₁(x) – e ₂(x) > 0.

A similar argument shows that if t ₀ < 0, then e ₁(x) – e ₂(x) < 0. Hence t ₀ is positive if and only if e ₁(x) – e ₂(x) is positive. Note that we can always compute e ₁(x) – e ₂(x) because it is assumed that these functions are given to us. Thus, from the relative sizes of e ₁(x) and e ₂(x) we can determine the sign of the zero, t ₀.

Because g(t) is strictly increasing we have the following additional result. If e ₁(x) – e ₂(x) is positive, then t ₀ is also positive and therefore g(0) is negative. Also, if e ₁(x) – e ₂(x) is negative, then t ₀ is also negative and therefore g(0) is positive.

Now suppose again that e ₁(x) – e ₂(x) is positive so that t ₀ is also positive. However, from Equation 6.19, for any positive t, g(t) > t – [e ₁(x) – e ₂(x)], so let t = t ₀. Hence,

$$0{\rm } = g\left( {{t_0}} \right){\rm } \,>\, {t_0}-{\rm }\left[ {{e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right)} \right],$$

(6.A.6)

so that

$$0{\rm } \,<\, {t_0} \,<\, {e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right).$$

(6.A.7)

Hence, g(e ₁(x) – e ₂(x)) is positive as well. Thus, whenever e ₁(x) – e ₂(x) is positive, then g(0) is negative and g(e ₁(x) – e ₂(x)) is positive. When e ₁(x) – e ₂(x) is negative, a similar argument shows that

$${e_{\rm 1}}\left( x \right){\rm }-{e_{\rm 2}}\left( x \right){\rm } \,<\, {t_0} \,<\, {\rm }0.$$

(6.A.8)

Hence g(e ₁(x) – e ₂(x)) is negative. This finishes the proof of Theorem 1.

1.2 A.2 Properties of the Swave

Theorem 2

The swave, e_w(x), satisfies Property 2 and lies strictly between e₁(x) and e₂(x), for all x.

Proof

Consider the case when e ₁(x) > e ₂(x) (the reverse case is proved in a similar way). We wish to show that e ₁(x) > e _w(x) > e ₂(x). Because e ₁(x) > e ₂(x), from Theorem 1 it follows that t(x) > 0 as well. From the strictly increasing natures of e ₁ and e ₂, it follows that

$${e_{\rm 1}}\left( {{{x}_{\rm 1}}} \right) \,<\, {e_{\rm 1}}\left( {x} \right),{\rm and}\,{e_{\rm 2}}\left( {{x_2}} \right) \,>\, {e_2}(x).$$

We wish to show that e ₁(x) > e _w(x) > e ₂(x), so consider first the upper bound. By definition,

$${e_w}\left( x \right){\rm } = w{e_{\rm 1}}\left( {{{x}_{\rm 1}}} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {{x_{\rm 2}}} \right)\,{\rm } <\, w{e_{\rm 1}}\left( {x} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 2}}\left( {{x_{\rm 2}}} \right).$$

However,

$$0{\rm } \,<\, {\rm }t\left( x \right){\rm } = {\rm }e_1\left( {x_1} \right){\rm }-{\rm }e_2\left( {x_2} \right),{\rm so\ that\ }e_2\left( {x_2} \right){\rm } <\, {\rm }e_1\left( {x_1} \right){\rm } <\, {\rm }e_1\left( x \right).$$

Combining these results give us

$${e_w}\left( x \right){\rm } \,<\, {w}{{e}_{\rm 1}}\left( {\rm x} \right){\rm } + {\rm }\left( {{\rm 1}-w} \right){e_{\rm 1}}\left( x \right){\rm } = {e_{\rm 1}}\left( x \right),$$

the result we wanted to prove. The lower bound is found in an analogous manner.

Theorem 3

The swave is strictly increasing if e₁ and e₂ are.

Facts: e(x) monotone implies c(x) = x + e(x) is strictly monotone (since it is a sum of a monotone and a strictly monotone function). Also, c(x*) > c(x) implies x* > x and e(x*) > e(x).

Let c _i(x) = x + e _i(x), i = 1, 2. Also let e _w(x) = we ₁(x ₁) + (1 – w)e ₂(x ₂), where x = w x ₁ + (1 – w)x ₂ and c ₁(x ₁) = c ₂(x ₂), i.e., x ₁ + e ₁(x ₁) = x ₂ + e ₂(x ₂) so that (x _i, e _i(x _i)) are on same orthogonal line.

Assume e _i(x) are both monotone increasing. Now suppose x* > x where x = wx ₁ + (1 – w)x ₂ and x* = wx ₁* + (1 – w)x ₂* and suppose further that c ₁(x ₁) = c ₂(x ₂) and that c ₁(x ₁*) = c ₂(x ₂*). Then, (x _i, e _i(x _i)) are both on the same orthogonal line and (x _i*, e _i(x _i*)) are too (but possibly a different line). We want to conclude that x ₁* > x ₁ and x ₂* > x ₂. This will allow us to conclude that e _i(x _i*) > e _i(x _i) and hence that e _w(x*) > e _w(x), thereby proving the monotonicity of e _w.

Proof

Assume to the contrary that x ₁* ≤ x ₁. Then c ₂(x ₂*) = c ₁(x ₁*) ≤ c ₁(x ₁) = c ₂(x ₂), so that x ₂* ≤ x ₂. This in turn implies that x* = wx ₁* + (1 – w)x ₂* ≤ wx ₁ + (1 – w)x ₂ = x, or x* ≤ x, contradicting the assumption that x* > x. A similar argument shows that x ₂* > x ₂. Hence, e _i(x _i*) > e _i(x _i) and e _w(x*) > e _w(x), thereby proving the monotonicity of e _w.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Holland, P.W., Strawderman, W.E. (2009). How to Average Equating Functions, If You Must. In: von Davier, A. (eds) Statistical Models for Test Equating, Scaling, and Linking. Statistics for Social and Behavioral Sciences. Springer, New York, NY. https://doi.org/10.1007/978-0-387-98138-3_6

Download citation

DOI: https://doi.org/10.1007/978-0-387-98138-3_6
Published: 15 September 2010
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-98137-6
Online ISBN: 978-0-387-98138-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

How to Average Equating Functions, If You Must

Abstract

Similar content being viewed by others

The Extended Anderson and Hauck Tests and Sample Size Procedures for Equivalence Assessment in Simple Linear Regressions

Adventitious Error and Its Implications for Testing Relations Between Variables and for Composite Measurement Outcomes

Optimizing Detection of True Within-Person Effects for Intensive Measurement Designs: A Comparison of Multilevel SEM and Unit-Weighted Scale Scores

1 Introduction and Notation

2 Some Desirable Properties of Averages of Equating Functions

2.1 Property 1

2.2 Property 2

2.3 Property 3

2.4 Property 4

2.5 Property 5

2.6 Property 6

2.7 Property 7

3 The Point-Wise Weighted Average

3.1 Result 1

3.2 Result 2

4 The Angle Bisector Method of Averaging Two Linear Functions

4.1 Result 3: Computation of the Unweighted Angle Bisector

5 Some Generalizations of the Angle Bisector Method

5.1 Result 4

6 The Geometry of Inverse Functions and Related Matters

7 The Swave: The Symmetric w-Average of Two Equating Functions

7.1 Result 5

Proof

8 The Swave of Two Linear Equating Functions

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Chapter 6 Appendix

Chapter 6 Appendix

1.1 A.1 Computing the Swave for Two Equating Functions

Theorem 1

Proof

1.2 A.2 Properties of the Swave

Theorem 2

Proof

Theorem 3

Proof

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation