Chapter 7 of The Measurement of Association applies exact and Monte Carlo permutation statistical methods to measures of association designed for two or more interval-level variables. While permutation statistical methods are commonly associated with non-parametric statistics and, therefore, thought by many to be limited to nominal- and ordinal-level measurements, such is certainly not the case, as noted by Feinstein in 1973 [12]. In fact, a great strength of exact and Monte Carlo permutation statistical methods is in the analysis of interval-level measurements [6]. Chapter 7 begins with a discussion and comparison of simple and multiple ordinary least squares (OLS) regression and simple and multiple least absolute deviation (LAD) regression using permutation statistical methods. Multiple regression with multiple independent variables and multivariate dependent variables is described and illustrated. Point-biserial and biserial correlation coefficients are described and analyzed with exact and Monte Carlo permutation methods. Fisher’s z transform is examined and evaluated as to its utility in transforming skewed distributions for both hypothesis testing and confidence intervals. Chapter 7 concludes with a discussion of permutation statistical methods applied to Pearson’s intraclass correlation coefficient.

7.1 Ordinary Least Squares (OLS) Linear Regression

Ordinary least squares (OLS) regression with a single predictor is a popular statistical measure of the degree of association (correlation) between two interval-level variables, usually denoted as x and y. The assumption of normality comes into play when the null hypothesis is tested by conventional means. Permutation statistical methods do not assume normality and, therefore, are often more useful than conventional statistical methods, especially when the sample size is small. Let r xy denote the Pearson product-moment correlation coefficient for variables x and y given by

$$\displaystyle \begin{aligned} r_{xy} = \frac{\displaystyle\sum_{i=1}^{N}(x_{i}-\bar{x})(y_{i}-\bar{y})}{\sqrt{\left[ \displaystyle\sum_{i=1}^{N}(x_{i}-\bar{x})^{2} \right] \left[ \displaystyle\sum_{i=1}^{N}(y_{i}-\bar{y})^{2} \right]}}\;, \end{aligned}$$

where \(\bar {x}\) and \(\bar {y}\) denote the arithmetic means of variables x and y, respectively, and N is the number of bivariate measurements. The conventional test of significance is given by

$$\displaystyle \begin{aligned} t = \frac{r_{xy} \sqrt{N-2}}{\sqrt{1-r_{xy}^{2}}}\;, \end{aligned}$$

which is distributed as Student’s t with N − 2 degrees of freedom, under the assumption of normality.

More useful than simple OLS regression and correlation is multiple OLS regression with p predictors, x 1, x 2, …, x p. Let \(R_{y.x_{1},\,x_{2},\,\ldots ,\,x_{p}}\) indicate the multiple correlation coefficient for variables y and x 1, x 2, …, x p given by

$$\displaystyle \begin{aligned} R_{y.x_{1},\,x_{2},\,\ldots,\,x_{p}}^{2} = \boldsymbol{\beta}^{\prime}{\mathbf{r}}_{y}\;, \end{aligned}$$

where β′ is the transpose of the vector of standardized regression weights and r y is the vector of zero-order correlation coefficients of y with x 1, x 2, …, x p. The conventional test of significance is given by

$$\displaystyle \begin{aligned} F = \frac{(N-p-1)R_{y.x_{1},\,x_{2},\,\ldots,\,x_{p}}^{2}}{p(1-R_{y.x_{1},\,x_{2},\,\ldots,\,x_{p}}^{2})}\;, \end{aligned}$$

which is distributed as Snedecor’s F with p and N − p − 1 degrees of freedom, under the assumption of normality.

7.1.1 Univariate Example of OLS Regression

Consider the example set of bivariate data listed in Table 7.1 for N = 11 subjects. For the bivariate data listed in Table 7.1, the Pearson product-moment correlation coefficient is r xy = +0.8509. An exact permutation analysis requires random shuffles of either the x or the y values with the other set of values held constant. For this small example there are

$$\displaystyle \begin{aligned} M = N! = 11! = 39{,}916{,}800 \end{aligned}$$

possible, equally-likely arrangements in the reference set of all permutations of the observed bivariate data, making an exact permutation analysis impractical. Monte Carlo resampling methods are generally preferred for permutation correlation analyses since N! is usually a very large number, e.g., with N = 13 there are 13! = 6, 227, 020, 800 possible arrangements. Let r o indicate the observed value of r xy. Then, based on L = 1, 000, 000 random arrangements of the observed data under the null hypothesis, there are 861 |r xy| values equal to or greater than |r o| = 0.8509, yielding a Monte Carlo resampling two-sided probability value of P = 861∕1, 000, 000 = 0.8610×10^−3.

Table 7.1 Example bivariate OLS correlation data on N = 11 subjects

While M = 39, 916, 800 possible arrangements of the observed data makes an exact permutation analysis impractical, it is not impossible. Based on the M = 39, 916, 800 arrangements of the observed data under the null hypothesis, there are 35,216 |r xy| values equal to or greater than |r o| = 0.8509, yielding an exact two-sided probability value of P = 35, 216∕39, 916, 800 = 0.8822×10^−3. For comparison, for the data listed in Table 7.1, t = 4.8591 and the two-sided probability value of |r o| = 0.8509 based on Student’s t distribution with N − 2 = 11 − 2 = 9 degrees of freedom is P = 0.8969×10^−3.
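
The resampling procedure just described is straightforward to program. The following Python sketch is a minimal illustration, not the software used for the analyses in this chapter: it computes the Pearson correlation, the conventional Student’s t statistic, and a two-sided Monte Carlo resampling probability value by repeatedly shuffling the y values while the x values are held constant. The data vectors are arbitrary placeholders, since the Table 7.1 values are not reproduced here.

```python
import numpy as np

def pearson_r(x, y):
    """Pearson product-moment correlation coefficient of two vectors."""
    xd, yd = x - x.mean(), y - y.mean()
    return (xd @ yd) / np.sqrt((xd @ xd) * (yd @ yd))

def monte_carlo_r_test(x, y, L=100_000, seed=1):
    """Two-sided Monte Carlo permutation test: shuffle y, hold x constant."""
    rng = np.random.default_rng(seed)
    r_obs = pearson_r(x, y)
    y_perm = y.copy()
    count = 0
    for _ in range(L):
        rng.shuffle(y_perm)
        if abs(pearson_r(x, y_perm)) >= abs(r_obs):
            count += 1
    return r_obs, count / L

# Placeholder bivariate data for N = 11 subjects (not the Table 7.1 values).
x = np.array([1.0, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11])
y = np.array([2.0, 1, 4, 3, 6, 5, 8, 7, 10, 9, 12])

r_obs, p_mc = monte_carlo_r_test(x, y)
t = r_obs * np.sqrt(len(x) - 2) / np.sqrt(1 - r_obs**2)  # conventional t statistic
print(f"r = {r_obs:.4f}, t = {t:.4f}, two-sided Monte Carlo P = {p_mc:.5f}")
```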

7.1.2 Multivariate Example of OLS Regression

For a multivariate example of OLS linear regression, consider the small example data set with p = 2 predictors listed in Table 7.2 where variable y is Weight in pounds, variable x 1 is Height in inches, and variable x 2 is Age in years for N = 12 school children. For the multivariate data listed in Table 7.2, the unstandardized OLS regression coefficients are

$$\displaystyle \begin{aligned} \hat{\beta}_{1} = +1.1973 \quad \mbox{and} \quad \hat{\beta}_{2} = +1.1709\;, \end{aligned}$$

and the squared OLS multiple correlation coefficient is \(R_{y.x_{1},\,x_{2}}^{2} = 0.7301\) (henceforth, simply R 2). An exact permutation analysis of multiple correlation requires random shuffles of either the x or the y values. It is important to note that the predictor variables must be shuffled as a unit, i.e., x 1, …, x p. Otherwise, a researcher may end up with a combination of predictor variables that make no sense, e.g., 4-year-old child, married, with two children. Thus, it is advisable to simply shuffle the y values. Even with this very small example there are

$$\displaystyle \begin{aligned} M = N! = 12! = 479{,}001{,}600 \end{aligned}$$

possible, equally-likely arrangements of the observed data, making an exact permutation analysis impractical. Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability of R 2 = 0.7301 is

$$\displaystyle \begin{aligned} P \big( R^{2} \geq R_{\text{o}}^{2}|H_{0} \big) = \frac{\text{number of }R^{2}\text{ values } \geq R_{\text{o}}^{2}}{L}\;, \end{aligned}$$

where \(R_{\text{o}}^{2}\) denotes the observed value of R 2.

Table 7.2 Example multivariate OLS correlation data on N = 12 children

While M = 479, 001, 600 possible arrangements makes an exact permutation analysis impractical, it is not impossible. If the reference set of all possible permutations of the observed scores in Table 7.2 occur with equal chance, the exact probability of R 2 = 0.7301 under the null hypothesis is

$$\displaystyle \begin{aligned} P \big( R^{2} \geq R_{\text{o}}^{2}|H_{0} \big) = \frac{\text{number of }R^{2}\text{ values } \geq R_{\text{o}}^{2}}{M}\;, \end{aligned}$$

where \(R^{2}_{\text{o}}\) denotes the observed value of R 2. For comparison, for the data listed in Table 7.2, F = 12.1728 and the probability value of R 2 = 0.7301 based on Snedecor’s F distribution with p, N − p − 1 = 2, 12 − 2 − 1 = 2, 9 degrees of freedom is approximately P = 0.2757×10^−2, under the null hypothesis.
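
The same resampling logic applies to multiple correlation, with the y values shuffled so that the rows of predictor values remain intact, as recommended above. The Python sketch below illustrates the idea under those assumptions; it is not the authors’ code, the squared multiple correlation is obtained from an ordinary least squares fit with an intercept, and the data are randomly generated placeholders rather than the Table 7.2 values.

```python
import numpy as np

def r_squared(X, y):
    """Squared multiple correlation coefficient from an OLS fit with intercept."""
    Xd = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(Xd, y, rcond=None)
    resid = y - Xd @ beta
    return 1.0 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))

def monte_carlo_R2_test(X, y, L=100_000, seed=1):
    """Shuffle y (predictor rows kept intact) and count R^2 values >= observed."""
    rng = np.random.default_rng(seed)
    R2_obs = r_squared(X, y)
    y_perm = y.copy()
    count = 0
    for _ in range(L):
        rng.shuffle(y_perm)
        if r_squared(X, y_perm) >= R2_obs:
            count += 1
    return R2_obs, count / L

# Randomly generated placeholder data: N = 12 cases, p = 2 predictors.
rng = np.random.default_rng(0)
X = rng.normal(size=(12, 2))
y = X @ np.array([1.2, 1.1]) + rng.normal(size=12)

R2_obs, p_mc = monte_carlo_R2_test(X, y, L=20_000)
print(f"R^2 = {R2_obs:.4f}, Monte Carlo P = {p_mc:.5f}")
```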

7.2 Least Absolute Deviation (LAD) Regression

Ordinary least squares (OLS) linear regression has long been recognized as a useful tool in many areas of research. The optimal properties of OLS linear regression are well known when the errors are normally distributed. In practice, however, the assumption of normality is rarely justified. Least absolute deviation (LAD) linear regression is often superior to OLS linear regression when the errors are not normally distributed [8, 9, 29, 44, 55]. Estimators of OLS regression parameters can be severely affected by unusual values in either the criterion variable or in one or more of the predictor variables, largely because of the weight given to each data point when minimizing the sum of squared errors. In contrast, LAD regression is less sensitive to the effects of unusual values because the errors are not squared. The comparison between OLS and LAD linear regression is analogous to the effect of extreme values on the mean and median as measures of location [8]. In this section, the robust nature of LAD linear regression is illustrated with a simple example and the effects of distance, leverage, and influence are examined. For clarity and efficiency, the illustration and ensuing discussion are limited to simple linear regression with one predictor variable (x) and one criterion variable (y), with no loss of generality.

Consider N paired x i and y i observed values for i = 1, …, N. For the OLS regression equation given by

$$\displaystyle \begin{aligned} \hat{y}_{i} = \hat{\alpha}_{yx}+\hat{\beta}_{yx} x_{i}\;, \end{aligned}$$

where \(\hat {y}_{i}\) is the ith of N predicted criterion values and x i is the ith of N predictor values, \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) are the OLS parameter estimates of the intercept (α yx) and slope (β yx), respectively, and are given by

$$\displaystyle \begin{aligned} \hat{\beta}_{yx} = \frac{\displaystyle\sum_{i=1}^{N}\big(y_{i}-\bar{y}\big)\big(x_{i}-\bar{x}\big)}{\displaystyle\sum_{i=1}^{N}\big(x_{i}-\bar{x}\big)^{2}} \end{aligned} $$
(7.1)

and

$$\displaystyle \begin{aligned} \hat{\alpha}_{yx} = \bar{y}-\hat{\beta}_{yx}\bar{x}\;, \end{aligned} $$
(7.2)

where \(\bar {x}\) and \(\bar {y}\) are the sample means of variables x and y, respectively. Estimates of OLS regression parameters minimize the sum of the squared differences between the observed and predicted criterion values, i.e.,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N}\big( y_{i}-\hat{y}_{i} \big)^{2}\;. \end{aligned}$$

For the LAD regression equation given by

$$\displaystyle \begin{aligned} \tilde{y}_{i} = \tilde{\alpha}_{yx}+\tilde{\beta}_{yx} x_{i}\;, \end{aligned}$$

where \(\tilde {y}_{i}\) is the ith of N predicted criterion values and x i is the ith of N predictor values, \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) are the LAD parameter estimates of the intercept (α yx) and slope (β yx), respectively. Unlike OLS regression, no simple closed-form expressions analogous to Eqs. (7.1) and (7.2) can be given for \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\). However, values for \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) may be found with an efficient linear programming algorithm, such as that of Barrodale and Roberts [1, 2]. In contrast to estimates of OLS regression parameters, estimates of LAD regression parameters minimize the sum of the absolute differences between the observed and predicted criterion values, i.e.,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N} \big| y_{i}-\tilde{y}_{i} \big|\;. \end{aligned}$$
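
Because no closed-form expressions exist, the LAD estimates are obtained numerically. The sketch below illustrates one standard route: posing the LAD problem as a linear program in which auxiliary variables bound the absolute residuals, solved here with scipy.optimize.linprog. It is a minimal stand-in for, not an implementation of, the Barrodale–Roberts algorithm cited above, and the function name lad_fit is illustrative.

```python
import numpy as np
from scipy.optimize import linprog

def lad_fit(x, y):
    """Simple LAD regression as a linear program.

    Minimize sum_i u_i subject to u_i >= |y_i - (a + b*x_i)|, written as two
    linear inequalities per observation.  Decision vector: [a, b, u_1, ..., u_N].
    """
    N = len(y)
    c = np.concatenate([[0.0, 0.0], np.ones(N)])
    A_ub = np.zeros((2 * N, N + 2))
    b_ub = np.zeros(2 * N)
    for i in range(N):
        # y_i - a - b*x_i <= u_i   ->  -a - b*x_i - u_i <= -y_i
        A_ub[2 * i, [0, 1, 2 + i]] = [-1.0, -x[i], -1.0]
        b_ub[2 * i] = -y[i]
        # a + b*x_i - y_i <= u_i   ->   a + b*x_i - u_i <=  y_i
        A_ub[2 * i + 1, [0, 1, 2 + i]] = [1.0, x[i], -1.0]
        b_ub[2 * i + 1] = y[i]
    bounds = [(None, None), (None, None)] + [(0, None)] * N
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, bounds=bounds, method="highs")
    return res.x[0], res.x[1]          # intercept and slope estimates

# Nine points with a perfect negative relationship: y_i = 10 - x_i.
x = np.arange(1.0, 10.0)
y = 10.0 - x
print(lad_fit(x, y))                   # approximately (10.0, -1.0)
```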

It is convenient to have a measure of agreement, not correlation, between the observed and predicted y values. Let

$$\displaystyle \begin{aligned} \delta = \frac{1}{N} \sum_{i=1}^{N}\big| y_{i}-\tilde{y}_{i} \big|\;. \end{aligned}$$

Then, the expected value of δ is given by

$$\displaystyle \begin{aligned} \mu_{\delta} = \frac{1}{N^{2}} \sum_{i=1}^{N}\,\sum_{j=1}^{N}\big| y_{i}-\tilde{y}_{j} \big|\;, \end{aligned}$$

and a measure of agreement between the observed y values and the predicted \(\tilde {y}\) values is given by

$$\displaystyle \begin{aligned} \mathfrak{R} = 1-\frac{\delta}{\mu_{\delta}}\;. \end{aligned}$$

\(\mathfrak {R}\) is a chance-corrected measure of agreement and/or effect size, reflecting the amount of agreement in excess of what would be expected by chance. \(\mathfrak {R}\) attains a maximum value of unity when the agreement between the observed y values and the predicted \(\tilde {y}\) values is perfect, i.e., y i and \(\tilde {y}_{i}\) values are identical for i = 1, …, N. \(\mathfrak {R}\) is zero when the agreement between the observed y values and predicted \(\tilde {y}\) values is equal to what is expected by chance, i.e., \(\mathrm {E}[\mathfrak {R}|H_{0}] = 0\). Like all chance-corrected measures, \(\mathfrak {R}\) will occasionally be slightly negative when agreement is less than what is expected by chance.
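
Given a vector of observed y values and the corresponding LAD-predicted values, the quantities δ, μ δ, and the chance-corrected measure of agreement require only a few lines of code. The sketch below is illustrative only; the function name is not from the text.

```python
import numpy as np

def chance_corrected_agreement(y, y_pred):
    """Return delta, mu_delta, and the chance-corrected agreement measure."""
    y = np.asarray(y, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    delta = np.mean(np.abs(y - y_pred))                       # observed mean |error|
    mu_delta = np.mean(np.abs(y[:, None] - y_pred[None, :]))  # average over all N^2 pairs
    return delta, mu_delta, 1.0 - delta / mu_delta

# Perfect agreement yields a value of 1; chance-level agreement hovers near 0.
print(chance_corrected_agreement([1, 2, 3, 4], [1, 2, 3, 4]))
```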

7.2.1 Illustration of Effects of Extreme Values

Three useful diagnostics for assessing the potential effects of extreme values on regression estimators are distance, leverage, and influence. In general terms, distance refers to the possible presence of unusual values in the criterion variable and is typically measured as the deviation of a value from the measured center of the criterion variable (y). Leverage refers to the possible presence of unusual values in a predictor variable. In the case of a single predictor, leverage is typically measured as the deviation of a value from the measured center of the predictor variable (x). Influence incorporates both distance and leverage and refers to the possible presence of unusual values in some combination of the criterion and predictor variables.

For OLS regression, the measure of distance for any data point is simply an error term or residual, i.e., \(e_{i} = y_{i}-\hat {y}_{i}\) and is sometimes standardized and sometimes Studentized. Leverage is a measure of the importance of the ith observation in determining the model fit and is usually designated as h i. More specifically, h i is the ith diagonal element of the N×N matrix

$$\displaystyle \begin{aligned} \mathbf{H} = \mathbf{X}\left( {\mathbf{X}}^{\prime}\mathbf{X} \right)^{-1} {\mathbf{X}}^{\prime} \end{aligned}$$

called the “hat matrix,” since \(\hat {\mathbf {y}} = \mathbf {Hy}\), where \(\hat {\mathbf {y}}\) and \(\mathbf {y}\) are the column vectors

$$\displaystyle \begin{aligned} \hat{\mathbf{y}} = \left( \hat{y}_{1},\hat{y}_{2},\ldots,\hat{y}_{N} \right)^{\prime} \quad \mbox{and} \quad \mathbf{y} = \left( y_{1},y_{2},\ldots,y_{N} \right)^{\prime}\;. \end{aligned}$$

In the case of only one predictor, leverage is simply a function of the deviation of an x score from the mean of the predictor and is given by

$$\displaystyle \begin{aligned} h_{i} = \frac{1}{N}+\frac{(x_{i}-\bar{x})^{2}}{(N-1)s_{x}^{2}} \qquad \mbox{for }i = 1,\,\ldots,\,N\;, \end{aligned}$$

where \(s_{x}^{2}\) is the estimated population variance for variable x given by

$$\displaystyle \begin{aligned} s_{x}^{2} = \frac{1}{N-1}\sum_{i=1}^{N}\big( x_{i}-\bar{x} \big)^{2}\;. \end{aligned}$$

Influence combines both leverage and distance, measured as a Studentized residual, to identify unusually influential observations. Residuals are sometimes standardized and sometimes Studentized. Standardized residuals are given by

$$\displaystyle \begin{aligned} z_{i} = \frac{e_{i}}{s_{y.x}} \qquad \mbox{for }i = 1,\,\ldots,\,N\;, \end{aligned}$$

where \(e_{i} = y_{i}-\hat {y}_{i}\) for i = 1, …, N is the unstandardized residual and

$$\displaystyle \begin{aligned} s_{y.x} = \left(\frac{1}{N-p-1}\sum_{i=1}^{N}e_{i}^{2}\right)^{1/2} \end{aligned}$$

is the standard error of estimate. Standardized residuals have a mean of zero and a variance of one. Studentized residuals are given by

$$\displaystyle \begin{aligned} r_{i} = \frac{e_{i}}{s_{y.x}\sqrt{1-h_{i}}} = \frac{z_{i}}{\sqrt{1-h_{i}}} \qquad \mbox{for }i = 1,\,\ldots,\,N\;. \end{aligned}$$

Studentized residuals follow Student’s t distribution with mean near zero and variance slightly greater than one.

The most common measure of influence is Cook’s distance given by

$$\displaystyle \begin{aligned} d_{i} = \left( \frac{1}{p+1} \right) r_{i}^{2} \left( \frac{h_{i}}{1-h_{i}} \right)\;, \end{aligned}$$

where \(r_{i}^{2}\) denotes the squared Studentized residual and p is the number of predictor variables.
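
For an OLS fit with an intercept, all of these diagnostics can be assembled directly from the hat matrix. The Python sketch below is a textbook-style illustration rather than a substitute for a statistical package’s diagnostics; the example data consist of nine collinear points plus one added high-leverage point.

```python
import numpy as np

def ols_diagnostics(X, y):
    """Leverage, standardized and Studentized residuals, and Cook's distance."""
    X = np.column_stack([np.ones(len(y)), np.asarray(X, dtype=float)])
    y = np.asarray(y, dtype=float)
    N, cols = X.shape
    p = cols - 1                                    # number of predictors
    H = X @ np.linalg.inv(X.T @ X) @ X.T            # hat matrix
    h = np.diag(H)                                  # leverages h_i
    e = y - H @ y                                   # residuals e_i = y_i - yhat_i
    s = np.sqrt(e @ e / (N - p - 1))                # standard error of estimate
    z = e / s                                       # standardized residuals
    r = z / np.sqrt(1.0 - h)                        # Studentized residuals
    d = r**2 * (h / (1.0 - h)) / (p + 1)            # Cook's distance
    return h, z, r, d

# Nine collinear points plus one high-leverage point at (25, 5).
x = np.append(np.arange(1.0, 10.0), 25.0)
y = np.append(10.0 - np.arange(1.0, 10.0), 5.0)
h, z, r, d = ols_diagnostics(x, y)
print(np.round(d, 3))    # the added tenth point dominates Cook's distance
```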

To illustrate the effects of extreme values on the estimates of OLS and LAD regression parameters, consider an example of linear regression with one predictor and a single extreme data point. This simplified example permits the isolation and assessment of distance, leverage, and influence and allows comparison of the effects of an atypical value on estimates of OLS and LAD regression parameters. The data for a linear regression with one predictor variable are listed in Table 7.3. The bivariate data listed in Table 7.3 consist of nine data points with x i = i and y i = 10 − i for i = 1, …, 9 and describe a perfect negative linear relationship. Figure 7.1 displays the example bivariate data listed in Table 7.3 and indicates the directions of unusual values implicit in distance (D), leverage (L), and influence (I).

Fig. 7.1

Scatterplot of the data given in Table 7.3 with the directions of extreme values indicated by D, I, and L for distance, influence, and leverage, respectively

Fig. 7.2

Scatterplot of the data given in Table 7.3 with the locations of an added tenth value indicated by four white circles

Table 7.3 Example bivariate data on N = 9 objects for a perfect negative linear regression with one predictor variable

7.2.1.1 Distance

If a tenth bivariate value is added to the nine bivariate values given in Table 7.3 where (x 10, y 10) = (5, 5), the new data point is located at the common mean and median of both variable x and variable y and, therefore, does not affect the perfect linear relationship between the variables. If x 10 is held constant at x 10 = 5, but y 10 takes on the added values of 6, 7, …, 30, 40, 60, 80, and 100, then the effects of distance on the two regression models can be observed. The vertical movement of y 10 with variable x held constant at x 10 = 5 is depicted by the directional arrow labeled “D” in Fig. 7.1 and by the four white circles in Fig. 7.2, illustrating an additional data point moving vertically away from location (x 5, y 5) = (5, 5) by increments of one y unit, i.e., (5, 6), (5, 7), (5, 8), and so on.

Table 7.4 lists the values for x 10 and y 10 in the first two columns, the \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) estimates of the OLS regression parameters in the next two columns, and the \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) estimates of the LAD regression parameters in the last two columns. The \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) parameter estimates in the last two columns of Table 7.4 were obtained using the linear program of Barrodale and Roberts [2]. The estimates of the OLS regression parameters listed in Table 7.4 demonstrate that \(\hat {\alpha }_{yx}\) systematically changes with increases in distance, but \(\hat {\beta }_{yx}\) remains constant at − 1.00. In contrast, estimates of the LAD regression parameters are unaffected by changes in distance, remaining constant at \(\tilde {\alpha }_{yx} = 10.00\) and \(\tilde {\beta }_{yx} = -1.00\) for x 10 = 5 and any value of y 10. Given the nine bivariate data points listed in Table 7.3 and an additional bivariate data point with x 10 = 5, it follows that

$$\displaystyle \begin{aligned} \sum_{i=1}^{10} \big| y_{i}-\tilde{y}_{i} \big| = \big| y_{10}-5 \big|\;.\end{aligned} $$
Table 7.4 Effects of distance on intercepts and slopes of OLS and LAD linear regression models

7.2.1.2 Leverage

If a tenth bivariate value is added to the nine bivariate values given in Table 7.3 where y 10 = 5 and x 10 takes on the added values of 6, 7, …, 30, 40, 60, 80, and 100, then the effects of leverage on the two regression models can be observed. The horizontal movement of x 10 with y 10 held constant at y 10 = 5 is depicted by the directional arrow labeled “L” in Fig. 7.1 and by the four white circles in Fig. 7.3, illustrating an additional data point moving horizontally away from (x 5, y 5) = (5, 5) by increments of one x unit, i.e., (6, 5), (7, 5), (8, 5), and so on.

Fig. 7.3

Scatterplot of the data given in Table 7.3 with the locations of an added tenth value indicated by four white circles

Table 7.5 lists the values of x 10 and y 10 in the first two columns, the \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) estimates of the OLS regression parameters in the next two columns, and the \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) estimates of the LAD regression parameters in the last two columns. The \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) estimates were again obtained using the linear program of Barrodale and Roberts [2]. The estimates of the OLS regression parameters listed in Table 7.5 demonstrate that both \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) exhibit complex changes with increases in leverage. Note the dramatic changes in the intercept from \(\hat {\alpha }_{yx} = +10.00\) to \(\hat {\alpha }_{yx} = +5.1063\), approaching the mean of y (+ 5.00), and the slope from \(\hat {\beta }_{yx} = -1.00\) to \(\hat {\beta }_{yx} = -0.0073\), approaching a slope of zero. In contrast, \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) are unaffected for y 10 = 5 and 5 ≤ x 10 ≤ 24. For y 10 = 5 and x 10 ≥ 26, the LAD estimated regression parameters change from \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) to \(\tilde {\alpha }_{yx} = +5.00\) and \(\tilde {\beta }_{yx} = 0.00\).

Table 7.5 Effects of leverage on intercepts and slopes of OLS and LAD linear regression models
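
The drift of the OLS estimates recorded in Table 7.5 is easy to reproduce numerically. The short sketch below is illustrative only (np.polyfit is simply a convenient OLS routine, not the method used to construct the table): it refits the OLS line as the tenth point moves horizontally, showing the slope drifting from −1.00 toward 0.00 and the intercept toward the mean of y.

```python
import numpy as np

x9 = np.arange(1.0, 10.0)        # the nine x values of the perfect relationship
y9 = 10.0 - x9                   # y_i = 10 - x_i

for x10 in (5, 10, 20, 40, 100):           # tenth point held at y10 = 5
    x = np.append(x9, float(x10))
    y = np.append(y9, 5.0)
    beta, alpha = np.polyfit(x, y, 1)      # OLS slope and intercept
    print(f"x10 = {x10:5d}: alpha = {alpha:7.4f}, beta = {beta:7.4f}")
```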

Given the bivariate data listed in Table 7.3 on p. 9 and an additional bivariate data point with variable y held constant at y 10 = 5, it follows that

$$\displaystyle \begin{aligned} \sum_{i=1}^{10} \big| y_{i}-\tilde{y}_{i} \big| \leq 20.00 \end{aligned}$$

for x 10 ≤ 25 and

$$\displaystyle \begin{aligned} \sum_{i=1}^{10} \big| y_{i}-\tilde{y}_{i} \big| = 20.00 \end{aligned}$$

for x 10 ≥ 25. When x 10 ≤ 25, the LAD regression line defined by \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) yields the minimum sum of absolute differences. However, when x 10 ≥ 25 the LAD regression line defined by \(\tilde {\alpha }_{yx} = +5.00\) and \(\tilde {\beta }_{yx} = 0.00\) that passes through the data point located at (x 10, y 10) yields the minimum sum of absolute differences. For x 10 = 25, the LAD regression line is not unique. While this is an interesting property of LAD regression and can easily be demonstrated with one predictor and a small number of data points, in practice an extreme value would have to be so far removed from the measured center of the distribution of variable x as to be considered a “grossly aberrant” value [47, p. 871].

The fact that the solution is not unique when y 10 = 5 and x 10 = 25, so that either of the two LAD regression lines is appropriate, deserves some additional explanation. Consider the data points in Fig. 7.4 where the additional tenth point is indicated at locations

$$\displaystyle \begin{aligned} (x_{6},y_{5}), (x_{7},y_{5}),\,\ldots\,,(x_{9},y_{5}) \end{aligned}$$

and the LAD regression line for the original nine data points with \(\tilde {\alpha } = +10.00\) and \(\tilde {\beta } = -1.00\) is depicted. If only the original nine data points are considered, the sum of absolute deviations is zero, i.e.,

$$\displaystyle \begin{aligned} \begin{array}{rcl} \sum_{i=1}^{9}\big| y_{i}-\tilde{y}_{i} \big| = \big| 9-9 \big|+\big| 8-8 \big|&\displaystyle +&\displaystyle \big| 7-7 \big|+\big| 6-6 \big|+\big| 5-5 \big|+\big| 4-4 \big|\\ &\displaystyle &\displaystyle +\big| 3-3 \big|+\big| 2-2 \big|+\big| 1-1 \big| = 0.00\;. \end{array} \end{aligned} $$
Fig. 7.4

Scatterplot of the data given in Table 7.3 with the regression line \(\tilde {\beta }_{\mathit {yx}}\) depicted and the locations of an added tenth value indicated by four white circles

The addition of a tenth data point at location (x 6, y 5), the first white circle to the right of the regression line in Fig. 7.4, increases the sum of absolute deviations by one, i.e., \(|y_{i}-\tilde {y}_{i}| = |6-5| = 1\). Moving the new data point horizontally to location (x 7, y 5), the second white circle to the right of the regression line in Fig. 7.4, increases the sum of absolute deviations to two, i.e., \(|y_{i}-\tilde {y}_{i}| = |7-5| = 2\). Continuing to move the new data point horizontally increases the sum of absolute deviations further. Consider locations (x 24, y 5), (x 25, y 5), and (x 26, y 5), where

$$\displaystyle \begin{aligned} \big| y_{i}-\tilde{y}_{i} \big| = \big| 24-5 \big| = 19\;, \quad \big| y_{i}-\tilde{y}_{i} \big| = \big| 25-5 \big| = 20\;, \end{aligned}$$

and

$$\displaystyle \begin{aligned} \big| y_{i}-\tilde{y}_{i} \big| = \big| 26-5 \big| = 21\;, \end{aligned}$$

respectively.

Thus, for an additional value up to location (x 25, y 5) the sum of absolute deviations will be equal to or less than 20, and for an additional value beyond location (x 25, y 5) the sum of absolute deviations will be equal to or greater than 20. However, when a data point is added at location (x 25, y 5) something interesting happens, which is readily apparent in Table 7.5. At this point a dramatic shift in the LAD regression line occurs, from \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) to \(\tilde {\alpha }_{yx} = +5.00\) and \(\tilde {\beta }_{yx} = 0.00\). The regression line is leveraged and forced through the new data point location at (x 25, y 5). The new regression line is depicted in Fig. 7.5 with the absolute errors indicated by dashed lines. The sum of the absolute errors around the new regression line is

$$\displaystyle \begin{aligned} \begin{array}{rcl} \sum_{i=1}^{10} \big| y_{i}-\tilde{y}_{i} \big| = \big| 9-5 \big|&\displaystyle +&\displaystyle \big| 8-5 \big|+\big| 7-5 \big|+\big| 6-5 \big|+\big| 5-5 \big|+\big| 4-5 \big|\\ &\displaystyle &\displaystyle +\big| 3-5 \big|+\big| 2-5 \big|+\big| 1-5 \big|+\big| 5-5 \big| = 20.00\;. \end{array} \end{aligned} $$

Thus both regression lines given by \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) and \(\tilde {\alpha }_{yx} = +5.00\) and \(\tilde {\beta }_{yx} = 0.00\) minimize the sum of absolute deviations when an additional data point is located at (x 25, y 5). Note, however, that the additional data point is far to the right and is a very extreme value, unlikely to be encountered in everyday research. Specifically, for this minimalist example, a tenth value at location (x 25, y 5) is almost three times the range and over seven standard deviations above the mean—too extreme to be of concern in practice. Thus, LAD regression is highly stable under all but the most extreme cases.

Fig. 7.5

Scatterplot of the data given in Table 7.3 with absolute errors indicated by dashed lines

7.2.1.3 Influence

If a tenth bivariate value is added to the nine bivariate values given in Table 7.3 on p. 9 where x 10 = y 10 takes on the added values of 6, 7, …, 30, 40, 60, 80, and 100, then the effects of influence on the two regression models can be observed. The diagonal movement of (x 10, y 10) is depicted by the directional arrow labeled “I” in Fig. 7.1 and by the four white circles in Fig. 7.6, illustrating an additional data point moving diagonally away from (x 5, y 5) = (5, 5) by increments of one x and one y unit, i.e., (6, 6), (7, 7), (8, 8), and so on.

Fig. 7.6

Scatterplot of the data given in Table 7.3 with the locations of an added tenth value indicated by four white circles

Table 7.6 lists the values of x 10 and y 10 in the first two columns, the \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) estimates of the OLS regression parameters in the next two columns, and the \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) estimates of the LAD regression parameters in the last two columns. The estimates of the OLS regression parameters listed in Table 7.6 demonstrate that both \(\hat {\alpha }_{yx}\) and \(\hat {\beta }_{yx}\) exhibit complex changes with increases in influence, quickly becoming unstable with changes in the intercept from \(\hat {\alpha }_{yx} = +10.00\) to \(\hat {\alpha }_{yx} = +0.2126\) and changes in the slope from \(\hat {\beta }_{yx} = -1.00\) to \(\hat {\beta }_{yx} = +0.9853\). Note that \(\hat {\beta }_{yx}\) is negative from x 10 = 5 up to x 10 = 13, then changes to positive for x 10 = 14 up to x 10 = 100. Note also that the range of changes in \(\hat {\beta }_{yx}\) is from \(\hat {\beta }_{yx} = -1.00\) for x 10 = 5 approaching \(\hat {\beta }_{yx} = +1.00\) for x 10 = 100; actually, \(\hat {\beta }_{yx} = +0.9853\) for x 10 = 100. In contrast, \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\) do not change for 5 ≤ x 10 = y 10 ≤ 24. For x 10 = y 10 ≥ 26, the estimates of the LAD regression parameters change from \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) to \(\tilde {\alpha }_{yx} = 0.00\) and \(\tilde {\beta }_{yx} = +1.00\). When x 10 = y 10 = 25, either of the two LAD regression lines holds since the solution is not unique. Thus, two LAD regression lines minimize the sum of absolute errors: one with \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) and the other with \(\tilde {\alpha }_{yx} = 0.00\) and \(\tilde {\beta }_{yx} = +1.00\).

Table 7.6 Effects of influence on intercepts and slopes of OLS and LAD linear regression models

Figure 7.7 depicts the two LAD regression lines, labeled with the values for \(\tilde {\alpha }_{yx}\) and \(\tilde {\beta }_{yx}\), and dashed lines indicating the errors around the regression line with \(\tilde {\alpha }_{yx} = 0.00\) and \(\tilde {\beta }_{yx} = +1.00\). As shown in Fig. 7.7, the sum of absolute errors is

$$\displaystyle \begin{aligned} \begin{array}{rcl} \sum_{i=1}^{10} \big| y_{i}-\tilde{y}_{i} \big| = \big| 9-1 \big|&\displaystyle +&\displaystyle \big| 8-2 \big|+\big| 7-3 \big|+\big| 6-4 \big|+\big| 5-5 \big|+\big| 4-6 \big|\\ &\displaystyle +&\displaystyle \big| 3-7 \big|+\big| 2-8 \big|+\big| 1-9 \big|+\big| 25-25 \big| = 40.00\;. \end{array} \end{aligned} $$
Fig. 7.7

Scatterplot of the data given in Table 7.3 with the regression lines minimizing the sum of absolute errors

Given the bivariate data listed in Table 7.3 on p. 9 and an additional bivariate data point x 10 = y 10, it follows that

$$\displaystyle \begin{aligned} \sum_{i=1}^{10}\big| y_{i}-\tilde{y}_{i} \big| \leq 40.00 \end{aligned}$$

for 5 ≤ x 10 = y 10 ≤ 25 and

$$\displaystyle \begin{aligned} \sum_{i=1}^{10}\big| y_{i}-\tilde{y}_{i} \big| = 40.00 \end{aligned}$$

for x 10 = y 10 ≥ 25. When x 10 = y 10 ≤ 25, the LAD regression line defined by \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) yields the minimum sum of absolute differences between y i and \(\tilde {y}_{i}\) for i = 1, …, N. However, when x 10 = y 10 ≥ 25, the LAD regression line defined by \(\tilde {\alpha }_{yx} = 0.00\) and \(\tilde {\beta }_{yx} = +1.00\) that passes through the data point located at (x 10, y 10) yields the minimum sum of absolute differences between y i and \(\tilde {y}_{i}\) for i = 1, …, N. For x 10 = y 10 = 25, the LAD regression line is not unique. It should be noted that the shift in the LAD regression line is a consequence of only the leverage component of influence. For these data, the LAD regression line is defined by \(\tilde {\alpha }_{yx} = +10.00\) and \(\tilde {\beta }_{yx} = -1.00\) if |x 10 − 5|≤ 20.00 and the regression line is unique if |x 10 − 5| < 20.0 or y 10 = 10 − x 10.

LAD linear regression is a robust alternative to OLS linear regression, especially when the errors are generated by fat-tailed distributions [10, 52]. Fat-tailed distributions produce an abundance of extreme values, and OLS linear regression gives disproportionate weight to extreme values. In practice, LAD linear regression is virtually unaffected by the presence of a few extreme values. While the effects of distance, leverage, and influence are illustrated with only a simplified example of perfect linear regression with one predictor, the results extend to more general regression models. If a less-than-perfect regression model with p predictors is considered, the estimators of the LAD regression parameters remain unaffected by unusual y i values when the leverage effect is absent. In addition, only exceedingly extreme values of the predictors x 1, …, x p have any effect on the estimation of the LAD regression parameters.

7.2.2 Univariate Example of LAD Regression

Consider the small example set of bivariate data listed in Table 7.7 for N = 10 subjects. For the bivariate data listed in Table 7.7, the LAD regression coefficient is \(\tilde {\beta } = +2.1111\), δ = 5.9889, μ δ = 9.2267, and the LAD chance-corrected measure of agreement between the observed y values and the predicted \(\tilde {y}\) values is

$$\displaystyle \begin{aligned} \mathfrak{R} = 1-\frac{\delta}{\mu_{\delta}} = 1-\frac{5.9889}{9.2267} = +0.3509\;. \end{aligned}$$

Since there are M = N! = 10! = 3, 628, 800 possible arrangements of the observed data, an exact permutation analysis may not be practical. Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(\mathfrak {R} = +0.3509\) is

$$\displaystyle \begin{aligned} P \big( \mathfrak{R} \geq \mathfrak{R}_{\text{o}}|H_{0} \big) = \frac{\text{number of }\mathfrak{R}\text{ values } \geq \mathfrak{R}_{\text{o}}}{L}\;, \end{aligned}$$

where \(\mathfrak {R}_{\text{o}}\) denotes the observed value of \(\mathfrak {R}\).

Table 7.7 Example bivariate LAD correlation data on N = 10 subjects

While M = 3, 628, 800 possible arrangements makes an exact permutation analysis impractical, it is not impossible. If the reference set of all possible permutations of the observed scores in Table 7.7 occur with equal chance, the exact probability of \(\mathfrak {R} = +0.3509\) under the null hypothesis is

$$\displaystyle \begin{aligned} P \big( \mathfrak{R} \geq \mathfrak{R}_{\text{o}}|H_{0} \big) = \frac{\text{number of }\mathfrak{R}\text{ values } \geq \mathfrak{R}_{\text{o}}}{M}\;, \end{aligned}$$

where \(\mathfrak {R}_{\text{o}}\) denotes the observed value of \(\mathfrak {R}\).
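
A Monte Carlo analysis of this kind only requires refitting the LAD line for each random shuffle of the y values and recomputing the chance-corrected agreement measure. The self-contained sketch below is illustrative: the brute-force lad_fit helper relies on the fact that an optimal simple LAD line can be taken to pass through two of the data points, which is adequate for small N but is not the linear programming algorithm used in the text, and the data are arbitrary placeholders rather than the Table 7.7 values.

```python
import numpy as np

def lad_fit(x, y):
    """Brute-force simple LAD fit: search all lines through pairs of points."""
    best = (np.inf, 0.0, 0.0)
    for i in range(len(x)):
        for j in range(i + 1, len(x)):
            if x[i] == x[j]:
                continue
            b = (y[j] - y[i]) / (x[j] - x[i])
            a = y[i] - b * x[i]
            s = np.sum(np.abs(y - (a + b * x)))
            if s < best[0]:
                best = (s, a, b)
    return best[1], best[2]

def agreement(y, y_pred):
    """Chance-corrected agreement: 1 - delta / mu_delta."""
    delta = np.mean(np.abs(y - y_pred))
    mu_delta = np.mean(np.abs(y[:, None] - y_pred[None, :]))
    return 1.0 - delta / mu_delta

def monte_carlo_lad_test(x, y, L=5_000, seed=1):
    """Shuffle y, refit the LAD line, and count agreement values >= observed."""
    rng = np.random.default_rng(seed)
    a, b = lad_fit(x, y)
    R_obs = agreement(y, a + b * x)
    count = 0
    for _ in range(L):
        y_perm = rng.permutation(y)
        a_p, b_p = lad_fit(x, y_perm)
        if agreement(y_perm, a_p + b_p * x) >= R_obs:
            count += 1
    return R_obs, count / L

# Arbitrary placeholder data for N = 10 subjects (not the Table 7.7 values).
rng = np.random.default_rng(0)
x = np.arange(1.0, 11.0)
y = 2.0 * x + rng.normal(scale=4.0, size=10)
print(monte_carlo_lad_test(x, y, L=2_000))
```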

7.2.3 Multivariate Example of LAD Regression

To illustrate a multivariate LAD linear regression analysis, an application of the LAD regression model to forecasting African rainfall in the western Sahel is utilized [38]. For the multivariate data listed in Table 7.8, the first column lists N = 15 calendar years from 1950 to 1964 and the second through fourth columns (U 50, U 30, and |U 50 − U 30|) contain values based on the quasi-biennial oscillation of equatorial east/west winds. U 50 is the zonal wind measured in meters per second at 50 millibars (approximately 20 km in altitude) and U 30 is the zonal wind measured in meters per second at 30 millibars (approximately 23 km in altitude). The R s values in the fifth column are standard deviations from the mean rainfall for the western Sahel region. The values for R g in the sixth column are standard deviations from the mean rainfall for the Gulf of Guinea. The dependent variable in the seventh column is the April to October rainfall in the western Sahel region based on recordings from 20 stations in the region.

Table 7.8 Regional rainfall precipitation by years with predictors U 50, U 30, |U 50 − U 30|, R s, and R g

For the multivariate data listed in Table 7.8, the LAD regression coefficients are

$$\displaystyle \begin{gathered} \tilde{\beta}_{1} = -0.0021\;, \quad \tilde{\beta}_{2} = -0.0364\;, \quad \tilde{\beta}_{3} = -0.0325\;,\\ \tilde{\beta}_{4} = +0.5328\;, \quad \mbox{and} \quad \tilde{\beta}_{5} = +0.5215\;, \end{gathered} $$

δ = 0.3439, μ δ = 0.4756, and the LAD chance-corrected measure of agreement between the observed y values and the predicted \(\tilde {y}\) values is

$$\displaystyle \begin{aligned} \mathfrak{R} = 1-\frac{\delta}{\mu_{\delta}} = 1-\frac{0.3439}{0.4756} = +0.2768. \end{aligned}$$

Even with a small sample of observations such as this, there are

$$\displaystyle \begin{aligned} M = N! = 15! = 1{,}307{,}674{,}368{,}000 \end{aligned}$$

possible, equally-likely arrangements of the observed data to be considered, far too many for an exact permutation analysis. Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(\mathfrak {R} = +0.2768\) is

$$\displaystyle \begin{aligned} P \big( \mathfrak{R} \geq \mathfrak{R}_{\text{o}}|H_{0} \big) = \frac{\text{number of }\mathfrak{R}\text{ values } \geq \mathfrak{R}_{\text{o}}}{L} = \frac{42{,}279}{1{,}000{,}000} = 0.0423\;, \end{aligned}$$

where \(\mathfrak {R}_{\text{o}}\) denotes the observed value of \(\mathfrak {R}\).

7.3 LAD Multivariate Multiple Regression

An extension of LAD multiple linear regression to include multiple response variables, coupled with multiple predictor variables, is developed in this section [36, 37]. The extension was prompted by a multivariate Least Sum of Euclidean Distances (LSED) algorithm developed by Kaufman, Taylor, Mielke, and Berry in 2002 [24].

Consider the multivariate multiple linear regression model given by

$$\displaystyle \begin{aligned} y_{ik} = \sum_{j=1}^{m} x_{ij} \beta_{jk}+e_{ik} \end{aligned}$$

for i = 1, …, N and k = 1, …, r, where y ik represents the ith of N measurements for the kth of r response variables, possibly affected by a treatment; x ij is the jth of m covariates associated with the ith response, where x i1 = 1 if the model includes an intercept; β jk denotes the jth of m regression parameters for the kth of r response variables; and e ik designates the error associated with the ith of N measurements for the kth of r response variables.

If estimates of β jk that minimize

$$\displaystyle \begin{aligned} \sum_{i=1}^{N} \left( \:\sum_{k=1}^{r} e_{ik}^{2} \right)^{1/2} \end{aligned}$$

are denoted by \(\tilde {\beta }_{jk}\) for j = 1, …, m and k = 1, …, r, then the N r-dimensional residuals of the LSED multivariate multiple linear regression model are given by

$$\displaystyle \begin{aligned} e_{ik} = y_{ik}-\sum_{j=1}^{m} x_{ij} \tilde{\beta}_{jk} \end{aligned}$$

for i = 1, …, N and k = 1, …, r.
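
For readers who wish to experiment, the LSED criterion can be minimized with a general-purpose optimizer started at the OLS solution. The sketch below is a rough numerical illustration under that assumption; it is not the Kaufman–Taylor–Mielke–Berry algorithm cited above, and the example data are synthetic.

```python
import numpy as np
from scipy.optimize import minimize

def lsed_fit(X, Y):
    """Minimize the sum over cases of the Euclidean norm of the residual vector."""
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    m, r = X.shape[1], Y.shape[1]

    def objective(b_flat):
        E = Y - X @ b_flat.reshape(m, r)
        return np.sum(np.sqrt(np.sum(E**2, axis=1)))

    b0, *_ = np.linalg.lstsq(X, Y, rcond=None)        # OLS starting values
    res = minimize(objective, b0.ravel(), method="Powell")
    B = res.x.reshape(m, r)
    return B, Y - X @ B                               # coefficient estimates, residuals

# Synthetic illustration: N = 8 cases, m = 2 columns (intercept and one x),
# r = 2 response variables.
rng = np.random.default_rng(0)
x = rng.normal(size=8)
X = np.column_stack([np.ones(8), x])
Y = np.column_stack([1.0 + 2.0 * x, 3.0 - x]) + rng.normal(scale=0.5, size=(8, 2))
B, E = lsed_fit(X, Y)
print(np.round(B, 3))
```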

Let the N r-dimensional residuals, e i1, …, e ir for i = 1, …, N, obtained from a LSED multivariate multiple linear regression model, be partitioned into g treatment groups of sizes n 1, …, n g, where n i ≥ 2 for i = 1, …, g and

$$\displaystyle \begin{aligned} N = \sum_{i=1}^{g} n_{i}\;. \end{aligned}$$

The analysis of the multivariate multiple regression residuals depends on test statistic

$$\displaystyle \begin{aligned} \delta = \sum_{i=1}^{g} C_{i} \xi_{i}\;, \end{aligned} $$
(7.3)

where C i = n i∕N is a positive weight for the ith of g treatment groups and ξ i is the average pairwise Euclidean distance among the n i r-dimensional residuals in the ith of g treatment groups defined by

$$\displaystyle \begin{aligned} \xi_{i} = \binom{n_{i}}{2}^{-1} \sum_{k=1}^{N-1}\,\sum_{l=k+1}^{N} \left[ \sum_{j=1}^{r} \big( e_{kj}-e_{lj} \big)^{2} \right]^{1/2} \Psi_{ki}\Psi_{li}\;, \end{aligned} $$
(7.4)

where

$$\displaystyle \begin{aligned} \Psi_{ki} = \begin{cases} \,1 & \text{if }(e_{k1},\,\ldots,\,e_{kr})\text{ is in the }i\text{th treatment group ,} \\ {} \,0 & \text{otherwise .} \end{cases}\end{aligned} $$

The null hypothesis specifies that each of the

$$\displaystyle \begin{aligned} M = \frac{N!}{\displaystyle\prod_{i=1}^{g}n_{i}!}\end{aligned} $$

possible allocations of the N r-dimensional residuals to the g treatment groups is equally-likely. Under the null hypothesis, an exact probability value associated with the observed value of δ, δ o, is given by

$$\displaystyle \begin{aligned} P(\delta \leq \delta_{\text{o}}|H_{0}) = \frac{\mbox{number of }\delta\text{ values }\leq \delta_{\text{o}}}{M}\;.\end{aligned} $$

As with LAD univariate multiple regression models, the criterion for fitting LSED multivariate multiple regression models based on δ is the chance-corrected measure of effect size between the observed and predicted response measurement values given by

$$\displaystyle \begin{aligned} \mathfrak{R} = 1-\frac{\delta}{\mu_{\delta}}\;, \end{aligned} $$
(7.5)

where μ δ is the expected value of δ over the M possible allocations of the residuals to the g treatment groups under the null hypothesis, given by

$$\displaystyle \begin{aligned} \mu_{\delta} = \frac{1}{M} \sum_{i=1}^{M} \delta_{i}\;. \end{aligned} $$
(7.6)

Note that \(\mathfrak {R} = 1\) implies perfect agreement between the observed and model-predicted response vectors and the expected value of \(\mathfrak {R}\) is 0 under the null hypothesis, i.e., chance-corrected.
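
Once the residual vectors and group labels are in hand, ξ i, δ, μ δ, and the chance-corrected measure of effect size are simple to compute. The sketch below substitutes a Monte Carlo shuffle of the group labels for the exact enumeration over all M allocations described above, and estimates μ δ from the same resamples rather than computing it exactly; the residuals at the bottom are synthetic placeholders, not values from the example that follows.

```python
import numpy as np
from itertools import combinations

def delta_stat(E, groups):
    """delta = sum over groups of (n_i / N) times the average pairwise
    Euclidean distance among the residual vectors in that group."""
    N = len(groups)
    delta = 0.0
    for g in np.unique(groups):
        rows = E[groups == g]
        dists = [np.linalg.norm(a - b) for a, b in combinations(rows, 2)]
        delta += (len(rows) / N) * np.mean(dists)
    return delta

def permutation_test(E, groups, L=10_000, seed=1):
    """Monte Carlo analogue of the exact test: permute group labels over the
    residual vectors; estimate mu_delta and the effect size from the resamples."""
    rng = np.random.default_rng(seed)
    d_obs = delta_stat(E, groups)
    deltas = np.array([delta_stat(E, rng.permutation(groups)) for _ in range(L)])
    p_value = np.mean(deltas <= d_obs)
    mu_delta = deltas.mean()
    return d_obs, p_value, 1.0 - d_obs / mu_delta

# Synthetic bivariate residuals: N = 16 cases in g = 3 groups of sizes 5, 7, 4.
rng = np.random.default_rng(0)
E = rng.normal(size=(16, 2))
groups = np.repeat([1, 2, 3], [5, 7, 4])
print(permutation_test(E, groups, L=2_000))
```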

7.3.1 Example of Multivariate Multiple Regression

To illustrate a multivariate LSED multiple regression analysis, consider an unbalanced two-way randomized-block experimental design in which N = 16 subjects are tested over a = 3 levels of Factor A, the experiment is repeated b = 2 times for Factor B, and there are r = 2 response measurement scores for each subject. The data are listed in Table 7.9. The design is intentionally kept small to illustrate the multivariate multiple regression procedure.

Table 7.9 Example data for a two-way randomized-block design with a = 3 blocks, b = 2 treatments, and N = 16 subjects

7.3.1.1 Analysis of Factor A

A design matrix of dummy codes (0, 1) for a regression analysis of Factor A is given in Table 7.10, where the first column of 1 values provides for an intercept, the next column contains the dummy codes for Factor B, and the third and fourth columns contain the bivariate response measurement scores listed according to the original random assignment of the N = 16 subjects to the a = 3 levels of Factor A, with the first \(n_{A_{1}} = 5\) scores, the next \(n_{A_{2}} = 7\) scores, and the last \(n_{A_{3}} = 4\) scores associated with the a = 3 levels of Factor A, respectively. The analysis of the data listed in Table 7.10 examines the N = 16 regression residuals for possible differences among the a = 3 treatment levels of Factor A; consequently, no dummy codes are provided for Factor A as this information is implicit in the ordering of the a = 3 levels of Factor A in the last two columns of Table 7.10.

Table 7.10 Example design matrix and bivariate response measurement scores for a multivariate LSED multiple regression analysis of Factor A with N = 16 subjects

Because there are only

$$\displaystyle \begin{aligned} M = \frac{N!}{\displaystyle\prod_{i=1}^{a}n_{A_{i}}!} = \frac{16!}{5!\;7!\;4!} = 1{,}441{,}440 \end{aligned}$$

possible, equally-likely arrangements of the N = 16 bivariate response measurement scores listed in Table 7.10, an exact permutation analysis is feasible. The analysis of the N = 16 LAD regression residuals calculated on the bivariate response measurement scores for Factor A in Table 7.10 yields estimated LAD regression coefficients of

$$\displaystyle \begin{aligned} \tilde{\beta}_{1,1} = +58.00\;, \quad \tilde{\beta}_{2,1} = -9.00\;, \quad \tilde{\beta}_{1,2} = +94.00\;, \;\;\mbox{and} \quad \tilde{\beta}_{2,2} = +8.00 \end{aligned}$$

for Factor A. Table 7.11 lists the observed y ik values, LAD-predicted \(\tilde {y}_{ik}\) values, and residual e ik values for i = 1, …, 16 subjects and k = 1, 2 response variables.

Table 7.11 Observed, predicted, and residual values for a multivariate LSED multiple regression analysis of Factor A with N = 16 subjects

Following Eq. (7.4) on p. 23 and employing ordinary Euclidean distance between residuals, the N = 16 LAD regression residuals listed in Table 7.11 yield a = 3 average distance-function values of

$$\displaystyle \begin{aligned} \xi_{A_{1}} = 7.2294\;, \quad \xi_{A_{2}} = 20.0289\;, \;\; \mbox{and} \quad \xi_{A_{3}} = 7.3475\;. \end{aligned}$$

Following Eq. (7.3) on p. 23, the observed value of test statistic δ calculated on the N = 16 LAD regression residuals listed in Table 7.11 with treatment group weights

$$\displaystyle \begin{aligned} C_{j} = \frac{n_{A_{j}}}{N} \qquad \mbox{for }j = 1,2,3 \end{aligned}$$

is

$$\displaystyle \begin{aligned} \delta_{A} = \sum_{j=1}^{a} C_{j}\xi_{j} = \frac{1}{16} \big[ (5)(7.2294)+(7)(20.0289)+(4)(7.3475) \big] = 12.8587\;. \end{aligned}$$

If all M arrangements of the N = 16 observed LAD regression residuals listed in Table 7.11 occur with equal chance, the exact probability value of δ A = 12.8587 computed on the M = 1, 441, 440 possible arrangements of the observed LAD regression residuals with \(n_{A_{1}} = 5\), \(n_{A_{2}} = 7\), and \(n_{A_{3}} = 4\) preserved for each arrangement is

$$\displaystyle \begin{aligned} P(\delta \leq \delta_{A}|H_{0}) = \frac{\mbox{number of }\delta\text{ values }\leq \delta_{A}}{M} = \frac{6{,}676}{1{,}441{,}440} = 0.0046\;. \end{aligned}$$

Following Eq. (7.6) on p. 24, the exact expected value of the M = 1, 441, 440 δ values is

$$\displaystyle \begin{aligned} \mu_{\delta} = \frac{1}{M} \sum_{i=1}^{M} \delta_{i} = \frac{26{,}092{,}946.8800}{1{,}441{,}440} = 18.1020 \end{aligned}$$

and, following Eq. (7.5) on p. 23, the observed chance-corrected measure of effect size for the y i and \(\tilde {y}_{i}\) values, i = 1, …, N, is

$$\displaystyle \begin{aligned} \mathfrak{R}_{A} = 1-\frac{\delta_{A}}{\mu_{\delta}} = 1-\frac{12.8587}{18.1020} = +0.2897\;, \end{aligned}$$

indicating approximately 29% agreement between the observed and predicted values above that expected by chance.

7.3.1.2 Analysis of Factor B

A design matrix of dummy codes (0, 1) for a regression analysis of Factor B is given in Table 7.12, where the first column of 1 values provides for an intercept, the next two columns contain the dummy codes for Factor A, and the fourth and fifth columns contain the bivariate response measurement scores listed according to the original random assignment of the N = 16 subjects to the b = 2 levels of Factor B, with the first \(n_{B_{1}} = 7\) scores and the last \(n_{B_{2}} = 9\) scores associated with the b = 2 levels of Factor B, respectively. The analysis of the data listed in Table 7.12 examines the N = 16 regression residuals for possible differences between the b = 2 treatment levels of Factor B; consequently, no dummy codes are provided for Factor B as this information is implicit in the ordering of the b = 2 levels of Factor B in the last two columns of Table 7.12.

Table 7.12 Example design matrix and bivariate response measurement scores for a multivariate LSED multiple regression analysis of Factor B with N = 16 subjects

Because there are only

$$\displaystyle \begin{aligned} M = \frac{N!}{\displaystyle\prod_{i=1}^{b}n_{B_{i}}!} = \frac{16!}{7!\;9!} = 11{,}440 \end{aligned}$$

possible, equally-likely arrangements of the N = 16 response measurement scores listed in Table 7.12, an exact permutation analysis is feasible. The analysis of the N = 16 LAD regression residuals calculated on the bivariate response measurement scores for Factor B in Table 7.12 yields estimated LAD regression coefficients of

$$\displaystyle \begin{gathered} \tilde{\beta}_{1,1} = +46.00\;, \quad \tilde{\beta}_{2,1} = +5.00\;, \quad \tilde{\beta}_{3,1} = +20.00\;, \quad \tilde{\beta}_{1,2} = +104.00\;,\\ \tilde{\beta}_{2,2} = -4.00\;, \;\;\; \mbox{and} \quad \tilde{\beta}_{3,2} = -20.00 \end{gathered} $$

for Factor B. Table 7.13 lists the observed y ik values, LAD-predicted \(\tilde {y}_{ik}\) values, and residual e ik values for i = 1, …, 16 subjects and k = 1, 2 response variables.

Table 7.13 Observed, predicted, and residual values for a multivariate LSED multiple regression analysis of Factor B with N = 16 subjects

Following Eq. (7.4) on p. 23 and employing ordinary Euclidean distance between residuals, the N = 16 LAD regression residuals listed in Table 7.13 yield b = 2 average distance-function values of

$$\displaystyle \begin{aligned} \xi_{B_{1}} = 6.0229 \quad \mbox{and} \quad \xi_{B_{2}} = 16.7440\;. \end{aligned}$$

Following Eq. (7.3) on p. 23, the observed value of test statistic δ calculated on the N = 16 LAD regression residuals listed in Table 7.13 with treatment group weights

$$\displaystyle \begin{aligned} C_{i} = \frac{n_{B_{i}}}{N} \qquad \mbox{for }i = 1,2\;, \end{aligned}$$

is

$$\displaystyle \begin{aligned} \delta_{B} = \sum_{i=1}^{b} C_{i} \xi_{i} = \frac{1}{16} \big[ (7)(6.0229)+(9)(16.7440) \big] = 12.0535\;. \end{aligned}$$

If all M arrangements of the N = 16 observed LAD regression residuals listed in Table 7.13 occur with equal chance, the exact probability value of δ B = 12.0535 computed on the M = 11, 440 possible arrangements of the observed LAD regression residuals with \(n_{B_{1}} = 7\) and \(n_{B_{2}} = 9\) preserved for each arrangement is

$$\displaystyle \begin{aligned} P(\delta \leq \delta_{B}|H_{0}) = \frac{\mbox{number of }\delta\text{ values }\leq \delta_{B}}{M} = \frac{2{,}090}{11{,}440} = 0.1827\;. \end{aligned}$$

Following Eq. (7.6) on p. 24, the exact expected value of the M = 11, 440 δ values is

$$\displaystyle \begin{aligned} \mu_{\delta} = \frac{1}{M} \sum_{i=1}^{M} \delta_{i} = \frac{140{,}623.9120}{11{,}440} = 12.2923 \end{aligned}$$

and, following Eq. (7.5) on p. 23, the observed chance-corrected measure of effect size for the y i and \(\tilde {y}_{i}\) values, i = 1, …, N, is

$$\displaystyle \begin{aligned} \mathfrak{R}_{B} = 1-\frac{\delta_{B}}{\mu_{\delta}} = 1-\frac{12.0535}{12.2923} = +0.0194\;, \end{aligned}$$

indicating approximately 2% agreement between the observed and predicted values above that expected by chance.

For another example of LAD multivariate multiple regression, see the informative and widely cited article by Endler and Mielke, “Comparing entire colour patterns as birds see them,” in the Biological Journal of the Linnean Society [11].

7.4 Comparison of OLS and LAD Linear Regression

In this section, OLS and LAD linear regression analyses are illustrated and compared on two example data sets—one with p = 2 predictors and no extreme values and one with p = 2 predictors and a single extreme value. Consider first the small example data set with p = 2 predictors listed in Table 7.14 where variable y is Hours of Housework done by husbands per week, variable x 1 is Number of Children, and variable x 2 is husband’s Years of Education for N = 12 families.

Table 7.14 Example multivariate correlation data on N = 12 families with p = 2 predictors

7.4.1 Ordinary Least Squares (OLS) Analysis

For the multivariate data listed in Table 7.14, the unstandardized OLS regression coefficients are

$$\displaystyle \begin{aligned} \hat{\beta}_{1} = +0.6356 \quad \mbox{and} \quad \hat{\beta}_{2} = -0.0649\;, \end{aligned}$$

and the observed squared OLS multiple correlation coefficient is \(R_{\text{o}}^{2} = 0.2539\). Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(R_{\text{o}}^{2} = 0.2539\) is

$$\displaystyle \begin{aligned} P \big( R^{2} \geq R_{\text{o}}^{2}|H_{0} \big) = \frac{\text{number of }R^{2}\text{ values } \geq R_{\text{o}}^{2}}{L} = \frac{268{,}026}{1{,}000{,}000} = 0.2680\;, \end{aligned}$$

where \(R_{\text{o}}^{2}\) denotes the observed value of R 2. For comparison, the exact probability value of \(R_{\text{o}}^{2} = 0.2539\) based on M = N! = 12! = 479, 001, 600 possible arrangements of the data listed in Table 7.14 is P = 0.2681.

7.4.2 Least Absolute Deviation (LAD) Analysis

For the multivariate data listed in Table 7.14, the LAD regression coefficients are

$$\displaystyle \begin{aligned} \tilde{\beta}_{1} = +0.4138 \quad \mbox{and} \quad \tilde{\beta}_{2} = +0.1207\;, \end{aligned}$$

δ = 1.5000, μ δ = 1.8084, and the LAD chance-corrected measure of agreement between the observed y values and the predicted \(\tilde {y}\) values is

$$\displaystyle \begin{aligned} \mathfrak{R}_{\text{o}} = 1-\frac{\delta}{\mu_{\delta}} = 1-\frac{1.5000}{1.8084} = +0.1706\;. \end{aligned}$$

Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(\mathfrak {R} = +0.1706\) is

$$\displaystyle \begin{aligned} P \big( \mathfrak{R} \geq \mathfrak{R}_{\text{o}}|H_{0} \big) = \frac{\text{number of }\mathfrak{R}\text{ values } \geq \mathfrak{R}_{\text{o}}}{L} = \frac{19{,}176}{1{,}000{,}000} = 0.0192\;, \end{aligned}$$

where \(\mathfrak {R}_{\text{o}}\) denotes the observed value of \(\mathfrak {R}\). For comparison, the exact probability value of \(\mathfrak {R}_{\text{o}} = +0.1706\) based on M = N! = 12! = 479, 001, 600 possible arrangements of the data listed in Table 7.14 is P = 0.0221.

Now, suppose that the husband in family “L” was a stay-at-home house-husband and instead of contributing just four hours of housework per week, he actually contributed 40 hours, as in Table 7.15.

Table 7.15 Example multivariate correlation data on N = 12 families with p = 2 predictors, where the husband in Family L contributed 40 hours of housework per week

7.4.3 Ordinary Least Squares (OLS) Analysis

For the multivariate data listed in Table 7.15, the unstandardized OLS regression coefficients are

$$\displaystyle \begin{aligned} \hat{\beta}_{1} = +5.7492 \quad \mbox{and} \quad \hat{\beta}_{2} = +2.3896\;, \end{aligned}$$

and the observed squared OLS multiple correlation coefficient is \(R_{\text{o}}^{2} = 0.5786\). Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(R_{\text{o}}^{2} = 0.5786\) is

$$\displaystyle \begin{aligned} P \big( R^{2} \geq R_{\text{o}}^{2}|H_{0} \big) = \frac{\text{number of }R^{2}\text{ values } \geq R_{\text{o}}^{2}}{L} = \frac{15{,}215}{1{,}000{,}000} = 0.0152\;, \end{aligned}$$

where \(R_{\text{o}}^{2}\) denotes the observed value of R 2. For comparison, the exact probability value of \(R_{\text{o}}^{2} = 0.5786\) based on M = N! = 12! = 479, 001, 600 possible arrangements of the data listed in Table 7.15 is P = 0.0153.

7.4.4 Least Absolute Deviation (LAD) Analysis

For the multivariate data listed in Table 7.15, the LAD regression coefficients are

$$\displaystyle \begin{aligned} \tilde{\beta}_{1} = +1.3000 \quad \mbox{and} \quad \tilde{\beta}_{2} = +0.0500\;, \end{aligned}$$

δ o = 4.0333, μ δ = 5.2194, and the LAD chance-corrected measure of agreement between the observed y values and the predicted \(\tilde {y}\) values is

$$\displaystyle \begin{aligned} \mathfrak{R}_{\text{o}} = 1-\frac{\delta_{\text{o}}}{\mu_{\delta}} = 1-\frac{4.0333}{5.2194} = +0.2272\;. \end{aligned}$$

Based on L = 1, 000, 000 random arrangements of the observed data, the Monte Carlo resampling probability value of \(\mathfrak {R}_{\text{o}} = +0.2272\) is

$$\displaystyle \begin{aligned} P \big( \mathfrak{R} \geq \mathfrak{R}_{\text{o}}|H_{0} \big) = \frac{\text{number of }\mathfrak{R}\text{ values } \geq \mathfrak{R}_{\text{o}}}{L} = 0.0046\;, \end{aligned}$$

where \(\mathfrak {R}_{\text{o}}\) denotes the observed value of \(\mathfrak {R}\). For comparison, the exact probability value of \(\mathfrak {R}_{\text{o}} = +0.2272\) based on M = N! = 12! = 479, 001, 600 possible arrangements of the data listed in Table 7.15 is P = 0.5630×10^−2.

The results of the comparison of OLS and LAD analyses with 4 and 40 hours of housework by the husband in family “L” are summarized in Table 7.16. The value of 40 hours of housework by the husband in family “L” is, by any definition, an extreme value. It is six times the mean of \(\bar {y} = 6.3333\) and three standard deviations above the mean. It is readily apparent that the extreme value of 40 hours had a profound impact on the results of the OLS analysis. The OLS multiple correlation coefficient more than doubled from \(R_{\text{o}}^{2} = 0.2539\) to \(R_{\text{o}}^{2} = 0.5786\), a difference of R 2 = 0.3247, and the corresponding probability value decreased from P = 0.2680 to P = 0.0152, a difference of P = 0.2528. The impact of 40 hours of housework on the LAD analysis is more modest with the LAD chance-corrected measure of agreement increasing only slightly from \(\mathfrak {R}_{\text{o}} = 0.1706\) to \(\mathfrak {R}_{\text{o}} = 0.2272\), a difference of \(\mathfrak {R} = 0.0566\), and the probability value decreasing from P = 0.0192 to P = 0.0046, a difference of only P = 0.0146.

Table 7.16 Comparison of OLS and LAD analyses for the data given in Table 7.14 with 4 hours of housework for the husband in family L and the data given in Table 7.15 with 40 hours of housework for the husband in family L

7.5 Fisher’s r xy to z Transformation

In order to attach a probability statement to inferences about the Pearson product-moment correlation coefficient, it is necessary to know the sampling distribution of a statistic that relates the sample correlation coefficient, r xy, to the population parameter, ρ xy. Because − 1.0 ≤ r xy ≤ +1.0, the sampling distribution of statistic r xy is asymmetric whenever ρ xy ≠ 0.0. Given two random variables that follow the bivariate normal distribution with population parameter ρ xy, the sampling distribution of statistic r xy approaches normality as the sample size increases; however, it converges very slowly for |ρ xy|≥ 0.6, even with samples as large as N = 400 [7, p. xxxiii]. Fisher [13, 14] obtained the basic distribution of r xy and showed that, when bivariate normality is assumed, a logarithmic transformation of r xy (henceforth referred to as the Fisher z transform),

$$\displaystyle \begin{aligned} z = \frac{1}{2} \ln \left( \frac{1+r_{xy}}{1-r_{xy}} \right) = \tanh^{-1}(r_{xy})\;, \end{aligned}$$

becomes normally distributed with a mean of approximately

$$\displaystyle \begin{aligned} \frac{1}{2} \ln \left( \frac{1+\rho_{xy}}{1-\rho_{xy}} \right) = \tanh^{-1}(\rho_{xy}) \end{aligned}$$

and the standard error approaches

$$\displaystyle \begin{aligned} \frac{1}{\sqrt{N-3}} \end{aligned}$$

as N → ∞.

The Fisher r xy to z transform is presented in most textbooks and is available in a wide array of statistical software packages. In this section, the precision and accuracy of the Fisher z transform are examined for a variety of bivariate distributions, sample sizes, and values of ρ xy [5]. If ρ xy ≠ 0.0 and the distribution is not bivariate normal, then the desired properties of the Fisher z transform generally fail.

There are two general applications of the Fisher z transform. The first application is the computation of confidence limits for ρ xy and the second is the testing of hypotheses about specified values of ρ xy ≠ 0.0. The second application is more tractable than the first because a hypothesized value of ρ xy is available. The next part of this section describes the bivariate distributions to be examined, followed by an exploration of confidence intervals and an examination of hypothesis testing. The last part of the section provides some general conclusions about the propriety of uncritically using the Fisher z transform in actual research.

7.5.1 Distributions

Seven bivariate distributions are utilized to test the Fisher z transform. In addition, two related methods by Gayen [17] and Jeyaratnam [22] are also examined. The Gayen and Jeyaratnam techniques are characterized by simplicity, accuracy, and ease of use. For other interesting approaches, see David [7]; Hotelling [21]; Kraemer [25]; Liu, Woodward, and Bonett [28]; Mudholkar and Chaubey [41]; Pillai [45]; Ruben [48]; and Samiuddin [49].

7.5.1.1 Normal Distribution

The density function of the standardized normal, N(0, 1), distribution is given by

$$\displaystyle \begin{aligned} f(x) = (2\pi)^{-1/2} \exp(-x^{2}/2)\;. \end{aligned}$$

7.5.1.2 Generalized Logistic Distribution

The density function of the generalized logistic (GL) distribution is given by

$$\displaystyle \begin{aligned} f(x) = \big[ \exp(\theta x)/\theta \big]^{1/\theta} \big[ 1+ \exp(\theta x)/\theta \big]^{-(\theta+1)/\theta} \end{aligned}$$

for θ > 0 [34]. The generalized logistic distribution is positively skewed for θ < 1 and negatively skewed for θ > 1. When θ = 1.0, GL(θ) is a logistic distribution that closely resembles the normal distribution, with somewhat lighter tails. When θ = 0.10, GL(θ) is a generalized logistic distribution with positive skewness. When θ = 0.01, GL(θ) is a generalized logistic distribution with even greater positive skewness.

7.5.1.3 Symmetric Kappa Distribution

The density function of the symmetric kappa (SK) distribution is given by

$$\displaystyle \begin{aligned} f(x) = 0.5 \lambda^{-1/\lambda}\left( 1+|x|{}^{\lambda}/\lambda \right)^{-(\lambda+1)/\lambda} \end{aligned}$$

for λ > 0 [34, 35]. The shape of the symmetric kappa distribution ranges from an exceedingly heavy-tailed distribution as λ approaches zero to a uniform distribution as λ goes to infinity. When λ = 2, SK(λ) is a peaked, heavy-tailed distribution, identical to Student’s t distribution with 2 degrees of freedom. Thus, the variance of SK(2) does not exist. When λ = 3, SK(λ) is also a heavy-tailed distribution, but the variance does exist. When λ = 25, SK(λ) is a loaf-shaped distribution resembling a uniform distribution with the addition of very light tails. These distributions provide a variety of populations from which to sample and evaluate the Fisher z transformation and the Gayen [17] and Jeyaratnam [22] modifications.

Seven bivariate correlated distributions were constructed in the following manner. Let x and y be independent identically distributed univariate random variables from each of seven univariate distributions, i.e., N(0, 1), GL(1.0), GL(0.1), GL(0.01), SK(2), SK(3), and SK(25), and define the correlated random variables U 1 and U 2 of each bivariate distribution by

$$\displaystyle \begin{aligned} U_{1} = x(1-\rho_{xy}^{2})^{1/2}+\rho_{xy}y \end{aligned}$$

and U 2 = y, where ρ xy is the desired Pearson product-moment correlation coefficient of random variables U 1 and U 2. Then a Monte Carlo procedure obtains random samples, corresponding to x and y, from the normal, generalized logistic, and symmetric kappa distributions.
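For readers who wish to reproduce the simulations, the construction of U 1 and U 2 is straightforward to code. The following sketch (Python with NumPy; the chapter itself presents no code, so the function names are illustrative) uses a standard normal marginal, but a sampler for any of the seven marginals could be substituted.

```python
import numpy as np

def correlated_pair(rho, n, marginal, rng):
    """Return U1, U2 built from iid draws x, y of `marginal`,
    with U1 = x*sqrt(1 - rho**2) + rho*y and U2 = y."""
    x = marginal(n, rng)
    y = marginal(n, rng)
    u1 = x * np.sqrt(1.0 - rho**2) + rho * y
    u2 = y
    return u1, u2

def standard_normal(n, rng):
    # N(0, 1) marginal; samplers for the GL and SK marginals of
    # Sect. 7.5.1 could be plugged in here instead (not shown).
    return rng.standard_normal(n)

rng = np.random.default_rng(1)
u1, u2 = correlated_pair(rho=0.60, n=100_000, marginal=standard_normal, rng=rng)
print(np.corrcoef(u1, u2)[0, 1])   # close to 0.60 for the normal marginal
```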

7.5.2 Confidence Intervals

In this section, Monte Carlo confidence intervals are based on the seven distributions: N(0, 1), GL(1.0), GL(0.1), GL(0.01), SK(2), SK(3), and SK(25). Each simulation is based on L = 1, 000, 000 bivariate random samples, U 1 and U 2, of size N = 10, 20, 40, and 80 for ρ xy = 0.00, + 0.40, + 0.60, and + 0.80 with 1 − α = 0.90, 0.95, and 0.99. Confidence intervals obtained from two methods are considered. The first confidence interval is based on the Fisher z transform and is defined by

$$\displaystyle \begin{aligned} \begin{array}{rcl} \tanh \left[ \tanh^{-1}(r_{xy})-\frac{z_{\alpha/2}}{\sqrt{N-3}} \right] \leq \rho_{xy} \leq \tanh \left[ \tanh^{-1}(r_{xy})+\frac{z_{\alpha/2}}{\sqrt{N-3}} \right]\;, \end{array} \end{aligned} $$

where z α∕2 is the upper 0.50α probability point of the N(0, 1) distribution. The second confidence interval is based on a method proposed by Jeyaratnam [22] and is defined by

$$\displaystyle \begin{aligned} \frac{r_{xy}-w}{1-r_{xy}w} \leq \rho_{xy} \leq \frac{r_{xy}+w}{1+r_{xy}w}\;,\end{aligned} $$

where

$$\displaystyle \begin{aligned} w = \frac{\big( t_{\alpha/2,N-2} \big)\big/\sqrt{N-2}}{\Big[ 1+\big( t_{\alpha/2,N-2} \big)^{2}\big/(N-2) \,\Big]^{1/2}} \end{aligned}$$

and t α∕2,N−2 is the upper 0.50α probability point of Student’s t distribution with N − 2 degrees of freedom.
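Both intervals are simple enough to compute directly. A minimal sketch, assuming NumPy and SciPy for the normal and Student's t quantiles (function names are illustrative, not from the text):

```python
import numpy as np
from scipy.stats import norm, t as t_dist

def fisher_ci(r, n, alpha):
    """1 - alpha confidence interval for rho based on the Fisher z transform."""
    z = np.arctanh(r)
    half = norm.ppf(1.0 - alpha / 2.0) / np.sqrt(n - 3)
    return np.tanh(z - half), np.tanh(z + half)

def jeyaratnam_ci(r, n, alpha):
    """1 - alpha confidence interval for rho based on Jeyaratnam's w."""
    tq = t_dist.ppf(1.0 - alpha / 2.0, df=n - 2)
    w = (tq / np.sqrt(n - 2)) / np.sqrt(1.0 + tq**2 / (n - 2))
    return (r - w) / (1.0 - r * w), (r + w) / (1.0 + r * w)

print(fisher_ci(0.60, 20, 0.05))
print(jeyaratnam_ci(0.60, 20, 0.05))
```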

The results of the Monte Carlo analyses are summarized in Tables 7.17, 7.18, 7.19, 7.20, 7.21, 7.22, 7.23, which contain simulated containment probability values for the seven bivariate distributions with specified nominal values of 1 − α (0.90, 0.95, 0.99), ρ xy (0.00, +0.40, +0.60, +0.80), and N (10, 20, 40, 80) for the Fisher (F) and Jeyaratnam (J) confidence intervals. Table 7.17 analyzes data obtained from the N(0, 1) distribution; Tables 7.18, 7.19, and 7.20 analyze data obtained from the generalized logistic distribution with θ = 1.0, 0.1, and 0.01, respectively; and Tables 7.21, 7.22, and 7.23 analyze data obtained from the symmetric kappa distribution with λ = 2, 3, and 25, respectively.

Table 7.17 Containment probability values for a bivariate N(0, 1) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.18 Containment probability values for a bivariate GL(1.0) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.19 Containment probability values for a bivariate GL(0.1) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.20 Containment probability values for a bivariate GL(0.01) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.21 Containment probability values for a bivariate SK(2) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.22 Containment probability values for a bivariate SK(3) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals
Table 7.23 Containment probability values for a bivariate SK(25) distribution with Fisher (F) and Jeyaratnam (J) 1 − α correlation confidence intervals

In each of the seven tables, the Monte Carlo containment probability values for a 1 − α confidence interval based on the Fisher z transform and a 1 − α confidence interval based on the Jeyaratnam technique were obtained from the same L = 1, 000, 000 bivariate random samples of size N drawn with replacement from the designated bivariate distribution characterized by the specified population correlation ρ xy. If the Fisher and Jeyaratnam transforms are appropriate for the simulated data, the containment probability values should agree with the nominal 1 − α values.
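A scaled-down version of one cell of such a simulation might look as follows; it uses the bivariate normal construction of Sect. 7.5.1, far fewer samples than L = 1,000,000, and only the Fisher interval, so the printed proportion is an illustration rather than a reproduction of the tabled values.

```python
import numpy as np
from scipy.stats import norm

def fisher_ci(r, n, alpha):
    z, half = np.arctanh(r), norm.ppf(1 - alpha / 2) / np.sqrt(n - 3)
    return np.tanh(z - half), np.tanh(z + half)

def containment_probability(rho, n, alpha, trials, rng):
    """Proportion of simulated samples whose Fisher interval contains rho."""
    hits = 0
    for _ in range(trials):
        x, y = rng.standard_normal(n), rng.standard_normal(n)
        u1 = x * np.sqrt(1 - rho**2) + rho * y    # construction of Sect. 7.5.1
        lo, hi = fisher_ci(np.corrcoef(u1, y)[0, 1], n, alpha)
        hits += lo <= rho <= hi
    return hits / trials

rng = np.random.default_rng(2)
print(containment_probability(rho=0.60, n=20, alpha=0.05, trials=10_000, rng=rng))
```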

Some general observations can be made about the Monte Carlo results contained in Tables 7.17 through 7.23. First, in each of the tables there is little difference between the Fisher and Jeyaratnam Monte Carlo containment probability values and both techniques provide values close to the nominal 1 − α values for the N(0, 1) distribution analyzed in Table 7.17 with any value of ρ xy and for any of the other distributions analyzed in Tables 7.18 through 7.23 when ρ xy = 0.00. Second, for the skewed and heavy-tailed distributions, i.e., GL(0.1), GL(0.01), SK(2), and SK(3), with N held constant, the differences between the Monte Carlo containment probability values and the nominal 1 − α values become greater as |ρ xy| increases. Third, the differences between the Monte Carlo containment probability values and the nominal 1 − α values increase with increasing N and |ρ xy| > 0.00 for all the distributions except N(0, 1) and SK(25). This is especially evident with the skewed and heavy-tailed distributions GL(0.1), GL(0.01), SK(2), and SK(3).

7.5.3 Hypothesis Testing

In this section, Monte Carlo tests of hypotheses are based on the same seven distributions: N(0, 1), GL(1.0), GL(0.1), GL(0.01), SK(2), SK(3), and SK(25). Each simulation is based on L = 1, 000, 000 bivariate random samples of size N = 20 and N = 80 for ρ xy = 0.00 and ρ xy = +0.60 and compared to seven nominal upper-tail probability values of P = 0.99, 0.90, 0.75, 0.50, 0.25, 0.10, and 0.01. Two tests of ρ xy ≠ 0.00 are considered. The first test is based on the Fisher z transform and uses the standardized test statistic given by

$$\displaystyle \begin{aligned} T = \frac{z-\mu_{z}}{\sigma_{z}}\;, \end{aligned}$$

where

$$\displaystyle \begin{aligned} z = \tanh^{-1}(r_{xy})\;, \quad \mu_{z} = \tanh^{-1}(\rho_{xy})\;, \;\; \mbox{and} \quad \sigma_{z} =\frac{1}{\sqrt{N-3}}\;. \end{aligned}$$

The second test is based on corrected values proposed by Gayen [17], where

$$\displaystyle \begin{aligned} z = \tanh^{-1}(r_{xy})\;, \end{aligned}$$
$$\displaystyle \begin{aligned} \mu_{z} = \tanh^{-1}(\rho_{xy})+\frac{\rho_{xy}}{2(N-1)}\left[ 1+\frac{5-\rho_{xy}^{2}}{4(N-1)} \right]\;, \end{aligned}$$

and

$$\displaystyle \begin{aligned} \sigma_{z} = \left\{ \frac{1}{N-1} \left[ 1+\frac{4-\rho_{xy}^{2}}{2(N-1)}+\frac{22-6\rho_{xy}^{2}-3\rho_{xy}^{4}}{6(N-1)^{2}} \right] \right\}^{1/2}\;. \end{aligned}$$
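Both standardized statistics follow directly from these expressions. A short sketch (illustrative names, not the authors' code) returns the Fisher and Gayen values of T for a sample correlation r xy, a hypothesized ρ xy, and a sample size N:

```python
import numpy as np

def fisher_statistic(r, rho, n):
    """T = (z - mu_z) / sigma_z with Fisher's large-sample moments."""
    z = np.arctanh(r)
    mu = np.arctanh(rho)
    sigma = 1.0 / np.sqrt(n - 3)
    return (z - mu) / sigma

def gayen_statistic(r, rho, n):
    """T = (z - mu_z) / sigma_z with Gayen's corrected moments."""
    z = np.arctanh(r)
    mu = np.arctanh(rho) + rho / (2 * (n - 1)) * (1 + (5 - rho**2) / (4 * (n - 1)))
    var = (1 / (n - 1)) * (1 + (4 - rho**2) / (2 * (n - 1))
                           + (22 - 6 * rho**2 - 3 * rho**4) / (6 * (n - 1)**2))
    return (z - mu) / np.sqrt(var)

print(fisher_statistic(0.70, 0.60, 80), gayen_statistic(0.70, 0.60, 80))
```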

The results of the Monte Carlo analyses are summarized in Tables 7.24, 7.25, 7.26, 7.27, 7.28, 7.29, 7.30, which contain simulated upper-tail probability values for the seven distributions with specified nominal probability values of P (0.99, 0.95, 0.75, 0.50, 0.25, 0.10, 0.01), ρ xy (0.00, +0.60), and N (20, 80) for the Fisher (F) and Gayen (G) test statistics. Table 7.24 analyzes data obtained from the N(0, 1) distribution; Tables 7.25, 7.26, and 7.27 analyze data obtained from the generalized logistic distribution with θ = 1.0, 0.1, and 0.01, respectively; and Tables 7.28, 7.29, and 7.30 analyze data obtained from the symmetric kappa distribution with λ = 2, 3, and 25, respectively.

Table 7.24 Upper-tail probability values compared with nominal values (P) for a bivariate N(0, 1) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = 0.60
Table 7.25 Upper-tail probability values compared with nominal values (P) for a bivariate GL(1.0) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60
Table 7.26 Upper-tail probability values compared with nominal values (P) for a bivariate GL(0.1) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60
Table 7.27 Upper-tail probability values compared with nominal values (P) for a bivariate GL(0.01) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60
Table 7.28 Upper-tail probability values compared with nominal values (P) for a bivariate SK(2) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60
Table 7.29 Upper-tail probability values compared with nominal values (P) for a bivariate SK(3) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60
Table 7.30 Upper-tail probability values compared with nominal values (P) for a bivariate SK(25) distribution with Fisher (F) and Gayen (G) tests of hypotheses on ρ xy = 0.00 and ρ xy = +0.60

In each table, the Monte Carlo upper-tail probability values for tests of hypotheses based on the Fisher and Gayen approaches were obtained from the same L = 1, 000, 000 bivariate random samples of size N drawn with replacement from the designated bivariate distribution characterized by the specified population correlation ρ xy. If the Fisher [14] and Gayen [17] techniques are appropriate for the simulated data, the upper-tail probability values should agree with the nominal upper-tail values, P.

Considered as a set, some general statements can be made about the Monte Carlo results contained in Tables 7.24 through 7.30. First, both the Fisher z transform and the Gayen correction provide very satisfactory results for the N(0, 1) distribution analyzed in Table 7.24 with any value of ρ xy and for any of the other distributions analyzed in Tables 7.25 through 7.30 when ρ xy = 0.00. Second, in general the Monte Carlo upper-tail probability values obtained with the Gayen correction are better than those obtained with the uncorrected Fisher z transform, especially near P = 0.50. Where differences exist, the Fisher z transform is somewhat better than the Gayen correction with P > 0.75 and the Gayen correction performs better when P < 0.75. Third, discrepancies between the Monte Carlo upper-tail probability values and the nominal probability values are noticeably larger for N = 80 than for N = 20 and for ρ xy = 0.60 than for ρ xy = 0.00, especially for the skewed and heavy-tailed distributions, i.e., GL(0.1), GL(0.01), SK(2), and SK(3). Fourth, the Monte Carlo upper-tail probability values in Tables 7.24 through 7.30 are consistently closer to the nominal values for ρ xy = 0.00 than for ρ xy = +0.60.

To illustrate the difference in results among the seven distributions, consider the first and last values in the last column in each table, i.e., the two Gayen values corresponding to P = 0.99 and P = 0.01 for N = 80 and ρ xy = +0.60 in Tables 7.24 to 7.30, inclusive. If an investigator were to test the null hypothesis H 0: ρ xy = +0.60 with a two-tailed test at α = 0.02, then given the N(0, 1) distribution analyzed in Table 7.24, the investigator would reject the null hypothesis at a rate of 0.0202 or about 2.02% of the time, i.e., 1.0000 − 0.9899 + 0.0101 = 0.0202, which is very close to α = 0.02. For the light-tailed GL(1.0) or generalized logistic distribution analyzed in Table 7.25, the investigator would reject H 0: ρ xy = +0.60 at a rate of 0.0339 or about 3.39% of the time, i.e., 1.0000 − 0.9838 + 0.0177 = 0.0339, compared with the specified α = 0.02. For the skewed GL(0.1) distribution analyzed in Table 7.26, the investigator would reject H 0: ρ xy = +0.60 at a rate of 0.0446 or about 4.46% of the time, and for the GL(0.01) distribution analyzed in Table 7.27, which has a more pronounced skewness than GL(0.1), the rejection rate is 0.0476 or about 4.76%, compared to α = 0.02. The heavy-tailed distributions, SK(2) and SK(3), analyzed in Tables 7.28 and 7.29, respectively, yield rejection rates of 0.3629 and 0.1346, respectively, which are not the least bit close to α = 0.02. Finally, the very light-tailed distribution, SK(25), analyzed in Table 7.30 yields a reversal with a very conservative rejection rate of 0.0096, compared to α = 0.02.

7.5.4 Discussion

The Fisher z transform of the sample correlation coefficient, r xy, is widely used in a variety of disciplines for both estimating population ρ xy values and for testing hypothesized values of ρ xy ≠ 0.00. The transform is presented in most textbooks and is a standard feature of many statistical software packages. The assumptions underlying the use of the Fisher z transform are (1) a simple random sample drawn with replacement from (2) a bivariate normal distribution. It is commonly believed that the Fisher z transform is robust to non-normality. For example, in 1929 Karl Pearson observed:

[T]he normal bivariate surface can be mutilated and distorted to a remarkable degree without affecting the frequency distribution of r in samples as small as 20 [43, p. 357].

Given correlated non-normal bivariate distributions, these Monte Carlo analyses demonstrate that the Fisher z transform is not at all robust.

In general, while the Fisher z transform and the alternative techniques proposed by Gayen [17] and Jeyaratnam [22] provide accurate results for a bivariate normal distribution with any value of ρ xy and for non-normal bivariate distributions when ρ xy = 0.0, serious problems surface with non-normal bivariate distributions when |ρ xy| > 0.0. The results for the light-tailed SK(25) distribution are, in general, slightly conservative when |ρ xy| > 0.0; cf. Liu, Woodward, and Bonett [28, p. 508]. This is usually not seen as a serious problem in practice, as conservative results imply possible failure to reject the null hypothesis and a potential increase in type II error. In comparison, the results for the heavy-tailed distributions, SK(2) and SK(3), and the skewed distributions, GL(0.1) and GL(0.01), are quite liberal when |ρ xy| > 0.0. Also, GL(1.0) is a light-tailed distribution that yields slightly liberal results. Liberal results are much more serious than conservative results, as they imply possible rejection of the null hypothesis and a potential increase in type I error.

Most surprisingly, from a statistical perspective, for the heavy-tailed and skewed distributions, small samples provide better estimates than large samples. Table 7.31 extends the analyses of Tables 7.19, 7.20, 7.21, and 7.22 to larger sample sizes. In Table 7.31 the investigation is limited to Monte Carlo containment probability values obtained from the Fisher z transform for the skewed bivariate distributions based on GL(0.1) and GL(0.01) and for the heavy-tailed bivariate distributions based on SK(2) and SK(3), with ρ xy = 0.00 and ρ xy = +0.60, and for N = 10, 20, 40, 80, 160, 320, and 640. Inspection of Table 7.31 confirms that the trend observed in Tables 7.19 through 7.22 continues with larger sample sizes, producing increasingly smaller containment probability values with increasing N for |ρ xy| > 0.00, where ρ xy = +0.60 is considered representative of larger ρ xy values.

Table 7.31 Containment probability values for the bivariate GL(0.1), GL(0.01), SK(2), and SK(3) distributions with Fisher (F) 1 − α correlation confidence intervals

The impact of large sample sizes is most pronounced in the heavy-tailed bivariate distribution based on SK(2) and the skewed bivariate distribution based on GL(0.01) where, with ρ xy = +0.60, the divergence between the containment probability values and the nominal 1 − α values for N = 10 and N = 640 is quite extreme. For example, SK(2) with 1 − α = 0.90, ρ xy = +0.60, and N = 10 yields a containment probability value of P = 0.7487, whereas N = 640 for this case yields a containment probability value of P = 0.2677, compared with 1 − α = 0.90. Obviously, large samples have a greater chance of selecting rare extreme values than small samples. Consequently, the Monte Carlo containment probability values become worse with increasing sample size when heavy-tailed distributions are encountered.

It is clear that the Fisher z transform provides very good results for the bivariate normal distribution and any of the other distributions when ρ xy = 0.00. However, if a distribution is not bivariate normal and ρ xy > 0.00, then the Fisher z random variable does not follow a normal distribution. Geary [18, p. 241] admonished: “Normality is a myth; there never was, and never will be, a normal distribution.” In the absence of bivariate normality and in the presence of correlated heavy-tailed bivariate distributions, such as those contaminated by extreme values, or correlated skewed bivariate distributions, the Fisher z transform and related techniques can yield highly inaccurate results.

Given that normally distributed populations are rarely encountered in actual research situations [18, 33] and that both heavy-tailed symmetrical distributions and heavy-tailed skewed distributions are prevalent in much research, considerable caution should be exercised when using the Fisher z transform or related techniques such as those proposed by Gayen [17] and Jeyaratnam [22], as these methods clearly are not robust to deviations from normality when |ρ xy|≠ 0.0. In general, there is no easy answer to this problem. However, a researcher cannot simply ignore a problem just because it is annoying. Unfortunately, given a non-normal population with ρ xy ≠ 0.0, there appear to be no published alternative tests of significance or viable options for the construction of confidence intervals.

Finally, to paraphrase a line from Thompson regarding the use of tiltmeters in volcanology [53, p. 258],

1. Do not use the Fisher z transformation.

2. If you do use it, don't believe it.

3. If you do believe it, don't publish it.

4. If you do publish it, don't be the first author.

7.6 Point-Biserial Linear Correlation

The point-biserial correlation coefficient measures the association between a dichotomous variable and an interval-level variable. Applications of the point-biserial correlation abound in fields such as education and educational psychology. The point-biserial correlation may be thought of simply as the Pearson product-moment correlation between an interval-level variable and a variable with two disjoint, unordered categories.

7.6.1 Example

To illustrate the point-biserial correlation coefficient, consider the dichotomous data listed in Table 7.32 for N = 13 subjects where variable x is a dichotomous variable coded (0, 1) and variable y is an interval-level variable. The point-biserial correlation is usually computed as

$$\displaystyle \begin{aligned} r_{pb} = \frac{\bar{y}_{1}-\bar{y}_{0}}{s_{y}}\sqrt{\frac{n_{0}n_{1}}{N(N-1)}}\;, \end{aligned}$$

where n 0 and n 1 denote the number of y values coded 0 and 1, respectively, N = n 0 + n 1, \(\bar {y}_{0}\) and \(\bar {y}_{1}\) denote the means of the y values coded 0 and 1, respectively, and s y is the sample standard deviation of the y values given by

$$\displaystyle \begin{aligned} s_{y} = \sqrt{\frac{1}{N-1}\sum_{i=1}^{N}\big( y_{i}-\bar{y} \big)^{2}}\;. \end{aligned}$$
Table 7.32 Example bivariate data for point-biserial correlation on N = 13 subjects

For the data listed in Table 7.32, n 0 = 6, n 1 = 7, \(\bar {y}_{0} = 20.3333\), \(\bar {y}_{1} = 24.1429\), s y = 4.2728, and the point-biserial correlation is

$$\displaystyle \begin{aligned} r_{pb} = \frac{\bar{y}_{1}-\bar{y}_{0}}{s_{y}}\sqrt{\frac{n_{0}n_{1}}{N(N-1)}} = \frac{24.1429-20.3333}{4.2728}\sqrt{\frac{(6)(7)}{13(13-1)}} = +0.4626\;. \end{aligned}$$

However, r pb can also be calculated simply as the Pearson product-moment correlation (r xy) between dichotomous variable x and interval variable y. For the data listed in Table 7.32, N = 13,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N}x_{i} = \sum_{i=1}^{N}x_{i}^{2} = 7\;, \quad \sum_{i=1}^{N}y_{i} = 291\;, \quad \sum_{i=1}^{N}y_{i}^{2} = 6{,}733\;, \quad \sum_{i=1}^{N}x_{i}y_{i} = 169\;, \end{aligned}$$

and

$$\displaystyle \begin{aligned} r_{pb} = r_{xy} = \frac{N\displaystyle\sum_{i=1}^{N}x_{i}y_{i}-\displaystyle\sum_{i=1}^{N}x_{i}\displaystyle\sum_{i=1}^{N}y_{i}}{\sqrt{\left[ N\displaystyle\sum_{i=1}^{N}x_{i}^{2}-\left( \displaystyle\sum_{i=1}^{N}x_{i} \right)^{2} \right]\left[ N\displaystyle\sum_{i=1}^{N}y_{i}^{2}-\left( \displaystyle\sum_{i=1}^{N}y_{i} \right)^{2} \right]}} = \frac{13(169)-(7)(291)}{\sqrt{\big[ 13(7)-(7)^{2} \big]\big[ 13(6{,}733)-(291)^{2} \big]}} = +0.4626\;. \end{aligned}$$

Approaching the calculation of the probability value from a product-moment perspective, there are

$$\displaystyle \begin{aligned} M = N! = 13! = 6{,}227{,}020{,}800 \end{aligned}$$

possible, equally-likely arrangements in the reference set of all permutations of the observed bivariate data, making an exact permutation analysis impractical. Let r o denote the observed value of r pb. Then, based on L = 1, 000, 000 random arrangements of the observed data under the null hypothesis, there are 121,667 |r pb| values equal to or greater than |r o| = 0.4626, yielding a Monte Carlo resampling two-sided probability value of P = 121, 667∕1, 000, 000 = 0.121667.
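The resampling procedure itself is little more than repeated shuffling of one variable against the other. A minimal sketch is given below; the data vector y is a placeholder rather than the values of Table 7.32, so the printed probability value will not match the one reported above.

```python
import numpy as np

def resampling_pvalue(x, y, n_resamples, rng):
    """Two-sided Monte Carlo permutation p-value for the point-biserial
    (Pearson) correlation between a 0/1 variable x and an interval variable y."""
    r_obs = abs(np.corrcoef(x, y)[0, 1])
    count = 0
    for _ in range(n_resamples):
        r = np.corrcoef(rng.permutation(x), y)[0, 1]
        count += abs(r) >= r_obs - 1e-12
    return count / n_resamples

# Illustrative data only -- not the values of Table 7.32.
rng = np.random.default_rng(3)
x = np.array([0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1])
y = rng.normal(loc=22, scale=4, size=13).round(0)
print(resampling_pvalue(x, y, n_resamples=100_000, rng=rng))
```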

In general, L = 1, 000, 000 ensures three decimal places of accuracy. However, it requires an increase of two orders of magnitude, i.e., L = 100, 000, 000, to ensure four decimal places of accuracy [23]. Based on L = 100, 000, 000 random arrangements of the observed bivariate data, the two-sided Monte Carlo resampling probability value of r pb = +0.4626 to six decimal places is P = 12, 121, 600∕100, 000, 000 = 0.121216.

However, because variable x is composed of only two categories, an alternative procedure exists for establishing the probability value of r pb. The relationships between r pb and Student’s two-sample t test are

$$\displaystyle \begin{aligned} r_{pb} = \sqrt{\frac{t^{2}}{t^{2}+N-2}} \quad \mbox{and} \quad t = \frac{r_{pb}\sqrt{N-2}}{\sqrt{1-r_{pb}^{2}}}\;. \end{aligned}$$

Thus, the probability value for a specified point-biserial correlation coefficient can be calculated much more efficiently as the probability value of a two-sample t test with N − 2 degrees of freedom. Consider the data in Table 7.32 rearranged into two groups coded 0 and 1 as in Table 7.33.

Table 7.33 Example data on N = 13 subjects for Student’s t test

For the observed data listed in Table 7.33, Student’s t test statistic is

$$\displaystyle \begin{aligned} t = \frac{r_{pb}\sqrt{N-2}}{\sqrt{1-r_{pb}^{2}}} = \frac{+0.4626 \sqrt{13-2}}{\sqrt{1-(+0.4626)^{2}}} = +1.7307\;. \end{aligned}$$

For the data listed in Table 7.33, there are only

$$\displaystyle \begin{aligned} M = \frac{N!}{n_{0}!\;n_{1}!} = \frac{13!}{6!\;7!} = 1{,}716 \end{aligned}$$

possible, equally-likely arrangements in the reference set of all permutations of the observed scores, compared with

$$\displaystyle \begin{aligned} M = N! = 13! = 6{,}227{,}020{,}800 \end{aligned}$$

in the initial set, making an exact permutation analysis possible. If all arrangements of the N = 13 observed scores occur with equal chance, the exact two-sided probability value of t = +1.7307 to six places computed on the M = 1, 716 possible arrangements of the observed data with n 0 = 6 and n 1 = 7 preserved for each arrangement is 208∕1, 716 = 0.121212.

The Monte Carlo resampling probability value of P = 0.121667 based on L = 1, 000, 000 and the Monte Carlo resampling probability value of P = 0.121216 based on L = 100, 000, 000 both compare favorably with the exact probability value of P = 0.121212. For comparison, the two-sided probability value of t = +1.7307 based on Student’s t distribution with N − 2 = 13 − 2 = 11 degrees of freedom is P = 0.111421.

7.6.2 Problems with the Point-Biserial Coefficient

Whenever a dichotomous variable is correlated with an interval-level variable, as in point-biserial correlation, there are potential problems with proper norming between ± 1. In brief, it is not possible to obtain a perfect correlation, positive or negative, between a dichotomous variable and a continuous variable [42, p. 145]. The reason is simply that it is not possible for a dichotomous variable and a continuous variable to have the same shape, as illustrated in Fig. 7.8 where a dichotomous variable (x) is correlated with a continuous variable (y) that follows a uniform distribution, i.e., y = 1, 2, …, 10. In order to achieve a perfect correlation of r pb = +1.00, it would be necessary for all the scores at the two points of variable x (x = 0 and x = 1) to fall exactly on two points on variable y, as depicted in Fig. 7.9 where the larger black circles represent a cluster of points at x = 0 and x = 1. Since variable y is assumed to be continuous, this is not possible. Consequently, values of variable y at either of the two points on variable x (the dichotomous variable) must correspond to a range of points on variable y (the continuous variable).

Fig. 7.8 Scatterplot of a uniform distribution of y values with the regression line overlaid

Fig. 7.9 Scatterplot of clusters of y values located at x = 0 and x = 1 with the regression line overlaid

As Jum Nunnally showed in 1978, the maximum value of r pb between a dichotomous variable and a normally distributed variable is approximately r pb = ±0.80, which occurs only when p = n 0∕N = 0.50 [42]. As p deviates from 0.50 in either direction, the maximum value of r pb is further reduced. Consequently, when p = 0.25 or p = 0.75, the maximum value of r pb is approximately r pb = ±0.75, and when p = 0.90 or p = 0.10, the maximum value of r pb is only approximately r pb = ±0.58.Footnote 5

The problem can be illustrated with a small empirical example. Table 7.34 contains 10 scores (1, 2, …, 10) with frequencies corresponding to an expanded binomial distribution, which approximates a normal distribution with N = 512. For the binomial data listed in Table 7.34 with p = 0.50,

$$\displaystyle \begin{aligned} \bar{y}_{0} = \left( \sum_{i=1}^{n_{0}}f_{i} \right)^{-1} \sum_{i=1}^{n_{0}}f_{i}y_{i} = \frac{1+18+108+336+630}{1+9+36+84+126} = 4.2695\;, \end{aligned}$$
$$\displaystyle \begin{aligned} \bar{y}_{1} = \left( \sum_{i=1}^{n_{1}}f_{i} \right)^{-1} \sum_{i=1}^{n_{1}}f_{i}y_{i} = \frac{756+588+288+81+10}{126+84+36+9+1} = 6.7305\;, \end{aligned}$$
$$\displaystyle \begin{aligned} s_{y} = \sqrt{\frac{\displaystyle\sum_{i=1}^{N}fy^{2}-\displaystyle\frac{\left( \displaystyle\sum_{i=1}^{N}fy \right)^{2}}{N}}{N-1}} = \sqrt{\frac{16{,}640-\displaystyle\frac{(2{,}816)^{2}}{512}}{512-1}} = 1.5015\;, \end{aligned}$$

and

$$\displaystyle \begin{aligned} r_{pb} = \frac{\bar{y}_{1}-\bar{y}_{0}}{s_{y}}\sqrt{\frac{n_{0}n_{1}}{N(N-1)}} = \frac{6.7305-4.2695}{1.5015}\sqrt{\frac{(256)(256)}{512(512-1)}} = +0.8203\;, \end{aligned}$$

which approximates Nunnally’s estimate of r pb = +0.80.

Table 7.34 Example binomial distribution on N = 512 subjects with p = 0.50

Table 7.35 illustrates a binomial distribution with N = 512 and p ≃ 0.25, i.e.,

$$\displaystyle \begin{aligned} p = \frac{1}{N}\sum_{i=1}^{n_{0}}f_{i} = \frac{1+9+36+84}{512} = 0.2539\;. \end{aligned}$$
Table 7.35 Example binomial distribution on N = 512 subjects with p ≃ 0.25

For the binomial data in Table 7.35 with p ≃ 0.25,

$$\displaystyle \begin{aligned} \bar{y}_{0} = \left( \sum_{i=1}^{n_{0}}f_{i} \right)^{-1} \sum_{i=1}^{n_{0}}f_{i}y_{i} = \frac{1+18+108+336}{1+9+36+84} = 3.5615\;,\end{aligned} $$
$$\displaystyle \begin{aligned} \bar{y}_{1} = \left( \sum_{i=1}^{n_{1}}f_{i} \right)^{-1} \sum_{i=1}^{n_{1}}f_{i}y_{i} = \frac{630+756+588+288+81+10}{126+126+84+36+9+1} = 6.1597\;,\end{aligned} $$

the standard deviation of the y values is unchanged at s y = 1.5015 and

$$\displaystyle \begin{aligned} r_{pb} = \frac{\bar{y}_{1}-\bar{y}_{0}}{s_{y}}\sqrt{\frac{n_{0}n_{1}}{N(N-1)}} = \frac{6.1597-3.5615}{1.5015}\sqrt{\frac{(130)(382)}{512(512-1)}} = +0.7539\;, \end{aligned}$$

which approximates Nunnally’s estimate of r pb = +0.75.

While it is not convenient to take exactly 10% of N = 512 cases, as arranged in Table 7.34, it is possible to take 9% of N = 512 cases. Thus,

$$\displaystyle \begin{aligned} p = \frac{1}{N} \sum_{i=1}^{n_{0}} f_{i} = \frac{1+9+36}{512} = \frac{46}{512} = 0.0898. \end{aligned}$$

Table 7.36 illustrates a binomial distribution with N = 512 and p = 0.09. For the binomial data listed in Table 7.36 with p ≃ 0.10,

$$\displaystyle \begin{aligned} \bar{y}_{0} = \left( \sum_{i=1}^{n_{0}}f_{i} \right)^{-1} \sum_{i=1}^{n_{0}}f_{i}y_{i} = \frac{1+18+108}{1+9+36} = 2.7609\;,\end{aligned} $$

the standard deviation of the y values is unchanged at s y = 1.5015 and

$$\displaystyle \begin{aligned} r_{pb} = \frac{\bar{y}_{1}-\bar{y}_{0}}{s_{y}}\sqrt{\frac{n_{0}n_{1}}{N(N-1)}} = \frac{5.7704-2.7609}{1.5015}\sqrt{\frac{(46)(466)}{512(512-1)}} = +0.5737\;, \end{aligned}$$

which approximates Nunnally’s estimate of r pb = +0.58.

Table 7.36 Example binomial distribution on N = 512 subjects with p = 0.09
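The three illustrations are easy to verify numerically: expand the binomial frequencies into N = 512 individual scores, dichotomize at the three cut points, and compute r pb as the Pearson correlation between the 0/1 indicator and the scores. A brief sketch (illustrative names):

```python
import numpy as np

freqs = [1, 9, 36, 84, 126, 126, 84, 36, 9, 1]     # frequencies of Table 7.34
y = np.repeat(np.arange(1, 11), freqs)              # N = 512 scores

def point_biserial(y, cut):
    """r_pb when scores <= cut are coded 0 and scores > cut are coded 1."""
    x = (y > cut).astype(float)
    return np.corrcoef(x, y)[0, 1]

for cut in (5, 4, 3):            # p = 0.50, p ~ 0.25, p ~ 0.09
    p = np.mean(y <= cut)
    # should agree with Nunnally's approximate maxima of 0.80, 0.75, and 0.58
    print(f"p = {p:.4f},  r_pb = {point_biserial(y, cut):+.4f}")
```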

7.7 Biserial Linear Correlation

Point-biserial correlation measures the degree of association between an interval-level variable and a dichotomous variable that is a true dichotomy, such as right and wrong, true and false, or left and right. On the other hand, biserial correlation measures the degree of association between an interval-level variable and a dichotomous variable that has been created from a variable that is assumed to be continuous and normally distributed, such as grades that have been dichotomized into “pass” and “fail” or weight that has been classified into “normal” and “obese.”Footnote 6 Biserial correlation has long been difficult to compute, requiring the ordinate of a unit-normal distribution. Some approximating methods have been suggested to simplify computation [16], but these are unnecessary with permutation methods.

Let x represent the dichotomous variable and y represent the continuous interval-level variable; then the biserial correlation coefficient is given by

$$\displaystyle \begin{aligned} r_{b} = \frac{(\bar{y}_{1}-\bar{y}_{0})pq}{uS_{y}}\;, \end{aligned}$$

where p and q = 1 − p denote the proportions of all y values coded 0 and 1, respectively, \(\bar {y}_{0}\) and \(\bar {y}_{1}\) denote the arithmetic means of the y values coded 0 and 1, respectively, S y is the standard deviation of the y values given byFootnote 7

$$\displaystyle \begin{aligned} S_{y} = \sqrt{ \frac{1}{N} \sum_{i=1}^{N}\big( y_{i}-\bar{y} \big)^{2}}\;, \end{aligned}$$

and u is the ordinate of the unit normal curve at the point of division between the p and q proportions under the curve given by

$$\displaystyle \begin{aligned} u = \frac{\exp(-z^{2}/2)}{\sqrt{2\pi}}\;. \end{aligned}$$

Written in raw terms without the p and q proportions,

$$\displaystyle \begin{aligned} r_{b} = \frac{(\bar{y}_{1}-\bar{y}_{0}) n_{0} n_{1}}{N^{2} u S_{y}}\;, \end{aligned}$$

where n 0 and n 1 denote the number of y values coded 0 and 1, respectively, and N = n 0 + n 1. The biserial correlation may also be written in terms of the point-biserial correlation coefficient,

$$\displaystyle \begin{aligned} r_{b} = \frac{r_{pb}\sqrt{pq}}{u} = \frac{r_{pb}\sqrt{n_{0}n_{1}}}{Nu}\;, \end{aligned}$$

where the point-biserial correlation coefficient is given by

$$\displaystyle \begin{aligned} r_{pb} = \frac{(\bar{y}_{1}-\bar{y}_{0})\sqrt{pq}}{S_{y}}\;.\end{aligned} $$

7.7.1 Example

To illustrate the calculation of the biserial correlation coefficient, consider the set of data given in Table 7.37 where N = 15 subjects are scored on interval-level variable y and are classified into types on dichotomous variable x. For the data listed in Table 7.37, n 0 = 6, n 1 = 9, p = 6∕15 = 0.40, q = 9∕15 = 0.60,

$$\displaystyle \begin{aligned} \bar{y}_{0} = \frac{1}{n_{0}}\sum_{i=1}^{n_{0}}y_{i} = \frac{12+15+11+18+13+11}{6} = 13.3333\;, \end{aligned}$$
$$\displaystyle \begin{aligned} \bar{y}_{1} = \frac{1}{n_{1}}\sum_{i=1}^{n_{1}}y_{i} = \frac{10+33+19+21+29+12+19+23+16}{9} = 20.2222\;, \end{aligned}$$
$$\displaystyle \begin{aligned} S_{y} = \sqrt{ \frac{1}{N} \sum_{i=1}^{N}\big( y_{i}-\bar{y} \big)^{2}} = \sqrt{\frac{649.7333}{15}} = 6.5815\;,\end{aligned} $$

the standard score that defines the lower p = 0.40 of the unit-normal distribution is z = −0.2533,

$$\displaystyle \begin{aligned} u = \frac{\exp(-z^{2}/2)}{\sqrt{2\pi}} = \frac{\exp[-(-0.2533)^{2}/2]}{\sqrt{(2)(3.1416)}} = 0.3863\;,\end{aligned} $$

and

$$\displaystyle \begin{aligned} r_{b} = \frac{(\bar{y}_{1}-\bar{y}_{0})pq}{u S_{y}} = \frac{(20.2222-13.3333)(0.40)(0.60)}{(0.3863)(6.5815)} = +0.6503\;. \end{aligned}$$
Table 7.37 Example biserial correlation data on N = 15 subjects

For the data listed in Table 7.37, the point-biserial correlation coefficient is

$$\displaystyle \begin{aligned} r_{pb} = \frac{(\bar{y}_{1}-\bar{y}_{0})\sqrt{pq}}{S_{y}} = \frac{(20.2222-13.3333)\sqrt{(0.40)(0.60)}}{6.5815 } = +0.5128\;, \end{aligned}$$

and in terms of the point-biserial correlation coefficient, the biserial correlation coefficient is

$$\displaystyle \begin{aligned} r_{b} = \frac{r_{pb}\sqrt{pq}}{u} = \frac{+0.5128 \sqrt{(0.40)(0.60)}}{0.3863} = +0.6503\;. \end{aligned}$$

For the N = 15 scores listed in Table 7.37, there are only

$$\displaystyle \begin{aligned} M = \frac{N!}{n_{0}!\;n_{1}!} = \frac{15!}{6!\;9!} = 5{,}005 \end{aligned}$$

possible, equally-likely arrangements in the reference set of all permutations of the observed scores, making an exact permutation analysis easily accomplished. Note that in the formula for the biserial correlation coefficient,

$$\displaystyle \begin{aligned} r_{b} = \frac{(\bar{y}_{1}-\bar{y}_{0})pq}{uS_{y}} \end{aligned}$$

p, q, u, and S y are invariant under permutation. Therefore, the permutation distribution can efficiently be based entirely on \(\bar {y}_{1}-\bar {y}_{0}\). If all M = 5, 005 arrangements of the N = 15 observed values occur with equal chance, the exact two-sided probability value of r b = +0.6503 computed on the M = 5, 005 possible arrangements of the observed data with n 0 = 6 and n 1 = 9 preserved for each arrangement is P = 263∕5, 005 = 0.0525.
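A sketch of that enumeration is given below, using the fifteen scores exactly as they appear in the calculations of \(\bar {y}_{0}\) and \(\bar {y}_{1}\) above; because p, q, u, and S y are invariant, it is sufficient to count the splits whose absolute mean difference equals or exceeds the observed 6.8889, which should reproduce the exact probability value P = 263∕5,005 = 0.0525.

```python
import numpy as np
from itertools import combinations

# Scores from Table 7.37: six subjects coded 0 followed by nine coded 1.
y0 = [12, 15, 11, 18, 13, 11]
y1 = [10, 33, 19, 21, 29, 12, 19, 23, 16]
y = np.array(y0 + y1, dtype=float)
n0, n = len(y0), len(y)

obs = abs(np.mean(y1) - np.mean(y0))
total = y.sum()
count = m = 0
for idx in combinations(range(n), n0):       # C(15, 6) = 5,005 splits
    s0 = y[list(idx)].sum()
    diff = abs((total - s0) / (n - n0) - s0 / n0)
    count += diff >= obs - 1e-12
    m += 1
print(count, "/", m, "=", count / m)         # exact two-sided probability value
```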

7.8 Intraclass Correlation

There exists an extensive, and controversial, literature on the intraclass correlation coefficient and its uses. The standard reference is by E.A. Haggard, Intraclass Correlation and the Analysis of Variance [20], although it has been heavily criticized for both its exposition and its statistical accuracy [51]. See also discussions by Bartko [3, 2, 4], Kraemer [27], Kraemer and Thiemann [26, pp. 32–34, 54–56], Shrout and Fleiss [50], von Eye and Mun [54, pp. 116-122], and Winer [56, pp. 289–296].

The intraclass correlation coefficient is most often used for measuring the level of agreement among judges. The coefficient represents concordance, where + 1 indicates perfect agreement and 0 indicates no agreement. While the maximum value of the intraclass correlation coefficient is + 1, the minimum is given by − 1∕(k − 1), where k is the number of judges. Thus, for k = 2 judges the lower limit is − 1, but for k = 3 judges the lower limit is − 1∕2, for k = 4 judges the lower limit is − 1∕3, for k = 5 judges the lower limit is − 1∕4, and so on, approaching zero as the number of judges increases. A number of authors recommend that when the intraclass correlation coefficient is negative, it should be interpreted as zero [4, 20, p. 71], but this seems intuitively wrong.

In many ways the intraclass correlation coefficient is a special form of the Pearson product-moment (interclass) correlation coefficient. Consider the small set of data given in Table 7.38 with N = 5 subjects and measurements on Height (x) and Weight (y). For the bivariate data given in Table 7.38 with N = 5 subjects,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N}x_{i} = 15\;, \;\; \sum_{i=1}^{N} x_{i}^{2} = 55\;, \;\; \sum_{i=1}^{N} y_{i} = 25\;, \;\; \sum_{i=1}^{N} y_{i}^{2} = 135\;, \;\; \sum_{i=1}^{N}x_{i}y_{i} = 83\;, \end{aligned}$$

and the Pearson product-moment correlation coefficient is r xy = +0.80.

Table 7.38 Example bivariate correlation data on N = 5 subjects

Now consider N = 5 sets of twins and let the variable under consideration be Weight, as in Table 7.39. The question is, which of the two variables labeled Weight is to be considered variable x and which is to be considered variable y? The problem can be solved by the intraclass correlation coefficient using double entries. The intraclass correlation between N pairs of observations on two variables, x and y, is by definition the ordinary Pearson product-moment (interclass) correlation between 2N pairs of observations, the first N of which are the original observations, and the second N the original observations with variable x replacing variable y and vice versa [15, Sect. 38]. Table 7.40 illustrates the arrangement. For the bivariate data given in Table 7.40 with 2N = 10 subjects,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N}x_{i} = \sum_{i=1}^{N} y_{i} = 40\;, \quad \sum_{i=1}^{N} x_{i}^{2} = \sum_{i=1}^{N} y_{i}^{2} = 190\;, \quad \sum_{i=1}^{N}x_{i}y_{i} = 166\;, \end{aligned}$$

and the intraclass correlation coefficient is r I = +0.20. Note that certain computational simplifications follow from the reversal of the variables, mainly because the reversals make the marginal distributions for the new variables the same and, therefore, the means and variances of the new variables are also the same [46, p. 20].

Table 7.39 Example bivariate correlation data on N = 5 twins
Table 7.40 Example bivariate correlation data on 2N = 10 twins

For cases with k > 2, the construction of a table suitable for calculating the intraclass correlation coefficient is more laborious. For example, given k = 3 judges, designate the three values for each subject as x 1, x 2, and x 3. The three values are entered into the table as six observations, each being one of the six permutations of two values that can be made from the original three values. That is, the three values x 1, x 2, and x 3 for each subject are entered into a bivariate correlation table with coordinates (x 1, x 2), (x 1, x 3), (x 2, x 3), (x 2, x 1), (x 3, x 1), and (x 3, x 2), and the Pearson product-moment correlation coefficient is computed for the resulting table, yielding the intraclass correlation coefficient.

To illustrate, consider the small data set given in Table 7.41 with N = 3 subjects and k = 3 judges. The permutations of the observations in Table 7.41 are listed in the correlation matrix given in Table 7.42. For the bivariate data listed in Table 7.42 with N = 18 paired entries,

$$\displaystyle \begin{aligned} \sum_{i=1}^{N}x_{i} = \sum_{i=1}^{N}y_{i} = 90\;, \quad \sum_{i=1}^{N} x_{i}^{2} = \sum_{i=1}^{N}y_{i}^{2} = 570\;, \quad \sum_{i=1}^{N}x_{i}y_{i} = 552\;,\end{aligned} $$

and the intraclass correlation coefficient obtained via the Pearson product-moment correlation coefficient is r I = r xy = +0.85.

Table 7.41 Example correlation data with k = 3 judges and N = 3 subjects
Table 7.42 Bivariate permutation matrix for k = 3 judges and N = 3 subjects
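The double-entry construction generalizes mechanically to any k: for each subject, every ordered pair of distinct ratings becomes one (x, y) row, and the ordinary Pearson correlation of the expanded table is the intraclass correlation. A sketch follows; the ratings matrix is hypothetical, since the individual values of Table 7.41 are not reproduced here.

```python
import numpy as np
from itertools import permutations

def intraclass_double_entry(ratings):
    """Pearson intraclass correlation via the double-entry construction:
    every ordered pair of a subject's k ratings becomes one (x, y) row."""
    xs, ys = [], []
    for row in ratings:
        for a, b in permutations(row, 2):
            xs.append(a)
            ys.append(b)
    return np.corrcoef(xs, ys)[0, 1]

# Hypothetical ratings for N = 3 subjects by k = 3 judges (not Table 7.41).
ratings = np.array([[2, 3, 4],
                    [5, 5, 6],
                    [7, 8, 8]])
print(intraclass_double_entry(ratings))
```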

Because of the complexity of double entries with k > 2, the intraclass correlation coefficient is usually formulated as an analysis of variance with Factor A treated as a random factor. There are actually three different intraclass correlation coefficients, and two forms of each [32, 50, 57]. The three types and two forms are designated as:

$$\displaystyle \begin{gathered} \text{ICC(1, 1)} \text{ and } \text{ICC(1,}\,k\text{)},\\ \text{ICC(2, 1)} \text{ and } \text{ICC(2,}\,k\text{)},\\ \text{ICC(3, 1)} \text{ and } \text{ICC(3,}\,k\text{)}. \end{gathered} $$

Case 1, Form 1: ICC(1, 1)

For Case 1, Form 1, there exists a pool of judges. For each subject, a researcher randomly samples k judges from the pool to evaluate that subject. The k judges who rate Subject 1 are not necessarily the same judges who rate Subject 2. To illustrate Case 1, Form 1, Table 7.43 lists example data for k = 4 judges (A) and N = 6 subjects (S).

Table 7.43 Example data for Case 1, Form 1, with N = 6 subjects (S) and k = 4 judges (A)

Now consider the data given in Table 7.43 as a one-way randomized-block analysis of variance, given in Table 7.44. For the summary data given in Table 7.44, let a indicate the number of levels of Factor A, then the sum-of-squares Total is

$$\displaystyle \begin{aligned} \mathit{SS}_{\text{Total}} = \sum_{i=1}^{N}x_{i}^{2}-\frac{\left( \:\displaystyle\sum_{i=1}^{N}x_{i} \right)^{2}}{Na} = 841-\frac{(127)^{2}}{(6)(4)} = 168.9583\;, \end{aligned}$$

the sum-of-squares Between Subjects (BS) is

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathit{SS}_{\text{BS}} = \frac{\displaystyle\sum_{i=1}^{N}T_{S_{i}}^{2}}{a}&\displaystyle -&\displaystyle \frac{\left( \:\displaystyle\sum_{i=1}^{N}x_{i}\right)^{2}}{Na}\\ &\displaystyle &\displaystyle = \frac{(24)^{2}+(12)^{2}+ \cdots +(19)^{2}}{4}-\frac{(127)^{2}}{(6)(4)} = 56.2083\;, \end{array} \end{aligned} $$

the sum-of-squares for Factor A is

$$\displaystyle \begin{aligned} \begin{array}{rcl} \mathit{SS}_{\text{A}} = \frac{\displaystyle\sum_{j=1}^{a}T_{A_{j}}^{2}}{N}&\displaystyle -&\displaystyle \frac{\left( \:\displaystyle\sum_{i=1}^{N}x_{i}\right)^{2}}{Na}\\ &\displaystyle &\displaystyle = \frac{(46)^{2}+(15)^{2}+(26)^{2}+(40)^{2}}{6}-\frac{(127)^{2}}{(6)(4)} = 97.4583\;, \end{array} \end{aligned} $$

the sum-of-squares Within Subjects (WS) is

$$\displaystyle \begin{aligned} \mathit{SS}_{\text{WS}} = \mathit{SS}_{\text{Total}}-\mathit{SS}_{\text{BS}} = 168.9583-56.2083 = 112.7500\;, \end{aligned}$$

and the sum-of-squares Error is

$$\displaystyle \begin{aligned} \mathit{SS}_{\text{Error}} = \mathit{SS}_{\text{A}{\times}\text{S}} = \mathit{SS}_{\text{WS}}-\mathit{SS}_{\text{A}} = 112.7500-97.4583 = 15.2917\;. \end{aligned}$$
Table 7.44 Example data for Case 1, Form 1, prepared for an analysis of variance with N = 6 subjects (S) and k = 4 judges (A)

The analysis of variance source table is given in Table 7.45. For Case 1, Form 1, the intraclass correlation coefficient is given by

$$\displaystyle \begin{aligned} \text{ICC(1, 1)} = \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{WS}}}{\mathit{MS}_{\text{BS}}+(a-1)\mathit{MS}_{\text{WS}}} = \frac{11.2417-6.2639}{11.2417+(4-1)(6.2639)} = +0.1657\;. \end{aligned}$$

Table 7.45 Analysis of variance source table for the data given in Table 7.44 with k = 4 judges and N = 6 subjects

Case 1, Form k: ICC(1,k)

If each judge is replaced with a group of k judges, such as a team of clinicians, and the score is the average score of the k judges, then for Case 1, Form k, the intraclass correlation coefficient is

$$\displaystyle \begin{aligned} \text{ICC(1, }k\text{)} = \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{WS}}}{\mathit{MS}_{\text{BS}}} = \frac{11.2417-6.2639}{11.2417} = +0.4428\;.\end{aligned} $$

Case 2, Form 1: ICC(2, 1)

If the same set of k judges rate each subject and the k judges are considered a random sample from a population of potential judges, then the intraclass correlation coefficient is designated ICC(2, 1). Because this is the most common case/form, it is usually designated simply as r I in the literature.

$$\displaystyle \begin{aligned} \begin{array}{rcl} \text{ICC(2, 1)} &\displaystyle =&\displaystyle \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{A} {\times} \text{S}}}{\mathit{MS}_{\text{BS}}+(a-1)\mathit{MS}_{\text{A} {\times} \text{S}}+\displaystyle\frac{a(\mathit{MS}_{\text{A}}-\mathit{MS}_{\text{A} {\times} \text{S}})}{N}}\\ &\displaystyle =&\displaystyle \frac{11.2417-1.0194}{11.2417+(4-1)(1.0194)+\displaystyle\frac{(4)(32.4861-1.0194)}{6}} = +0.2898\;. \end{array} \end{aligned} $$

Case 2, Form k: ICC(2, k)

If each judge is replaced with a team of k judges, and the score is the average score of the k judges, then for Case 2, Form k, the intraclass correlation coefficient is

$$\displaystyle \begin{aligned} \text{ICC(2, }k\text{)} = \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{A} {\times} \text{S}}}{\mathit{MS}_{\text{BS}}+\displaystyle\frac{\mathit{MS}_{\text{A}}-\mathit{MS}_{\text{A} {\times} \text{S}}}{N}} = \frac{11.2417-1.0194}{11.2417+\displaystyle\frac{32.4861-1.0194}{6}} = +0.6200\;. \end{aligned}$$

Case 3, Form 1: ICC(3, 1)

Case 3, Form 1 is the same as Case 2, Form 1, except that the raters are considered as fixed, not random. For Case 3, Form 1, the intraclass correlation coefficient is

$$\displaystyle \begin{aligned} \text{ICC(3, 1)} = \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{A} {\times} \text{S}}}{\mathit{MS}_{\text{BS}}+(a-1)\mathit{MS}_{\text{A} {\times} \text{S}}} = \frac{11.2417-1.0194}{11.2417+(4-1)(1.0194)} = +0.7149\;. \end{aligned}$$

Case 3, Form k: ICC(3, k)

If each judge is replaced with a team of k judges and the teams are considered as fixed, not random, the intraclass correlation coefficient is

$$\displaystyle \begin{aligned} \text{ICC(3, }k\text{)} = \frac{\mathit{MS}_{\text{BS}}-\mathit{MS}_{\text{A} {\times} \text{S}}}{\mathit{MS}_{\text{BS}}} = \frac{11.2417-1.0194}{11.2417} = +0.9093\;. \end{aligned}$$
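All six coefficients can be computed from the same mean squares. The sketch below assumes a complete N × k matrix of ratings and the standard Shrout and Fleiss [50] forms summarized above; the example matrix is hypothetical, not the data of Table 7.43.

```python
import numpy as np

def icc_all_forms(x):
    """Return the six intraclass correlation coefficients for an
    N-subjects by k-judges matrix x, via the usual mean squares."""
    x = np.asarray(x, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_bs = k * ((x.mean(axis=1) - grand) ** 2).sum()     # between subjects
    ss_a = n * ((x.mean(axis=0) - grand) ** 2).sum()      # between judges
    ss_ws = ((x - grand) ** 2).sum() - ss_bs              # within subjects
    ss_err = ss_ws - ss_a                                 # judge-by-subject
    ms_bs, ms_ws = ss_bs / (n - 1), ss_ws / (n * (k - 1))
    ms_a, ms_err = ss_a / (k - 1), ss_err / ((n - 1) * (k - 1))
    return {
        "ICC(1,1)": (ms_bs - ms_ws) / (ms_bs + (k - 1) * ms_ws),
        "ICC(1,k)": (ms_bs - ms_ws) / ms_bs,
        "ICC(2,1)": (ms_bs - ms_err) /
                    (ms_bs + (k - 1) * ms_err + k * (ms_a - ms_err) / n),
        "ICC(2,k)": (ms_bs - ms_err) / (ms_bs + (ms_a - ms_err) / n),
        "ICC(3,1)": (ms_bs - ms_err) / (ms_bs + (k - 1) * ms_err),
        "ICC(3,k)": (ms_bs - ms_err) / ms_bs,
    }

# Hypothetical 6-by-4 ratings matrix (not the data of Table 7.43).
ratings = np.array([[7, 5, 6, 8],
                    [3, 2, 4, 4],
                    [5, 4, 5, 7],
                    [9, 7, 8, 9],
                    [4, 3, 5, 6],
                    [6, 4, 6, 7]])
print(icc_all_forms(ratings))
```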

7.8.1 Example

For another example of the intraclass correlation coefficient, consider Case 2, Form 1, the most common in the literature, with k judges randomly selected from a pool of potential judges. Table 7.46 contains data for k = 3 judges and N = 5 subjects. Table 7.47 contains the analysis of variance source table for the data given in Table 7.46. Given the analysis of variance source table in Table 7.47, the intraclass correlation coefficient is

Table 7.46 Example data for Case 2, Form 1, with N = 5 subjects (S) and k = 3 judges (A)
Table 7.47 Analysis of variance source table for the data given in Table 7.46 with k = 3 judges and N = 5 subjects

7.8.2 A Permutation Analysis

Permutation analyses are completely data-dependent and do not depend on random sampling or on fixed- or random-effects models. For the data given in Table 7.46 for k = 3 judges and N = 5 subjects, there are only

$$\displaystyle \begin{aligned} M = \big( k! \big)^{N} = \big( 3! \big)^{5} = 7{,}776 \end{aligned}$$

possible, equally-likely arrangements in the reference set of all permutations of the observed data, making an exact permutation analysis possible. If r o denotes the observed value of r I, the exact upper-tail probability value of the observed value of r I is

$$\displaystyle \begin{aligned} P \big( r_{\text{I}} \geq r_{\text{o}}|H_{0} \big) = \frac{\text{number of }r_{\text{I}}\text{ values } \geq r_{\text{o}}}{M} = \frac{24}{7{,}776} = 0.0031\;. \end{aligned}$$
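The reference set is generated by permuting each subject's k ratings independently, giving (k!)^N equally-likely arrangements. The sketch below enumerates that set and uses ICC(2, 1) as the observed r I, an assumption consistent with the Case 2, Form 1 example above; the ratings matrix is hypothetical, since the values of Table 7.46 are not reproduced here.

```python
import numpy as np
from itertools import permutations, product

def icc21(x):
    """ICC(2, 1) computed from the usual mean squares of an N-by-k matrix."""
    x = np.asarray(x, dtype=float)
    n, k = x.shape
    grand = x.mean()
    ss_bs = k * ((x.mean(axis=1) - grand) ** 2).sum()
    ss_a = n * ((x.mean(axis=0) - grand) ** 2).sum()
    ss_err = ((x - grand) ** 2).sum() - ss_bs - ss_a
    ms_bs = ss_bs / (n - 1)
    ms_a = ss_a / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))
    return (ms_bs - ms_err) / (ms_bs + (k - 1) * ms_err + k * (ms_a - ms_err) / n)

def exact_upper_tail_pvalue(x):
    """P(r_I >= r_o | H0) over all (k!)**N within-subject orderings."""
    x = np.asarray(x, dtype=float)
    r_obs = icc21(x)
    rows = [list(permutations(row)) for row in x]     # k! orderings per subject
    count = m = 0
    for arrangement in product(*rows):                # (k!)**N arrangements
        count += icc21(np.array(arrangement)) >= r_obs - 1e-12
        m += 1
    return count / m

# Hypothetical ratings for N = 5 subjects by k = 3 judges (not Table 7.46).
x = np.array([[2, 3, 4], [4, 5, 5], [6, 6, 7], [3, 4, 3], [8, 7, 9]])
print(exact_upper_tail_pvalue(x))    # enumerates all 7,776 arrangements
```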

7.8.3 Interclass and Intraclass Linear Correlation

In the special case of k = 2 the relationship between the Pearson product-moment (interclass) correlation coefficient and the Pearson intraclass correlation coefficient can easily be demonstrated. Given k = 2 judges, the value of the intraclass correlation depends in part upon the corresponding Pearson product-moment correlation, but it also depends upon the differences between the means and standard deviations of the two variables. Thus,

$$\displaystyle \begin{aligned} r_{\text{I}} = \frac{\left[ \left( \sigma_{x}^{2}+\sigma_{y}^{2} \right)-\left( \sigma_{x}-\sigma_{y} \right)^{2}\right]\,r_{xy}-\left( \bar{x}-\bar{y} \right)^{2}/2}{(\sigma_{x}^{2}+\sigma_{y}^{2})+\left( \bar{x}-\bar{y} \right)^{2}/2}\;, \end{aligned}$$

where \(\bar {x}\) and \(\bar {y}\) denote the means, \(\sigma _{x}^{2}\) and \(\sigma _{y}^{2}\) the variances, and r xy the Pearson product-moment correlation of variables x and y. Thus, for the bivariate data given in Table 7.38 on p. 59, replicated in Table 7.48 for convenience,

$$\displaystyle \begin{aligned} \bar{x} = 3.00\;, \quad \bar{y} = 5.00\;, \quad \sigma_{x} = \sigma_{y} = 1.4142\;, \quad \sigma_{x}^{2} = \sigma_{y}^{2} = 2.00\;, \end{aligned}$$

r xy = +0.80, and

$$\displaystyle \begin{aligned} r_{\text{I}} = \frac{\left[ \left( 2.00+2.00 \right)-\left( 1.4142-1.4142 \right)^{2}\right](+0.80)-\left( 3.00-5.00 \right)^{2}/2}{\left( 2.00+2.00 \right)+\left( 3.00-5.00 \right)^{2}/2} = \frac{3.20-2.00}{4.00+2.00} = +0.20\;, \end{aligned}$$

the same value found with 2N pairs of observations.

Table 7.48 Example bivariate correlation data on N = 5 subjects

7.9 Coda

Chapter 7 applied permutation statistical methods to measures of association for two variables at the interval level of measurement. Included in Chap. 7 were discussions of ordinary least squares (OLS) regression, least absolute deviation (LAD) regression, multivariate multiple regression, point-biserial correlation, biserial correlation, intraclass correlation, and Fisher’s z transform for skewed distributions.

Chapter 8 applies exact and Monte Carlo resampling permutation statistical methods to measures of association for two variables at different levels of measurement, e.g., a nominal-level variable and an ordinal-level variable, a nominal-level variable and an interval-level variable, and an ordinal-level variable and an interval-level variable. Included in Chap. 8 are permutation statistical methods applied to Freeman’s θ, Agresti’s \(\hat {\delta }\), Piccarreta’s \(\hat {\tau }\), Whitfield’s S, Cureton’s r rb, Pearson’s η 2, Kelley’s 𝜖 2, Hays’ \(\hat {\omega }^{2}\), and Jaspen’s multiserial correlation coefficient.